Glossary

ARCore

ARCore is Google's platform for building augmented reality experiences on Android, offering motion tracking, environmental understanding, and light estimation.

SPATIAL COMPUTING ARCHITECTURE

What is ARCore?

ARCore is Google's foundational platform for building augmented reality experiences on Android, enabling devices to understand and interact with the physical world.

ARCore is Google's software development kit (SDK) for creating augmented reality applications on Android. It provides three core capabilities: motion tracking to understand the device's position relative to the world, environmental understanding to detect horizontal and vertical surfaces, and light estimation to match the lighting of virtual objects to their surroundings. These features allow developers to anchor digital content convincingly within the user's physical environment.

As a spatial computing architecture, ARCore operates by fusing visual data from the camera with inertial readings from the device's IMU (Inertial Measurement Unit) in a process akin to Visual-Inertial Odometry (VIO). It builds a sparse point cloud of the environment for tracking and can perform plane detection for content placement. This on-device processing enables robust, markerless AR without requiring specialized hardware, forming the perceptual foundation for applications ranging from interactive gaming to practical digital twin visualization.

SPATIAL COMPUTING PLATFORM

Core Capabilities of ARCore

ARCore is Google's platform for building augmented reality experiences on Android, enabling digital content to interact with the real world through three foundational pillars: motion tracking, environmental understanding, and light estimation.

Motion Tracking

ARCore uses Visual-Inertial Odometry (VIO) to understand the device's position and orientation relative to the world. It tracks feature points across the camera feed and fuses this with data from the device's Inertial Measurement Unit (IMU) to estimate its 6DoF pose (position and orientation) with high precision. This allows virtual objects to remain anchored in place as the user moves.

Key Technology: Combines camera-based feature tracking with gyroscope and accelerometer data.
Primary Output: A continuous stream of device pose updates.
Challenge Solved: Maintains stable AR content placement even during rapid device movement.
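To make the idea of anchoring concrete, the sketch below expresses a fixed world-space point in the device's local frame as the pose changes. It is a simplified illustration, not ARCore code: the pose is reduced to a yaw angle plus a translation (a full 6DoF pose would carry a complete 3D rotation), and the names `make_pose` and `world_to_device` are hypothetical.

```python
import math

def make_pose(yaw_deg, tx, ty, tz):
    """Device pose in world space: rotation about the vertical (y) axis plus a translation."""
    return (math.radians(yaw_deg), (tx, ty, tz))

def world_to_device(pose, point):
    """Express a world-space point in the device's local frame (inverse of the pose)."""
    yaw, (tx, ty, tz) = pose
    # Undo the translation first, then undo the rotation about y.
    dx, dy, dz = point[0] - tx, point[1] - ty, point[2] - tz
    c, s = math.cos(-yaw), math.sin(-yaw)
    return (c * dx + s * dz, dy, -s * dx + c * dz)

# An anchored object two meters in front of the starting position.
anchor = (0.0, 0.0, -2.0)
start = make_pose(0, 0.0, 0.0, 0.0)
stepped_right = make_pose(0, 1.0, 0.0, 0.0)
in_start = world_to_device(start, anchor)
in_stepped = world_to_device(stepped_right, anchor)
```

The anchor's world coordinates never change; only its coordinates relative to the device do, which is why a renderer that redraws from each fresh pose sees the object stay put.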

Environmental Understanding

This capability allows ARCore to detect and interpret the geometry of the physical world. It identifies feature points and uses them to detect horizontal and vertical surfaces through plane detection. This creates a sparse understanding of the environment, enabling apps to place virtual objects on real surfaces like floors, tables, and walls.

Plane Detection: Identifies flat surfaces (e.g., ground planes, vertical walls).
Feature Points: Distinct visual points used to understand surface geometry.
Application: Essential for placing objects on real surfaces; realistic occlusion additionally needs per-pixel depth, which the Depth API supplies.
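As a rough illustration of how feature points can turn into a plane, the sketch below groups 3D points by height and reports the best-supported horizontal surface. This is a toy heuristic under stated assumptions (y is up, units are meters, the function name and thresholds are made up for the example); it is not the algorithm ARCore uses internally.

```python
from collections import defaultdict

def detect_horizontal_plane(points, bin_size=0.05, min_points=4):
    """Group points by height (y) and return (height, extent) of the
    best-supported horizontal plane, or None if support is too weak."""
    bins = defaultdict(list)
    for p in points:
        # Points whose heights fall in the same bin vote for the same plane.
        bins[round(p[1] / bin_size)].append(p)
    best = max(bins.values(), key=len, default=[])
    if len(best) < min_points:
        return None
    height = sum(p[1] for p in best) / len(best)
    xs = [p[0] for p in best]
    zs = [p[2] for p in best]
    # Extent is an axis-aligned bounding box: (min_x, max_x, min_z, max_z).
    return height, (min(xs), max(xs), min(zs), max(zs))

floor_points = [(0, 0.0, 0), (1, 0.01, 0), (0, -0.01, 1), (1, 0.0, 1), (0.5, 0.0, 0.5)]
clutter = [(0.2, 0.7, 0.3), (0.9, 1.3, 0.1)]
plane = detect_horizontal_plane(floor_points + clutter)
```

A production system would also fit tilted and vertical planes, merge planes over time, and track a polygonal boundary rather than a bounding box.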

Light Estimation

ARCore analyzes the camera image to estimate the environment's current lighting conditions. In its basic mode it provides the average pixel intensity and a color correction for the ambient light; in Environmental HDR mode it also estimates a main directional light, which lets virtual objects cast believable shadows and show lighting that matches the real world. This markedly improves the visual coherence and realism of AR scenes.

Process: Samples the camera feed to determine ambient light color and intensity.
Output: Ambient intensity and color correction in the basic mode; a main directional light, ambient spherical harmonics, and an HDR cubemap in Environmental HDR mode.
Benefit: Virtual objects appear to be lit by the same light sources as the physical environment.
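The sampling step can be pictured with a small sketch that averages camera pixels into an ambient intensity and a per-channel color correction, then applies both to a virtual object's base color. The functions `estimate_ambient_light` and `shade` are hypothetical, and the normalization used here (each channel relative to the overall mean) is an assumption chosen for clarity, not ARCore's documented formula.

```python
def estimate_ambient_light(pixels):
    """pixels: iterable of (r, g, b) floats in [0, 1].
    Returns (intensity, color_correction)."""
    pixels = list(pixels)
    n = len(pixels)
    mean = [sum(p[c] for p in pixels) / n for c in range(3)]
    intensity = sum(mean) / 3.0
    # Scale each channel relative to the average, so a neutral scene gives (1, 1, 1).
    correction = tuple(m / intensity if intensity > 0 else 1.0 for m in mean)
    return intensity, correction

def shade(albedo, intensity, correction):
    """Tint and dim a virtual object's base color to match the estimate."""
    return tuple(min(1.0, a * intensity * k) for a, k in zip(albedo, correction))

# A warm, fairly dim scene: red dominates, blue is weak.
intensity, correction = estimate_ambient_light([(0.8, 0.4, 0.3), (0.8, 0.4, 0.3)])
lit_white = shade((1.0, 1.0, 1.0), intensity, correction)
```

A white virtual object rendered with these values picks up the same warm cast as the camera image instead of appearing pasted on.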

Depth API

The Depth API provides a per-pixel depth map of the scene, enabling sophisticated environmental interactions. Using motion stereo from a single moving camera or data from a dedicated time-of-flight (ToF) sensor, it calculates the distance from the device to real-world surfaces. This enables occlusion (virtual objects passing behind real ones), realistic physics, and surface-aware interactions.

Technology: Primarily uses motion parallax; can utilize hardware depth sensors.
Key Use Case: Enabling accurate occlusion for immersive AR.
Example: A virtual ball can roll behind a real couch.
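Depth-based occlusion comes down to a per-pixel comparison: a virtual fragment is drawn only where it is closer to the camera than the real surface at that pixel. The sketch below shows that test on plain nested lists; the function name and data layout are assumptions made for illustration, and a real renderer would run the same comparison in a shader.

```python
def composite_with_occlusion(camera, virtual, real_depth, virtual_depth):
    """All arguments are equally sized 2D lists.
    virtual holds a color, or None where nothing virtual is drawn.
    Depths are distances from the camera in meters."""
    out = []
    for y, row in enumerate(camera):
        out_row = []
        for x, cam_px in enumerate(row):
            v = virtual[y][x]
            # Show the virtual pixel only if it lies in front of the real surface.
            visible = v is not None and virtual_depth[y][x] < real_depth[y][x]
            out_row.append(v if visible else cam_px)
        out.append(out_row)
    return out

# One image row: the ball (2 m away) covers the last two pixels,
# but a couch at 1 m hides it in the final pixel.
frame = composite_with_occlusion(
    camera=[["room", "room", "couch"]],
    virtual=[[None, "ball", "ball"]],
    real_depth=[[3.0, 3.0, 1.0]],
    virtual_depth=[[0.0, 2.0, 2.0]],
)
```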

Cloud Anchors & Persistent Cloud Anchors

These features enable shared and persistent AR experiences. Cloud Anchors allow multiple users to view and interact with the same virtual object in the same physical location by resolving local maps to a common coordinate frame in the cloud. Persistent Cloud Anchors extend this by allowing anchors to be saved and retrieved across different sessions, even days later, enabling location-based AR applications.

Shared Experiences: Multiple devices see the same content in the same place.
Persistence: Anchors can be saved to the cloud and reloaded in future sessions.
Infrastructure: Relies on Google's cloud services for map hosting and resolution.
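The "common coordinate frame" idea can be sketched without any cloud service: each device has its own local frame, both know where the shared anchor sits in that frame, and content is stored relative to the anchor. The example below uses 2D top-down poses `(x, z, yaw)` for brevity; the function names are hypothetical and the localization step that produces the anchor poses is assumed to have already happened.

```python
import math

def local_to_anchor(anchor_pose, point):
    """Convert a point from a device's local frame into the anchor's frame."""
    x, z, yaw = anchor_pose
    dx, dz = point[0] - x, point[1] - z
    c, s = math.cos(yaw), math.sin(yaw)
    return (c * dx + s * dz, -s * dx + c * dz)

def anchor_to_local(anchor_pose, offset):
    """Convert an anchor-relative offset into a device's local frame."""
    x, z, yaw = anchor_pose
    c, s = math.cos(yaw), math.sin(yaw)
    return (x + c * offset[0] - s * offset[1], z + s * offset[0] + c * offset[1])

def transfer(point_a, anchor_in_a, anchor_in_b):
    """Map a point from device A's frame to device B's frame via the shared anchor."""
    return anchor_to_local(anchor_in_b, local_to_anchor(anchor_in_a, point_a))

# Device A sees the anchor at (1, 2) unrotated; device B sees it at its origin, rotated 90 degrees.
point_in_b = transfer((2.0, 2.0), (1.0, 2.0, 0.0), (0.0, 0.0, math.pi / 2))
```

Because only the anchor-relative offset is shared, neither device needs to know anything about the other's local coordinate system.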

Augmented Faces & Augmented Images

These are specialized tracking modes for specific use cases. Augmented Faces uses a front-facing camera and a 3D face mesh to attach virtual content like masks, makeup, or accessories to a user's face in real-time. Augmented Images allows apps to detect and track 2D images (like posters or product packaging) and attach virtual content to them, useful for interactive marketing or manuals.

Augmented Faces: Provides a 468-point 3D face mesh for precise attachment regions.
Augmented Images: Reference images can be compiled into an image database ahead of time or added to it at runtime.
Specialization: Optimized, high-performance tracking for targeted scenarios.

SPATIAL COMPUTING ARCHITECTURE

How ARCore Works: The Technical Pipeline

ARCore, Google's platform for Android augmented reality, operates through a real-time pipeline that fuses sensor data to understand and interact with the physical world.

ARCore's pipeline begins with motion tracking, which uses the device's camera and Inertial Measurement Unit (IMU) to estimate its 6DoF pose in real time. It identifies visual feature points across frames and fuses this data with gyroscope and accelerometer readings via sensor fusion, creating a stable coordinate system for virtual content. This process is a form of Visual-Inertial Odometry (VIO), a core component of Visual SLAM systems.

Concurrently, environmental understanding detects horizontal and vertical planes, like floors and walls, through plane detection. Light estimation analyzes the camera image to match the lighting of virtual objects to the real scene. For advanced geometry, depth mapping uses the device's sensors to create a real-time depth map, enabling occlusion and more realistic interactions. These components collectively enable spatial mapping and scene understanding for persistent AR experiences.

SPATIAL COMPUTING ARCHITECTURES

ARCore in the Development Ecosystem

ARCore is Google's platform for building augmented reality experiences on Android, offering motion tracking, environmental understanding, and light estimation. This section details its core technical subsystems and their role in the spatial computing stack.

Motion Tracking & Visual-Inertial Odometry (VIO)

ARCore's foundational capability is 6DoF pose estimation through Visual-Inertial Odometry (VIO). It fuses data from the device's camera and Inertial Measurement Unit (IMU) to track the phone's position and orientation in real-time.

Process: Identifies feature points in the camera feed and tracks them across frames, using IMU data to maintain accuracy during fast motion or poor lighting.
Output: Continuously provides an updated camera pose for each frame, enabling virtual objects to remain anchored as the device moves through space.
Key Benefit: Enables persistent AR content placement without markers or pre-scanned environments.
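The value of fusing the two sensor streams shows up even in a one-dimensional toy. The sketch below uses a complementary filter, a far simpler technique than the estimators used in production VIO, to blend a gyroscope-integrated yaw with an occasional camera-derived yaw. The function name, the `alpha` weight, and the omission of angle wraparound are all simplifying assumptions.

```python
def fuse_yaw(yaw, gyro_rate, dt, visual_yaw=None, alpha=0.98):
    """One filter step. yaw and visual_yaw are in radians, gyro_rate in rad/s.
    Without a visual measurement, the gyro prediction is used as-is."""
    predicted = yaw + gyro_rate * dt  # fast but drifts if the gyro is biased
    if visual_yaw is None:
        return predicted
    # Pull the prediction slightly toward the drift-free visual estimate.
    return alpha * predicted + (1 - alpha) * visual_yaw

# A stationary device whose gyro reports a small constant bias of 0.01 rad/s.
yaw_gyro_only = 0.0
yaw_fused = 0.0
for _ in range(1000):  # 10 seconds at 100 Hz
    yaw_gyro_only = fuse_yaw(yaw_gyro_only, 0.01, 0.01)
    yaw_fused = fuse_yaw(yaw_fused, 0.01, 0.01, visual_yaw=0.0)
```

The gyro-only estimate keeps drifting for as long as the loop runs, while the fused estimate settles at a small bounded error. Dropping the visual input for a few steps models the fast-motion case, where the IMU carries the estimate until features are reacquired.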

Environmental Understanding & Plane Detection

This subsystem interprets the geometry of the physical world. Using feature points and depth data (when available), ARCore performs plane detection to identify flat, horizontal, and vertical surfaces like floors, tables, and walls.

Mechanism: Clusters feature points into large, connected planes and provides their boundaries and pose.
Application: Essential for placing virtual objects that appear to rest on real surfaces. This data can feed into higher-level scene understanding or spatial mapping.
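Advanced Output: Where depth data is available, apps can go beyond planes and build a coarse mesh of surfaces from it for occlusion and physics; the core session APIs expose planes, feature points, and depth images rather than a ready-made world mesh.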
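Placing an object on a detected surface usually starts with a hit test: cast a ray from the camera through the tapped pixel and intersect it with the plane. The sketch below handles the horizontal-plane case with y as the up axis; the function name is hypothetical and the check against the plane's boundary polygon is left out.

```python
def hit_test_horizontal_plane(ray_origin, ray_dir, plane_height):
    """Intersect a ray with the horizontal plane y = plane_height.
    Returns the hit point, or None if the ray is parallel to the plane
    or the plane lies behind the ray origin."""
    oy, dy = ray_origin[1], ray_dir[1]
    if abs(dy) < 1e-9:
        return None  # ray runs parallel to the plane
    t = (plane_height - oy) / dy
    if t <= 0:
        return None  # intersection is behind the camera
    return tuple(o + t * d for o, d in zip(ray_origin, ray_dir))

# Camera held 1.5 m above the floor, looking forward and down at 45 degrees.
hit = hit_test_horizontal_plane((0.0, 1.5, 0.0), (0.0, -1.0, -1.0), 0.0)
```

The returned point is where an anchor would be created so the object appears to rest on the surface.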

Light Estimation

For virtual objects to appear believably integrated, they must match the ambient lighting. ARCore's light estimation analyzes the camera image to determine the environment's average intensity and color correction.

Function: Provides a dominant directional light source (often mimicking the main light in the scene) and ambient spherical harmonics.
Result: Virtual objects cast consistent shadows and exhibit accurate specular highlights, dramatically increasing visual coherence.
Evolution: Earlier versions provided simple ambient intensity; modern implementations offer more sophisticated HDR lighting estimation for higher fidelity.
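How a renderer might consume a directional light estimate can be shown with basic Lambertian shading. In this sketch the ambient spherical harmonics are replaced by a single constant ambient term, which is a deliberate simplification, and the function name and parameters are illustrative only. Both vectors are assumed to be unit length.

```python
def lambert(normal, to_light, light_intensity, ambient):
    """Diffuse brightness of a surface.
    normal: unit surface normal. to_light: unit vector pointing toward the light."""
    n_dot_l = sum(n * l for n, l in zip(normal, to_light))
    # Surfaces facing away from the light receive only the ambient term.
    return ambient + light_intensity * max(0.0, n_dot_l)

up = (0.0, 1.0, 0.0)
overhead_light = (0.0, 1.0, 0.0)
top_face = lambert(up, overhead_light, light_intensity=0.8, ambient=0.2)
bottom_face = lambert((0.0, -1.0, 0.0), overhead_light, light_intensity=0.8, ambient=0.2)
```

Feeding the estimated light direction into this term is what makes the lit side of a virtual object agree with the lit side of nearby real objects.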

Depth API & Scene Reconstruction

ARCore's Depth API provides a dense depth map in real time on supported devices. It needs only a single RGB camera, deriving depth from motion, and uses a time-of-flight sensor to improve quality where one is present. This enables advanced interactions and detailed 3D scene reconstruction.

Capabilities: Allows virtual objects to occlude behind real-world geometry and enables physics interactions with complex surfaces.
Technical Basis: Creates a point cloud or depth image that can be used for surface reconstruction, moving beyond simple plane detection to understand complex geometry.
Use Case: Critical for applications like measuring real objects, scanning environments for digital twins, or creating immersive occlusion effects.
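Turning a depth image into a point cloud is a back-projection through the pinhole camera model. The sketch below assumes depth in meters, intrinsics `fx`, `fy`, `cx`, `cy` in pixels, and a camera looking along +z (the common computer-vision convention, which differs from the -z forward axis used by OpenGL-style renderers). The function name is hypothetical.

```python
def depth_to_points(depth, fx, fy, cx, cy):
    """Back-project a 2D depth image (rows of distances in meters)
    into a list of (x, y, z) points in the camera frame."""
    points = []
    for v, row in enumerate(depth):
        for u, z in enumerate(row):
            if z <= 0:
                continue  # zero or negative depth marks a missing measurement
            # Pinhole model: pixel offset from the principal point scales with depth.
            points.append(((u - cx) * z / fx, (v - cy) * z / fy, z))
    return points

# A 2x2 depth image with two valid pixels.
cloud = depth_to_points([[0.0, 2.0], [1.0, 0.0]], fx=1.0, fy=1.0, cx=0.5, cy=0.5)
```

Measuring a real object then reduces to taking the distance between two of these points, and accumulating clouds from many poses is the starting point for surface reconstruction.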

Cloud Anchors & Persistent AR

ARCore Cloud Anchors enable multi-user, persistent AR experiences by creating spatial anchors that can be resolved by different devices at different times.

Process: The device uploads visual features from its environment to Google's cloud. The cloud processes this data to create a unique anchor that other devices can later recognize and localize against.
Function: Solves the problem of shared frame-of-reference, enabling collaborative AR apps and experiences that persist across sessions (persistent AR).
Underlying Tech: Relies on visual recognition and large-scale feature matching rather than GPS, providing room-scale precision.

Integration with the Android Sensor Stack

ARCore is not a standalone sensor but a sensor fusion platform built on Android's camera and sensor stack. It coordinates the camera, IMU, and other sensors.

Synchronization: Precisely time-aligns camera frames with high-frequency IMU gyroscope and accelerometer readings, which is critical for robust VIO.
Calibration: Manages device-specific camera intrinsics and IMU-camera extrinsics (their relative position) to ensure accurate measurements.
System Resource Management: Dynamically adjusts CPU/GPU usage and camera parameters to balance AR performance with device battery life and thermal constraints.
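Time alignment matters because the IMU samples far more often than the camera and the two clocks rarely tick together. One simple way to get an IMU reading "at" a camera frame's timestamp is linear interpolation between the neighboring samples, sketched below. The function name and the scalar sample values are assumptions for illustration; real pipelines typically integrate all IMU samples between frames rather than interpolating a single value.

```python
from bisect import bisect_left

def imu_at(timestamps, values, t):
    """Linearly interpolate sorted IMU samples at time t.
    Returns None if t falls outside the sampled range."""
    if not timestamps or t < timestamps[0] or t > timestamps[-1]:
        return None
    i = bisect_left(timestamps, t)  # index of the first sample at or after t
    if timestamps[i] == t:
        return values[i]
    t0, t1 = timestamps[i - 1], timestamps[i]
    w = (t - t0) / (t1 - t0)
    return values[i - 1] + w * (values[i] - values[i - 1])

# Gyro samples every 5 ms; the camera frame lands between two of them.
gyro_times = [0.000, 0.005, 0.010]
gyro_rates = [0.0, 1.0, 3.0]
rate_at_frame = imu_at(gyro_times, gyro_rates, 0.0075)
```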

SPATIAL COMPUTING PLATFORMS

ARCore vs. ARKit: A Technical Comparison

A feature-by-feature comparison of Google's ARCore and Apple's ARKit, the dominant SDKs for building augmented reality applications on mobile devices.

Core Feature / Metric	ARCore (Google)	ARKit (Apple)	Primary Use Case
Primary Platform	Android (7.0+ / API Level 24+)	iOS (11.0+)	Mobile AR Development
Core Tracking Method	Visual-Inertial Odometry (VIO)	Visual-Inertial Odometry (VIO)	6DoF device pose estimation
Environmental Understanding	Plane detection (horizontal & vertical), feature points	Plane detection (horizontal & vertical), feature points	Virtual object placement & occlusion
Light Estimation	Environmental HDR light estimation	Environmental lighting with intensity & color temperature	Realistic virtual object shading
Depth API / LiDAR Integration	Depth API (depth from motion; ToF sensor on supported devices)	Scene Geometry API & LiDAR Scanner (hardware on Pro devices)	Occlusion, physics, mesh generation
Cloud Anchors / Persistence	Cloud Anchors (cross-platform)	Persistent World Tracking & Collaborative Sessions	Shared multi-user & persistent AR experiences
Face Tracking	Augmented Faces API (front-facing camera, 468-point face mesh)	TrueDepth Camera system (front-facing)	Selfie filters & facial expression analysis
Image Tracking	Augmented Images API	Image Tracking & Detection	Triggering AR from 2D markers or pictures
Object Tracking	No dedicated 3D object tracking API	Object Scanning & Detection	Placing AR content on/around specific 3D objects
Motion Tracking	6DoF	6DoF world tracking; 3DoF orientation-only configuration	Device orientation & position tracking
World Mapping / Meshing	Generates feature points & planes; mesh via Depth API	Generates coarse world mesh (enhanced with LiDAR)	Environmental understanding for occlusion & physics
People Occlusion	via Depth API on supported hardware	People Occlusion (with LiDAR or A12+ Bionic)	Realistic AR content interaction with people
Development Language	Java, Kotlin, C/C++/NDK, Unity, Unreal	Swift, Objective-C, C++, Unity, Unreal	Native & game engine SDK integration
Key Hardware Dependency	Motion & camera sensors; Google Play Services for AR	A9+ chip (iOS 11), A12+ for advanced features	Performance & feature availability

ARCORE

Frequently Asked Questions

ARCore is Google's platform for building augmented reality experiences on Android. This FAQ addresses common technical questions for developers and architects implementing spatial computing solutions.

What is ARCore and how does it work?

ARCore is Google's software development kit (SDK) for building augmented reality (AR) applications on Android devices. It works by using a process called concurrent odometry and mapping (COM) to understand the device's position relative to the world. ARCore uses the phone's camera to detect visually distinct feature points, fuses this data with readings from the Inertial Measurement Unit (IMU) for robust motion tracking, and constructs a geometric understanding of flat surfaces and environmental lighting to enable realistic virtual object placement and interaction.

Key underlying processes include:

Motion Tracking: Estimates the device's 6DoF pose (position and orientation) in real-time.
Environmental Understanding: Detects horizontal and vertical surfaces like floors and walls.
Light Estimation: Assesses ambient lighting to correctly shade virtual objects.

SPATIAL COMPUTING ARCHITECTURES

Related Terms

ARCore operates within a broader ecosystem of spatial computing technologies. These related concepts define the underlying systems for mapping, understanding, and interacting with the physical world.

Simultaneous Localization and Mapping (SLAM)

Simultaneous Localization and Mapping (SLAM) is the foundational computational technique that enables ARCore's core functionality. It is the process by which a device constructs a map of an unknown environment while simultaneously tracking its own position within that map. ARCore implements a Visual-Inertial SLAM system that fuses camera images with inertial sensor data from the device's IMU.

Key Components: Feature tracking, sparse mapping, and pose estimation.
Contrast with ARCore: SLAM is the generic algorithmic family; ARCore is Google's specific, production-hardened implementation for Android, incorporating additional APIs for plane detection and light estimation.

Visual-Inertial Odometry (VIO)

Visual-Inertial Odometry (VIO) is the real-time pose estimation engine at the heart of ARCore's motion tracking. It is a sensor fusion technique that combines a continuous stream of visual data from the camera with high-frequency motion data from the Inertial Measurement Unit (IMU).

Purpose: To estimate the device's 6-degree-of-freedom (6DoF) position and orientation.
Advantage over pure vision: The IMU provides robust tracking during fast motion, temporary occlusion, or low-texture environments where visual features are scarce. ARCore's VIO system is optimized for power efficiency and accuracy on mobile System-on-a-Chip (SoC) architectures.

Spatial Anchor

A Spatial Anchor is a fixed point of reference in the real world that ARCore creates and tracks. It keeps virtual content precisely placed at the same physical location as ARCore refines its understanding of the environment; hosting the anchor as a Cloud Anchor additionally lets it be recalled across app sessions, even if the environment changes slightly.

Mechanism: ARCore generates a unique descriptor for the local geometry and visual features surrounding the anchor point.
Cloud Anchors: ARCore's Cloud Anchors service enables shared multi-user experiences by allowing anchors to be hosted online and resolved by other devices in the same location.
Use Case: Placing a virtual sculpture in a lobby that multiple visitors can see days later from their own devices.

Scene Understanding

Scene Understanding refers to ARCore's ability to parse the physical environment beyond simple geometry. It involves identifying semantic and functional properties of surfaces and objects.

Core Capabilities:
- Plane Detection: Identifying horizontal (floors, tables) and vertical (walls) surfaces.
- Depth API: Generating real-time depth maps using the device's camera(s) or time-of-flight sensor.
- Semantic Understanding: Per-pixel labeling of the camera image through the Scene Semantics API, which targets outdoor scenes (e.g., 'sky', 'building', 'road').
Purpose: This understanding allows virtual objects to interact realistically with the world—sitting on tables, occluding behind real objects, or bouncing on the floor.

ARKit

ARKit is Apple's counterpart framework to ARCore, providing augmented reality capabilities for iOS and iPadOS devices. It serves the same fundamental purpose but within Apple's ecosystem.

Technical Comparison: Both platforms offer motion tracking, plane detection, light estimation, and image anchoring. They differ in underlying sensor fusion implementations, specific API features (e.g., ARCore's Cloud Anchors vs. ARKit's RealityKit and Object Capture), and hardware optimization targets.
Developer Impact: The existence of these parallel platforms led to the creation of cross-platform SDKs like Unity's AR Foundation, which abstract the underlying native APIs; Google's ARCore Extensions for AR Foundation add ARCore-specific features on top.

OpenXR

OpenXR is a royalty-free, open standard from the Khronos Group that provides native access to a wide range of virtual reality and augmented reality hardware and platforms. It aims to reduce fragmentation in the XR industry.

Relationship to ARCore: While ARCore is a proprietary, platform-specific API for Android, OpenXR is a cross-platform standard. In theory, an application built against the OpenXR API could run on an ARCore-powered device if the device's runtime provides an OpenXR implementation.
Strategic Context: The adoption of OpenXR by major hardware and platform vendors represents a long-term industry trend towards standardized access, which could influence the evolution of platform-specific APIs like ARCore.

About the author

Prasad Kumkar

CEO & MD, Inference Systems

Prasad Kumkar is the CEO & MD of Inference Systems and writes about AI systems architecture, LLM infrastructure, model serving, evaluation, and production deployment. Over 5+ years, he has worked across computer vision models, L5 autonomous vehicle systems, and LLM research, with a focus on taking complex AI ideas into real-world engineering systems.

His work and writing cover AI systems, large language models, AI agents, multimodal systems, autonomous systems, inference optimization, RAG, evaluation, and production AI engineering.
