- ISO/IEC SC29 WG3 (MPEG Systems) - Scene Description Breakout Group
- Contacts
- Thomas Stockhammer (MPEG-I Scene Description BoG Chair, [email protected])
Based on ISO/IEC DIS 23090-14 2nd Edition
Written against the glTF 2.0 spec.
AR anchoring allows a scene, or some root nodes of a scene, to be anchored into the user’s physical environment to provide an AR experience. AR anchoring is supported at the scene level and at the node level through the definition of the MPEG_anchor extension. Multiple AR anchors are supported by adding the MPEG_anchor extension to several root nodes.
The MPEG_anchor extension consists of Trackable and Anchor objects, as provided in Table 1. The definitions of the Trackable and Anchor objects shall be provided within an MPEG_anchor extension at the glTF file level.
Table 1 – MPEG_anchor extension provided at the glTF file level

Name | Type | Required | Default | Description |
---|---|---|---|---|
trackables | MPEG_anchor.trackable [1-*] | ✓ Yes | N/A | Provides a list of trackables used by the anchor objects. |
anchors | MPEG_anchor.anchor [1-*] | ✓ Yes | N/A | Provides a list of anchors for a scene or for root nodes to enable AR anchoring. |
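For illustration, a minimal file-level declaration might look as follows. This is a sketch only: the index values are illustrative, and the trackable type is assumed to serialize as the numeric value assigned in Table 4 (here 0, i.e. TRACKABLE_FLOOR); the normative layout is given by MPEG_anchor.schema.json.

```json
{
  "extensionsUsed": ["MPEG_anchor"],
  "extensions": {
    "MPEG_anchor": {
      "trackables": [
        { "type": 0 }
      ],
      "anchors": [
        { "trackable": 0, "requiresAnchoring": true }
      ]
    }
  }
}
```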
The syntax and semantics of the Anchor object are provided in Table 2.

Table 2 – Definition of the Anchor object

Name | Type | Required | Default | Description |
---|---|---|---|---|
trackable | integer | ✓ Yes | N/A | Index of the trackable in the trackables array that will be used for this anchor. |
requiresAnchoring | boolean | ✓ Yes | N/A | Indicates if AR anchoring is required for the rendering of the associated nodes. If TRUE, the application shall skip the virtual assets attached to this anchor until the pose of this anchor in the real world is known. If FALSE, the application shall process the virtual assets attached to this anchor. |
minimumRequiredSpace | VEC3 | No | (0,0,0) | Space required to anchor the AR asset (width, height, and depth, in meters). This space corresponds to an axis-aligned bounding box placed at the origin of the trackable space: width is aligned with the x axis, height with the y axis, and depth with the z axis, expressed in the trackable's local space. This value shall be compared to the bounding box of the real-world available space associated with the trackable, as estimated by the XR runtime. |
aligned | enumeration | No | NOT_USED | The aligned flag may take one of the following values: NOT_USED=0, ALIGNED_NOTSCALED=1, ALIGNED_SCALED=2. If ALIGNED_SCALED is set, the bounding box of the virtual assets attached to this anchor is aligned and scaled to match the bounding box of the real-world available space associated with the trackable, as estimated by the XR runtime. |
actions | array(number) | No | N/A | Indices of the actions, in the actions array of the interactivity extension, to be executed once the pose of this anchor is determined. An example is a setTransform action that places the virtual assets attached to this anchor. |
light | integer | No | N/A | Reference to an item in the lights array of the MPEG_lights_texture_based extension. |
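As a sketch, an Anchor object combining the optional fields might be serialized as follows. The minimumRequiredSpace VEC3 is assumed to serialize as a three-element array, aligned as its numeric value (2, i.e. ALIGNED_SCALED), and the trackable and action indices are illustrative; MPEG_anchor.schema.json is normative.

```json
{
  "trackable": 0,
  "requiresAnchoring": true,
  "minimumRequiredSpace": [1.0, 2.0, 1.0],
  "aligned": 2,
  "actions": [0]
}
```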
The definition of the Trackable object is provided in Table 3.

Table 3 – Definition of the Trackable object

Name | Type | Required | Default | Description |
---|---|---|---|---|
type | enumeration | ✓ Yes | N/A | The type of the trackable, as defined in Table 4. |
if (type == TRACKABLE_CONTROLLER) { | | | | |
path | string | ✓ Yes | N/A | A path that describes the action space, as specified in clause 6.2 of the OpenXR specification. An example is “/user/hand/left/input”. |
} | | | | |
if (type == TRACKABLE_PLANE) { | | | | |
geometricConstraint | enumeration | ✓ Yes | N/A | The geometricConstraint flag may take one of the following values: HORIZONTAL_PLANE=0, VERTICAL_PLANE=1. |
} | | | | |
if (type == TRACKABLE_MARKER_2D OR type == TRACKABLE_MARKER_3D) { | | | | |
markerNode | number | ✓ Yes | N/A | Index of the node in the nodes array in which the marker geometry and texture are described. |
} | | | | |
if (type == TRACKABLE_MARKER_GEO) { | | | | |
coordinates | array | ✓ Yes | N/A | Array of three float numbers giving the longitude, latitude, and elevation of the geolocation of the center, as specified in [GeoJSON] for the point geometry type: the longitude in units of decimal degrees, in the range −180.0 to 180.0 inclusive (positive values represent eastern longitude, negative values western longitude); the latitude in units of decimal degrees, in the range −90.0 to 90.0 inclusive (positive values represent northern latitude, negative values southern latitude); and the elevation in units of meters above the WGS 84 reference ellipsoid. |
} | | | | |
if (type == TRACKABLE_APPLICATION) { | | | | |
trackableId | string | ✓ Yes | N/A | An application-defined trackable ID that is known to the application. |
} | | | | |
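A sketch of Trackable objects for several types follows. The integer type values are assumed to follow the numeric assignments in Table 4; the node index, geocoordinates, and trackable ID are illustrative values, and the OpenXR path is the example given in Table 3.

```json
[
  { "type": 2, "path": "/user/hand/left/input" },
  { "type": 3, "geometricConstraint": 0 },
  { "type": 4, "markerNode": 5 },
  { "type": 6, "coordinates": [2.349014, 48.864716, 35.0] },
  { "type": 7, "trackableId": "com.example.table-01" }
]
```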
Table 4 – Definition of the Trackable type

Trackable type | Description |
---|---|
TRACKABLE_FLOOR = 0 | A plane spanning the xz-plane of the local space, centered on the detected floor of the scene (see the description below). |
TRACKABLE_VIEWER = 1 | Tracks the pose of the primary viewer; head-locked (see below). |
TRACKABLE_CONTROLLER = 2 | Tracks one of the active controllers, using the grip, aim, or palm pose (see below). |
TRACKABLE_PLANE = 3 | A detected flat rectangular surface satisfying the geometricConstraint (see below). |
TRACKABLE_MARKER_2D = 4 | A detected 2D marker whose geometry and texture are described by markerNode (see below). |
TRACKABLE_MARKER_3D = 5 | A detected 3D marker whose mesh is described by markerNode (see below). |
TRACKABLE_MARKER_GEO = 6 | A geolocated trackable whose position is given by the coordinates array (see below). |
TRACKABLE_APPLICATION = 7 | A trackable identified by an application-defined trackableId (see below). |
Table 5 – MPEG_anchor object instantiation at the node or scene level

Name | Type | Required | Default | Description |
---|---|---|---|---|
anchor | integer | ✓ Yes | N/A | Reference to an item in the anchors array of the MPEG_anchor extension. |
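For example, a root node can carry the anchor reference as follows (a minimal sketch; the node name, anchor index, and children are illustrative):

```json
{
  "nodes": [
    {
      "name": "anchored_root",
      "extensions": {
        "MPEG_anchor": { "anchor": 0 }
      },
      "children": [1, 2]
    }
  ]
}
```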
A trackable of type TRACKABLE_FLOOR provides a plane that spans the xz-plane of the trackable’s local coordinate system. The origin of the local coordinate system is located at the center of the detected flat rectangular surface considered to be the floor of the scene. The y-axis of the local coordinate system is the plane’s normal vector, and the x and z axes are aligned with the rectangle edges.

NOTE This corresponds to XR_REFERENCE_SPACE_TYPE_STAGE, defined in clause 7.1 (Reference Spaces) of the OpenXR specification.
A trackable of type TRACKABLE_VIEWER corresponds to the viewer’s pose. It tracks the primary viewer, with the origin at the viewer’s position, +Y up, +X to the right, and -Z forward. This trackable is head-locked to the viewer.

NOTE This corresponds to XR_REFERENCE_SPACE_TYPE_VIEW, defined in clause 7.1 (Reference Spaces) of the OpenXR specification.
A trackable of type TRACKABLE_CONTROLLER corresponds to one of the active controllers. Three controller poses are defined: grip, aim, and palm.

A grip pose allows applications to reliably render a virtual object held in the user’s hand, whether it is tracked directly or by a motion controller. The grip pose is defined as follows:
- The grip position:
  - For tracked hands: the user’s palm centroid when closing the fist, at the surface of the palm.
  - For handheld motion controllers: a fixed position within the controller that generally lines up with the palm centroid when held by a hand in a neutral position. This position should be adjusted left or right to center the position within the controller’s grip.
- The grip orientation:
  - +X axis: when you completely open your hand to form a flat 5-finger pose, the ray that is normal to the user’s palm (away from the palm in the left hand, into the palm in the right hand).
  - -Z axis: when you close your hand partially (as if holding the controller), the ray that goes through the center of the tube formed by your non-thumb fingers, in the direction of little finger to thumb.
  - +Y axis: orthogonal to +Z and +X using the right-hand rule.

An aim pose allows applications to point in the world using the input source, according to the platform’s conventions for aiming with that kind of source. The aim pose is defined as follows:
- For tracked hands: the ray that follows platform conventions for how the user aims at objects in the world with their entire hand, with +Y up, +X to the right, and -Z forward. The ray chosen is runtime-dependent; for example, a ray emerging from the palm parallel to the forearm.
- For handheld motion controllers: the ray that follows platform conventions for how the user targets objects in the world with the motion controller, with +Y up, +X to the right, and -Z forward. This is usually for applications that render a model matching the physical controller, as an application rendering a virtual object in the user’s hand likely prefers to point based on the geometry of that virtual object. The ray chosen is runtime-dependent, although it often emerges from the frontmost tip of a motion controller.

A palm pose allows applications to reliably anchor visual content relative to the user’s physical hand, whether the hand is tracked directly or its position and orientation are inferred from a physical controller. The palm pose is defined as follows:
- The palm position: the user’s physical palm centroid, at the surface of the palm.
- The palm orientation:
  - +X axis: when a user is holding the controller and straightens their index finger, the ray that is normal to the user’s palm (away from the palm in the left hand, into the palm in the right hand).
  - -Z axis: when a user is holding the controller and straightens their index finger, the ray that is parallel to the finger’s pointing direction.
  - +Y axis: orthogonal to +Z and +X using the right-hand rule.

NOTE This corresponds to the standard pose identifiers defined in clause 6.3.2 (Input subpaths) of the OpenXR specification.
For a trackable of type TRACKABLE_PLANE, the width and length of the plane span the xz-plane of the anchor instance’s local coordinate system. The origin of the local coordinate system is located at the center of the detected flat rectangular surface. The y-axis of the plane anchor is the plane’s normal vector, and the x and z axes are aligned with the rectangle edges.

For a trackable of type TRACKABLE_MARKER_2D, the width and length of the 2D marker span the xz-plane of the anchor instance’s local coordinate system. The origin of the local coordinate system is located at the center of the detected 2D marker surface. The y-axis of the anchor is the plane’s normal vector, and the x and z axes are aligned with the rectangle edges.

For a trackable of type TRACKABLE_MARKER_3D, the origin is the center of the marker mesh. The x, y, and z axes correspond to the axes of the world space.

For a trackable of type TRACKABLE_MARKER_GEO, the y-axis matches the direction of gravity as detected by the device’s motion-sensing hardware: the vector (0, −1, 0) points downward. The x and z axes match the longitude and latitude directions: the vector (0, 0, −1) points to true north and the vector (−1, 0, 0) points west.

The application-defined trackable object (type TRACKABLE_APPLICATION) shall have a right-handed coordinate space.
Each trackable provides a local reference space in which an anchor pose can be expressed. This local reference space is right-handed and depends on the type of the trackable, as described above. The MPEG_anchor extension shall not be present at both the scene level and the node level at the same time in a glTF file conforming to this document.

Upon activation of the XR mode, the Presentation Engine identifies the anchors associated with the scene or with the root nodes, together with their referenced trackables. In the case of a trackable of type TRACKABLE_MARKER_GEO, it is up to the application to decide when to render the virtual object. If the requiresAnchoring Boolean parameter of an anchor is set to TRUE, the application shall skip the virtual assets attached to this anchor until the pose of this anchor in the real world is known; otherwise, the application shall process the virtual assets attached to this anchor.

Upon the detection of a trackable within the user’s physical environment, the application creates an XR space and possibly determines the bounding box of the real-world available space in which the virtual assets can be added. If the optional minimumRequiredSpace parameter of the anchor referencing the trackable is present, and the estimated real-world available space is smaller in volume than the minimumRequiredSpace, the application shall skip the virtual assets attached to that anchor. Otherwise, the application shall start the tracking, launch the actions provided in the actions parameter of the anchor, and apply the necessary spatial transformations to the virtual assets with respect to the local space of the anchor prior to rendering.

The application may update at runtime a tracking status for each trackable defined in the trackables array at the scene level, for instance as follows:
- A tracking status set to TRUE when the trackable is detected and its pose is well identified.
- A tracking status set to FALSE when the trackable position is lost (e.g., no longer detected).

To provide the flexibility to define an anchor space different from the trackable’s space:
- The TRS of the node carrying the MPEG_anchor extension defines the relative transformation between the trackable’s space and the anchor space in which the child nodes containing the virtual assets are placed (see the sketch below).
- A setTransform action may be defined to affect the scene root nodes in the case of an MPEG_anchor extension at the scene level. This transformation defines the TRS between the trackable’s space and the anchor space in which all the scene root nodes are placed.

Actions are defined in the MPEG_scene_interactivity extension. If the actions array is not empty, the actions are executed once the pose of the anchor is determined. If the tracking status is set to FALSE after having been TRUE, the actions are not canceled.
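A minimal sketch of the first mechanism follows: the translation of the node carrying the MPEG_anchor extension offsets the anchor space relative to the trackable’s space, and the child node holds the virtual assets. All names, indices, and values are illustrative.

```json
{
  "nodes": [
    {
      "name": "anchored_root",
      "translation": [0.0, 0.5, -1.0],
      "extensions": { "MPEG_anchor": { "anchor": 0 } },
      "children": [1]
    }
  ]
}
```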
JSON schema: MPEG_anchor.schema.json
- ISO/IEC FDIS 23090-14, Information technology — Coded representation of immersive media — Part 14: Scene Description
- ISO/IEC WD 23090-24, Information technology — Coded representation of immersive media — Part 24: Conformance and Reference Software for Scene Description for MPEG Media
- [GeoJSON] IETF RFC 7946, The GeoJSON Format
Copyright ISO/IEC 2022
The use of the "MPEG scene description extensions" is subject to the license as accessible here: https://standards.iso.org/ and is subject to the IPR policy as accessible here: https://www.iso.org/iso-standards-and-patents.html.