Process

The Python toolkit for reading, processing, and visualizing Stera recordings.

Process is the Python toolkit that turns raw recordings from the Stera App into research-ready data. The app captures synchronized RGB, LiDAR depth, 6-DOF camera pose, and IMU on an iPhone Pro and exports an MCAP recording. Process reads those recordings, runs detectors and refiners over them, visualises everything in Rerun, and exports a self-contained episode directory you can hand to downstream tasks.

import stera
from stera import Evaluate
from stera.data import MCAPReader
from stera.models import HandTracker, UpperBodyEstimator
from stera.viz import Visualizer

stera.setup_logging()

session   = MCAPReader("recording.mcap")
hands     = HandTracker(model="mediapipe")
estimator = UpperBodyEstimator(session=session)
viz       = Visualizer(session, map_3d="both", mesh_refine={"color_speed": 0.5})

for frame in session.frames():
    hand_poses = hands.detect_hands(frame)
    skeleton   = estimator.estimate(frame, hands=hand_poses)
    session.add_hand_pose(frame.index, hand_poses)
    viz.log_frame(frame, hands=hand_poses, skeleton=skeleton)

session.export("episodes/run_01", visualizer=viz)
Evaluate(session).show()    # interactive HTML QC report

What's inside

Next

On this page