Spaces:

Rice-RobotPI-Lab
/

EgoInfinity

Running

App Files Files Community

EgoInfinity / README.md

VectorW

Initial commit

66d097c 9 days ago

preview code

Raw

History Blame Contribute Delete

2.79 kB

metadata

title: EgoInfinity Browser
emoji: 🪞
colorFrom: indigo
colorTo: pink
sdk: static
pinned: false
license: mit
short_description: Static viewer for the EgoInfinity dataset

EgoInfinity Browser

A static, browser-only viewer for curated EgoInfinity clips. No backend, no Python, no live viser server. Every clip is served as pre-baked assets and rendered with three.js / the viser-build-client.

Source code: https://github.com/Rice-RobotPI-Lab/EgoInfinity

Dataset: https://huggingface.co/datasets/Rice-RobotPI-Lab/egoinfinity

What you see

For each clip:

YouTube iframe: the original source video, embedded from YouTube.
Depth video (MoGe-2) and optical flow (MEMFOF), rendered as colormapped MP4s.
SAM-tracked object mask highlighting the active object across the clip.
3D scene: viser recording showing the per-frame point cloud, hand mesh, object meshes (Gaussian Splat baked), and skeleton.
Hand skeleton overlay projected onto the depth video.
Action signals: per-frame timeseries (hand velocity, contact, etc.).

Get the data

The full processed dataset is hosted on Hugging Face Datasets:

# Snapshot download (entire dataset, ~370 MB)
pip install huggingface_hub
hf download Rice-RobotPI-Lab/egoinfinity --repo-type=dataset --local-dir ./egoinfinity-data

# Or browse interactively at:
# https://huggingface.co/datasets/Rice-RobotPI-Lab/egoinfinity

While this Space is private, you must be logged into a Hugging Face account with read access to both repositories for assets to load in the viewer.

License & attribution

Code in this Space (HTML, JS, viser-client) is released under the MIT License. See LICENSE.
Data assets (depth/flow/mask videos, viser recordings, hand parameters, scene metadata) are derivative works of Action100M (Meta FAIR) and are governed by the FAIR Noncommercial Research License v1. See LICENSE-Action100M.
Original videos remain on YouTube and are embedded via the YouTube IFrame API. Only the SAM-tracked region of each frame appears in the derivative mask.mp4 (the rest is painted black).
Hand visualization uses pose parameters derived from the MANO hand model (Romero et al., 2017). MANO is not redistributed; only baked vertex positions appear in the viewer.
WiLoR weights (CC-BY-NC-ND) are not redistributed.

Citation

If you find this useful in your research, please cite:

@misc{egoinfinity2026,
  title  = {EgoInfinity: A Web-Scale Data Engine for Video-to-Action Robot Learning through Egocentric Views},
  author = {Rice RobotPI Lab},
  year   = {2026},
  note   = {Preview release}
}