Artifact Guide
This guide is the human-readable map for the public Ropedia Xperience-10M task
suite artifacts. It complements the machine-readable
docs/data/artifact_index.json.
The project separates these reading layers:
- Project status: one compact table for first-pass current-state decisions.
- Project scope and roadmap: what is implemented now, what is setup-stage, what remains gated by multi-episode data access, and how the staged research path progresses.
- Official source alignment: what the upstream Xperience-10M dataset card, public sample card, and HF API metadata say, and which parts this repo currently covers.
- Evaluation protocol: windowing, split policy, per-task metrics, leakage controls, and current limitations.
- Visual evidence: public figures, charts, modality thumbnails, dimensions, hashes, roles, and source scripts.
- Data contract: how one public Xperience-10M sample episode becomes aligned model windows and feature blocks.
- Task evidence: minimal and neural results for the 12 task contracts plus audio contribution variants, and four research-direction extension probes.
- Reproducibility: public commands, expected outputs, and exact-match evidence for the single-episode pipeline.
- Public project surface: repo, website, and Hugging Face pages, accessibility semantics, links, and reader-facing copy.
- Multi-episode pilot status: scripts and reports for the selected-episode Qwen3-Omni pilot, with the data-access requirement kept visible.
- Foundation-model selection: Qwen3-Omni, Cosmos 3, GR00T, OpenVLA, openpi, Gemini Robotics, and lightweight policy candidates separated by task fit and current evidence level.
Start Here
| Artifact | Why to open it first |
|---|---|
| PROJECT_STATUS.md | Gives the fastest current-state table: implemented, in staging, and outside current scope. |
| RESEARCH_ROADMAP.md | Shows the staged path from public-sample task development to multi-episode data staging, Qwen3-Omni LoRA, robustness runs, and larger omni-model extensions. |
| FOUNDATION_MODEL_PLAN.md | Explains which foundation backbones fit which Xperience-10M objective: Qwen3-Omni first, Cosmos 3 for world modeling, and VLA/policy models after action-target conversion. |
| EVIDENCE_CONTRACT.md | Defines the implemented scope, setup-stage artifacts, and multi-episode prerequisites. |
| QUALITY_GATES.md | Lists the automated release checks and post-publish verification used to keep the release current. |
| PUBLIC_SURFACE_QA.md | Describes whether repo, website, and Hugging Face cards read as one cohesive research project surface. |
| EVALUATION_PROTOCOL.md | Defines the task unit, chronological split, metrics, leakage controls, and current limitations. |
| XPERIENCE10M_DATASET_CARD_ALIGNMENT.md | Aligns this repo's public dataset wording with the official gated Xperience-10M card, sample card, and HF API metadata. |
| SOURCE_ALIGNMENT_AUDIT.md | Summarizes official dataset facts, sample-card facts, API-listing notes, and project coverage across repo, website, and HF cards. |
| FIGURE_INDEX.md | Catalogs public figures, charts, modality thumbnails, dimensions, hashes, roles, and source scripts. |
| docs/data/brand_assets.json | Catalogs the generated logo system, favicon, app icon, social card, dimensions, hashes, and usage roles. |
| REPRODUCIBILITY.md | Defines public reproduction commands, expected outputs, and unreproducible boundaries. |
| docs/data/artifact_index.json | Lists project-critical files with existence, size, and stable hashes. |
| docs/data/figure_index.json | Machine-readable visual asset index for website and HF mirrors. |
| docs/data/project_status.json | Machine-readable copy of the project status table. |
| docs/data/research_roadmap.json | Machine-readable roadmap for website and Hugging Face mirrors. |
| docs/data/foundation_model_plan.json | Machine-readable foundation-model selection matrix for website and Hugging Face mirrors. |
| docs/data/xperience10m_dataset_card_alignment.json | Machine-readable source-alignment summary, including gated metadata, sample license/tooling, and current project coverage. |
| docs/data/source_alignment_audit.json | Machine-readable source metadata and HF card parity report. |
| docs/data/evaluation_protocol.json | Machine-readable evaluation protocol generated from committed metrics. |
| results/audio_ablation/AUDIO_ABLATION_SUMMARY.md | Shows measured current-audio and raw log-mel replacement deltas across the 12 task contracts. |
| docs/data/audio_ablation_summary.json | Machine-readable audio ablation summary for website and HF mirrors. |
| docs/data/quality_gates.json | Machine-readable release-check summary for website and HF mirrors. |
| docs/data/public_surface_qa.json | Machine-readable public project-surface report for website, repo, and Hugging Face pages. |
| docs/data/live_publication_status.json | Last live GitHub/HF verification after upload. |
| docs/data/mirror_parity.json | Confirms prepared HF Space, artifact, and model mirrors match the repo for critical data, figures, website HTML, and validator scripts. |
| docs/data/publication_audit.json | Summarizes public bundle contents and exclusions for raw data, Python caches, heavy archives, token strings, and public-card figure references. |
| docs/data/scope_claims_audit.json | Separates setup identifiers from completed held-out-episode results. |
| docs/data/task_surface_integrity.json | Confirms the public 12-task cards use readable task names, modality thumbnails, and the interactive walkthrough/player data contract. |
| docs/data/website_integrity.json | Confirms local site links, anchors, JSON bundles, and referenced images resolve. |
| RENDERED_SITE_CHECK.md and docs/data/rendered_site_check.json | Records the latest browser-level page load, tab navigation, walkthrough deep link, player interaction, and console-health check. |
| docs/data/project_packet.json | Gives the shortest machine-readable project route. |
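For readers who want to check an index like docs/data/artifact_index.json against a local checkout, the following is a minimal sketch of an existence, size, and hash check. The entry keys `path`, `size_bytes`, and `sha256` are assumptions made for illustration; the real index schema may use different names or a different hash, so adapt the keys after inspecting the file.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the hex SHA-256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_entries(entries, root: Path):
    """Return (path, problem) pairs for entries that fail a check.

    Each entry is assumed (hypothetically) to carry the keys
    "path", "size_bytes", and "sha256".
    """
    problems = []
    for entry in entries:
        target = root / entry["path"]
        if not target.is_file():
            problems.append((entry["path"], "missing"))
        elif target.stat().st_size != entry["size_bytes"]:
            problems.append((entry["path"], "size mismatch"))
        elif sha256_of(target) != entry["sha256"]:
            problems.append((entry["path"], "hash mismatch"))
    return problems


def demo():
    """Run the check against a throwaway directory with one real file."""
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        status = root / "PROJECT_STATUS.md"
        status.write_bytes(b"status\n")  # 7 bytes of placeholder content
        good = {
            "path": "PROJECT_STATUS.md",
            "size_bytes": 7,
            "sha256": sha256_of(status),
        }
        absent = {"path": "MISSING.md", "size_bytes": 0, "sha256": ""}
        return verify_entries([good, absent], root)
```

Calling `demo()` reports only the entry whose file does not exist. Against the real repo, the entries would be loaded with `json.load` and `root` would point at the checkout.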
Official Source Alignment
| Artifact | What it shows |
|---|---|
| XPERIENCE10M_DATASET_CARD_ALIGNMENT.md | Human-readable summary of the official gated Xperience-10M dataset card, public sample card, API listing snapshot, scale, modalities, access terms, intended uses, and limitations. |
| docs/data/xperience10m_dataset_card_alignment.json | Machine-readable copy of the same alignment facts for website and HF mirrors. |
| SOURCE_ALIGNMENT_AUDIT.md | Generated source-alignment report showing source facts, sample license/tooling, API-listing notes, and current project scope. |
| docs/data/source_alignment_audit.json | Machine-readable source metadata and HF card parity report. |
| scripts/validate_source_alignment.py | Regenerates the source-alignment report from committed alignment facts and public card text. |
Evaluation Protocol
| Artifact | What it shows |
|---|---|
| EVALUATION_PROTOCOL.md | Human-readable task protocol: window unit, chronological split, input/target contracts, primary metrics, leakage controls, and current limitations. |
| docs/data/evaluation_protocol.json | Machine-readable protocol generated from committed task metrics. |
| scripts/build_evaluation_protocol.py | Regenerates the protocol from docs/data/summary_metrics.json and source task artifacts. |
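As an illustration of what a chronological split with a leakage guard can look like, here is a minimal sketch. It is not the repo's implementation: the train fraction and the gap size below are hypothetical values chosen for the example, and the actual split policy is the one defined in EVALUATION_PROTOCOL.md.

```python
def chronological_split(num_windows, train_fraction=0.7, gap=0):
    """Split window indices 0..num_windows-1 in time order.

    The earliest windows form the train set. `gap` windows right after the
    train block are dropped so that windows sharing frames with the train
    block do not end up in the test set. The remaining windows are the test set.
    """
    if not 0.0 < train_fraction < 1.0:
        raise ValueError("train_fraction must be strictly between 0 and 1")
    if gap < 0:
        raise ValueError("gap must be non-negative")
    train_end = int(num_windows * train_fraction)
    test_start = min(train_end + gap, num_windows)
    train = list(range(train_end))
    test = list(range(test_start, num_windows))
    return train, test


# Example: 1,161 windows as in the data contract. The gap of 19 assumes
# 20-frame windows taken at a stride of one frame, which is an assumption
# for this sketch and not a documented property of the pipeline.
train_idx, test_idx = chronological_split(1161, train_fraction=0.7, gap=19)
```

Because every train index precedes every test index, no test window is older than any train window, which is the property a chronological split is meant to guarantee.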
Visual Evidence
| Artifact | What it shows |
|---|---|
| FIGURE_INDEX.md | Human-readable catalog of public visual assets, dimensions, hashes, roles, and source scripts. |
| docs/data/figure_index.json | Machine-readable visual asset index mirrored to the website, artifact dataset, and model repo. |
| scripts/build_figure_index.py | Regenerates visual-asset hashes, dimensions, and source-script provenance. |
| docs/data/brand_assets.json | Machine-readable logo/brand manifest for the website, README, Hugging Face cards, favicon, app icon, and social preview. |
| docs/assets/brand/xperience10m-logo-social-card.png | Project logo card used by README and Hugging Face cards. |
| scripts/build_brand_assets.py | Regenerates deterministic logo derivatives, favicon variants, app icons, and the social card from the generated logo mark. |
| docs/assets/task_suite_infographic.png | Primary 12-task suite map with sample modality thumbnails. |
| docs/assets/pipeline_diagram.png | Episode-to-task pipeline overview. |
| docs/assets/task_architectures.png | Minimal and neural task-head architecture map. |
Data Contract
| Artifact | What it shows |
|---|---|
| results/episode_task_suite/windows.csv | The sample episode is converted into 1,161 aligned 20-frame windows. |
| results/episode_task_suite/feature_manifest.json | The current input vector has 8,546 dimensions with explicit modality-group boundaries, including a 168-d audio group. |
| results/episode_task_suite/available_modalities.json | The sample modality coverage is recorded, including the current audio-featurization status. |
| results/audio_ablation/raw_logmel_fisheye_cam0_sr16000_mels64_fft512_hop160.npz | Derived 588-d raw log-mel window features decoded from the local public-sample MP4 audio stream; raw audio itself is not redistributed. |
| docs/data/modality_atlas.json | The responsive website modality cards and derived thumbnail assets are documented without redistributing raw data. |
| docs/assets/modalities/ | Small public-sample thumbnails used by the readable modality atlas. |
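One way to sanity-check explicit modality-group boundaries is to confirm that the groups tile the full input vector with no gaps or overlaps. The sketch below assumes a hypothetical manifest layout in which each group has `name`, `start`, and `end` (exclusive) keys; the real feature_manifest.json may be structured differently. Only the 8,546-d total and the 168-d audio group come from this guide. The placeholder group and the position of the audio block are invented for the example.

```python
def check_groups(groups, total_dim):
    """Verify that groups are contiguous, non-empty, and cover [0, total_dim).

    Returns a mapping from group name to width. Raises ValueError on a gap,
    an overlap, an empty group, or a total that does not match total_dim.
    """
    cursor = 0
    for group in sorted(groups, key=lambda g: g["start"]):
        if group["end"] <= group["start"]:
            raise ValueError(f"group {group['name']!r} is empty or inverted")
        if group["start"] != cursor:
            raise ValueError(
                f"group {group['name']!r} starts at {group['start']}, expected {cursor}"
            )
        cursor = group["end"]
    if cursor != total_dim:
        raise ValueError(f"groups end at {cursor}, expected {total_dim}")
    return {g["name"]: g["end"] - g["start"] for g in groups}


# Hypothetical manifest: everything that is not audio is lumped into one
# placeholder group, and the audio block is placed last for illustration.
example_groups = [
    {"name": "non_audio_placeholder", "start": 0, "end": 8378},
    {"name": "audio", "start": 8378, "end": 8546},
]
widths = check_groups(example_groups, total_dim=8546)
```

A check like this fails loudly when a group is resized without updating its neighbours, which is the main risk when feature blocks are concatenated into one vector.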
Task Evidence
| Artifact | What it shows |
|---|---|
| results/episode_task_suite/summary_report.json | The 12 task contracts, chronological split, and minimal/neural metrics. |
| results/episode_task_suite/neural_mlp/ | Matching PyTorch MLP heads for the same task contracts and feature windows. |
| results/episode_task_suite/research_directions/ | Mapping from the 12 tasks to the four Ropedia research directions. |
| results/episode_task_suite/research_direction_extensions/ | Four additional coded probes, one per research direction. |
| results/episode_task_suite/task_walkthroughs/ | Human-readable research names and case studies explaining input, process modules, output, metric, limitation, and the website task-player data. |
| results/audio_ablation/audio_ablation_metrics.csv | All 72 measured audio rows: 12 tasks times six variants, including no-audio, audio-only, alternate-audio-only, representation replacement, and all-input variants. |
| results/audio_ablation/audio_delta_summary.csv | Compact per-task audio delta table for quick manual inspection. |
| scripts/audio_ablation_and_raw_upgrade.py | Regenerates audio contribution results from real task-suite artifacts plus the local public-sample MP4. |
| scripts/validate_task_surface.py | Fails publication if public task cards drift back to raw artifact ids or lose their thumbnail/player wiring. |
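The 72-row figure for the audio ablation metrics follows from a full grid of 12 tasks times six variants. A reader can confirm that a CSV is a complete grid, with every task and variant pair present exactly once, using a sketch like the one below. The column names `task` and `variant` are assumptions, and the synthetic rows use placeholder names, not the repo's task or variant identifiers.

```python
import csv
import io
from collections import Counter


def check_full_grid(csv_text, task_column="task", variant_column="variant"):
    """Summarize whether each (task, variant) pair appears exactly once."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    pairs = Counter((row[task_column], row[variant_column]) for row in rows)
    tasks = sorted({task for task, _ in pairs})
    variants = sorted({variant for _, variant in pairs})
    missing = [(t, v) for t in tasks for v in variants if (t, v) not in pairs]
    duplicates = sorted(pair for pair, count in pairs.items() if count > 1)
    return {
        "rows": len(rows),
        "tasks": len(tasks),
        "variants": len(variants),
        "missing": missing,
        "duplicates": duplicates,
    }


def synthetic_csv(num_tasks=12, num_variants=6):
    """Build placeholder CSV text; the metric values are dummies, not results."""
    lines = ["task,variant,metric"]
    for t in range(num_tasks):
        for v in range(num_variants):
            lines.append(f"task_{t:02d},variant_{v},0.0")
    return "\n".join(lines) + "\n"


report = check_full_grid(synthetic_csv())
```

For the real file, the text would come from reading results/audio_ablation/audio_ablation_metrics.csv, with the column names adjusted to match its header.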
Reproducibility
| Artifact | What it shows |
|---|---|
| REPRODUCIBILITY.md | Public commands, expected outputs, and non-reproducible boundaries are explicit. |
| docs/data/reproducibility_matrix.json | Machine-readable command matrix for website and HF mirrors. |
| notes/reproducibility_audit.md | The last exact metric rebuild reproduced the public-sample metrics and matched committed artifacts. |
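An exact-match comparison between committed and rebuilt metrics can be sketched as follows. This is an illustrative helper rather than the repo's audit script: the nested-dictionary layout and the metric names in the example are hypothetical, and the values are dummies.

```python
def flatten(metrics, prefix=""):
    """Flatten nested dicts into dotted keys, e.g. {"a": {"b": 1}} -> {"a.b": 1}."""
    flat = {}
    for key, value in metrics.items():
        name = f"{prefix}{key}"
        if isinstance(value, dict):
            flat.update(flatten(value, prefix=f"{name}."))
        else:
            flat[name] = value
    return flat


def exact_match_report(committed, rebuilt):
    """List keys that are missing, unexpected, or changed in the rebuild."""
    old, new = flatten(committed), flatten(rebuilt)
    return {
        "missing": sorted(set(old) - set(new)),
        "unexpected": sorted(set(new) - set(old)),
        "changed": sorted(k for k in set(old) & set(new) if old[k] != new[k]),
    }


# Dummy metrics with hypothetical names, used only to show the report shape.
committed = {"task_a": {"accuracy": 0.5, "f1": 0.25}, "task_b": {"mae": 1.0}}
rebuilt = {"task_a": {"accuracy": 0.5, "f1": 0.5}, "task_c": {"mae": 1.0}}
diff = exact_match_report(committed, rebuilt)
```

An exact match corresponds to all three lists being empty. Comparing with `!=` is deliberately strict, so any floating-point drift between runs shows up as a change.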
Platform Mirrors
| Surface | Purpose |
|---|---|
| GitHub Pages dashboard | Primary public website and visual research flow. |
| Hugging Face Space | Static app mirror for HF users. |
| HF artifact dataset | Derived CSV/JSON/Markdown/figure artifacts without raw Xperience-10M data. |
| HF baseline model repo | Lightweight minimal and neural task-head model files. |
| HF collection | One grouped landing page for the Space, artifact dataset, and baseline model repo. |
| Public surface artifact | What it keeps aligned |
|---|---|
| PUBLIC_SURFACE_QA.md | Human-readable public project-surface report for repo, website, and Hugging Face cards. |
| docs/data/public_surface_qa.json | Machine-readable report for SEO/social metadata, accessible tabs, public links, project links, and reader-facing copy. |
| scripts/build_public_surface_qa.py | Regenerates the public project-surface report before release. |
Scale-Up Readiness
| Artifact | Current status |
|---|---|
| results/omni_finetune/DATA_ACCESS_STATUS.md | Summarizes the staging requirement before the held-out Qwen3-Omni pilot can report metrics. |
| results/omni_finetune/MULTI_EPISODE_ACCESS_STATUS.md | Documents the public multi-episode access path, selected relay plan, and data requirements. |
| scripts/omni/discover_xperience10m_sources.py | Discovery gate for valid multi-episode Xperience-10M sources. |
| scripts/omni/train_qwen3_omni_lora.py | Training entrypoint for the Qwen3-Omni LoRA pilot after the data gate passes. |
| FOUNDATION_MODEL_PLAN.md | Adds the post-data-gate backbone selection plan: Qwen3-Omni first, Cosmos 3 for world modeling, and OpenVLA/openpi/GR00T for policy/action branches. |
| docs/data/foundation_model_plan.json | Machine-readable model-family registry with source links, entry conditions, and evaluation additions. |
What Is Not Included
The public repo and Hugging Face mirrors do not redistribute raw Xperience-10M
videos, raw annotation.hdf5, gated private dataset files, full Qwen weights,
or large full checkpoints. Dataset use remains governed by the official
Ropedia/Xperience-10M terms.