Publish Ropedia Xperience-10M task baseline cards

Browse files

Files changed (15) hide show

ARTIFACT_GUIDE.md +23 -21
EVIDENCE_CONTRACT.md +33 -33
QUALITY_GATES.md +41 -0
README.md +4 -1
metrics/artifact_index.json +34 -11
metrics/evidence_contract.json +11 -0
metrics/mirror_parity.json +153 -62
metrics/publication_audit.json +12 -9
metrics/quality_gates.json +101 -0
metrics/scope_claims_audit.json +1 -1
metrics/website_integrity.json +11 -6
scripts/build_artifact_index.py +16 -0
scripts/build_quality_gates.py +189 -0
scripts/validate_mirror_parity.py +25 -0
scripts/validate_publication_package.py +3 -0

ARTIFACT_GUIDE.md CHANGED Viewed

@@ -2,9 +2,9 @@
 This guide is the human-readable map for the public Ropedia Xperience-10M task
 suite artifacts. It complements the machine-readable
-[`metrics/artifact_index.json`](metrics/artifact_index.json).
-The project intentionally separates four layers:
 1. **Proof boundary:** what is claimed, what is smoke-only, and what remains
    gated by data access.
@@ -22,40 +22,42 @@ The project intentionally separates four layers:
 | Artifact | Why to open it first |
 | --- | --- |
 | [`EVIDENCE_CONTRACT.md`](EVIDENCE_CONTRACT.md) | Defines which claims are verified and which are explicitly not claimed. |
 | [`REPRODUCIBILITY.md`](REPRODUCIBILITY.md) | Defines public reproduction commands, expected outputs, and unreproducible boundaries. |
-| [`metrics/artifact_index.json`](metrics/artifact_index.json) | Lists reviewer-critical files with existence, size, and stable hashes. |
-| [`metrics/mirror_parity.json`](metrics/mirror_parity.json) | Confirms prepared HF Space, artifact, and model mirrors match the repo for critical data, figures, website HTML, and validator scripts. |
-| [`metrics/publication_audit.json`](metrics/publication_audit.json) | Confirms public bundles exclude raw data, Python caches, heavy archives, token strings, and stale public-card figure references. |
-| [`metrics/scope_claims_audit.json`](metrics/scope_claims_audit.json) | Confirms historical `32ep` smoke-run identifiers are not presented as real 32-episode results. |
-| [`metrics/website_integrity.json`](metrics/website_integrity.json) | Confirms local site links, anchors, JSON bundles, and referenced images resolve. |
-| [`metrics/reviewer_packet.json`](metrics/reviewer_packet.json) | Gives the shortest machine-readable reviewer route. |
 ## Data Contract
 | Artifact | What it proves |
 | --- | --- |
-| [`artifacts/episode_task_suite/windows.csv`](artifacts/episode_task_suite/windows.csv) | The sample episode is converted into 1,161 aligned 20-frame windows. |
-| [`artifacts/episode_task_suite/feature_manifest.json`](artifacts/episode_task_suite/feature_manifest.json) | The current input vector has 8,378 dimensions with explicit feature-block boundaries. |
-| [`artifacts/episode_task_suite/available_modalities.json`](artifacts/episode_task_suite/available_modalities.json) | The sample modality coverage is recorded, including the current audio-featurization boundary. |
-| [`metrics/modality_atlas.json`](metrics/modality_atlas.json) | The responsive website modality cards and derived thumbnail assets are documented without redistributing raw data. |
-| [`assets/modalities/`](assets/modalities/) | Small public-sample thumbnails used by the readable modality atlas. |
 ## Task Evidence
 | Artifact | What it proves |
 | --- | --- |
-| [`artifacts/episode_task_suite/summary_report.json`](artifacts/episode_task_suite/summary_report.json) | The 12 task contracts, chronological split, and minimal/neural metrics. |
-| [`artifacts/episode_task_suite/neural_mlp/`](artifacts/episode_task_suite/neural_mlp/) | Matching PyTorch MLP heads for the same task contracts and feature windows. |
-| [`artifacts/episode_task_suite/research_directions/`](artifacts/episode_task_suite/research_directions/) | Mapping from the 12 tasks to the four Ropedia research directions. |
-| [`artifacts/episode_task_suite/research_direction_extensions/`](artifacts/episode_task_suite/research_direction_extensions/) | Four additional coded probes, one per research direction. |
-| [`artifacts/episode_task_suite/task_walkthroughs/`](artifacts/episode_task_suite/task_walkthroughs/) | Junior-friendly case studies explaining input, process modules, output, metric, and limitation. |
 ## Reproducibility
 | Artifact | What it proves |
 | --- | --- |
 | [`REPRODUCIBILITY.md`](REPRODUCIBILITY.md) | Public commands, expected outputs, and non-reproducible boundaries are explicit. |
-| [`metrics/reproducibility_matrix.json`](metrics/reproducibility_matrix.json) | Machine-readable command matrix for website and HF mirrors. |
 | [`notes/reproducibility_audit.md`](notes/reproducibility_audit.md) | The last exact metric audit rebuilt the public-sample metrics and matched committed artifacts. |
 ## Platform Mirrors
@@ -72,8 +74,8 @@ The project intentionally separates four layers:
 | Artifact | Current status |
 | --- | --- |
-| Companion GitHub repo: `results/omni_finetune/DATA_BLOCKER_REPORT.md` | Documents why no real 32-episode Qwen3-Omni result is claimed yet. |
-| Companion GitHub repo: `results/omni_finetune/A100_HF_RELAY_STATUS.md` | Documents the pending A100-to-H20 relay and selected 32-session pilot plan. |
 | [`scripts/omni/discover_xperience10m_sources.py`](scripts/omni/discover_xperience10m_sources.py) | Discovery gate for valid multi-episode Xperience-10M sources. |
 | [`scripts/omni/train_qwen3_omni_lora.py`](scripts/omni/train_qwen3_omni_lora.py) | Training entrypoint for the Qwen3-Omni LoRA pilot after the data gate passes. |

 This guide is the human-readable map for the public Ropedia Xperience-10M task
 suite artifacts. It complements the machine-readable
+[`docs/data/artifact_index.json`](docs/data/artifact_index.json).
+The project intentionally separates five layers:
 1. **Proof boundary:** what is claimed, what is smoke-only, and what remains
    gated by data access.
 | Artifact | Why to open it first |
 | --- | --- |
 | [`EVIDENCE_CONTRACT.md`](EVIDENCE_CONTRACT.md) | Defines which claims are verified and which are explicitly not claimed. |
+| [`QUALITY_GATES.md`](QUALITY_GATES.md) | Lists the automated release gates and post-publish checks required before presenting a release as current. |
 | [`REPRODUCIBILITY.md`](REPRODUCIBILITY.md) | Defines public reproduction commands, expected outputs, and unreproducible boundaries. |
+| [`docs/data/artifact_index.json`](docs/data/artifact_index.json) | Lists reviewer-critical files with existence, size, and stable hashes. |
+| [`docs/data/quality_gates.json`](docs/data/quality_gates.json) | Machine-readable quality-gate summary for website and HF mirrors. |
+| [`docs/data/mirror_parity.json`](docs/data/mirror_parity.json) | Confirms prepared HF Space, artifact, and model mirrors match the repo for critical data, figures, website HTML, and validator scripts. |
+| [`docs/data/publication_audit.json`](docs/data/publication_audit.json) | Confirms public bundles exclude raw data, Python caches, heavy archives, token strings, and stale public-card figure references. |
+| [`docs/data/scope_claims_audit.json`](docs/data/scope_claims_audit.json) | Confirms historical `32ep` smoke-run identifiers are not presented as real 32-episode results. |
+| [`docs/data/website_integrity.json`](docs/data/website_integrity.json) | Confirms local site links, anchors, JSON bundles, and referenced images resolve. |
+| [`docs/data/reviewer_packet.json`](docs/data/reviewer_packet.json) | Gives the shortest machine-readable reviewer route. |
 ## Data Contract
 | Artifact | What it proves |
 | --- | --- |
+| [`results/episode_task_suite/windows.csv`](results/episode_task_suite/windows.csv) | The sample episode is converted into 1,161 aligned 20-frame windows. |
+| [`results/episode_task_suite/feature_manifest.json`](results/episode_task_suite/feature_manifest.json) | The current input vector has 8,378 dimensions with explicit feature-block boundaries. |
+| [`results/episode_task_suite/available_modalities.json`](results/episode_task_suite/available_modalities.json) | The sample modality coverage is recorded, including the current audio-featurization boundary. |
+| [`docs/data/modality_atlas.json`](docs/data/modality_atlas.json) | The responsive website modality cards and derived thumbnail assets are documented without redistributing raw data. |
+| [`docs/assets/modalities/`](docs/assets/modalities/) | Small public-sample thumbnails used by the readable modality atlas. |
 ## Task Evidence
 | Artifact | What it proves |
 | --- | --- |
+| [`results/episode_task_suite/summary_report.json`](results/episode_task_suite/summary_report.json) | The 12 task contracts, chronological split, and minimal/neural metrics. |
+| [`results/episode_task_suite/neural_mlp/`](results/episode_task_suite/neural_mlp/) | Matching PyTorch MLP heads for the same task contracts and feature windows. |
+| [`results/episode_task_suite/research_directions/`](results/episode_task_suite/research_directions/) | Mapping from the 12 tasks to the four Ropedia research directions. |
+| [`results/episode_task_suite/research_direction_extensions/`](results/episode_task_suite/research_direction_extensions/) | Four additional coded probes, one per research direction. |
+| [`results/episode_task_suite/task_walkthroughs/`](results/episode_task_suite/task_walkthroughs/) | Junior-friendly case studies explaining input, process modules, output, metric, and limitation. |
 ## Reproducibility
 | Artifact | What it proves |
 | --- | --- |
 | [`REPRODUCIBILITY.md`](REPRODUCIBILITY.md) | Public commands, expected outputs, and non-reproducible boundaries are explicit. |
+| [`docs/data/reproducibility_matrix.json`](docs/data/reproducibility_matrix.json) | Machine-readable command matrix for website and HF mirrors. |
 | [`notes/reproducibility_audit.md`](notes/reproducibility_audit.md) | The last exact metric audit rebuilt the public-sample metrics and matched committed artifacts. |
 ## Platform Mirrors
 | Artifact | Current status |
 | --- | --- |
+| [`results/omni_finetune/DATA_BLOCKER_REPORT.md`](results/omni_finetune/DATA_BLOCKER_REPORT.md) | Documents why no real 32-episode Qwen3-Omni result is claimed yet. |
+| [`results/omni_finetune/A100_HF_RELAY_STATUS.md`](results/omni_finetune/A100_HF_RELAY_STATUS.md) | Documents the pending A100-to-H20 relay and selected 32-session pilot plan. |
 | [`scripts/omni/discover_xperience10m_sources.py`](scripts/omni/discover_xperience10m_sources.py) | Discovery gate for valid multi-episode Xperience-10M sources. |
 | [`scripts/omni/train_qwen3_omni_lora.py`](scripts/omni/train_qwen3_omni_lora.py) | Training entrypoint for the Qwen3-Omni LoRA pilot after the data gate passes. |

EVIDENCE_CONTRACT.md CHANGED Viewed

@@ -5,51 +5,51 @@ local artifact that a reader can inspect before trusting the dashboard.
 | Claim | Current evidence | Status | Boundary |
 | --- | --- | --- | --- |
-| The public Xperience-10M sample has been converted into aligned model windows. | `artifacts/episode_task_suite/windows.csv`, `artifacts/episode_task_suite/shared_windows.npz`, `artifacts/episode_task_suite/summary_report.json` | Verified for 5,821 frames and 1,161 windows | One public sample episode only |
-| The current feature contract is explicit and reviewable. | `artifacts/episode_task_suite/feature_manifest.json`, `artifacts/episode_task_suite/available_modalities.json` | Verified for an 8,378-d feature vector | Audio is present in MP4 streams but not yet a feature block |
-| The public sample modalities are inspectable without raw data redistribution. | `metrics/modality_atlas.json`, `assets/modalities/`, website modality atlas | Verified derived thumbnail atlas | Thumbnails are presentation/review assets, not a replacement for official raw data access |
-| The 12 task heads are real scripts and artifacts, not presentation placeholders. | `scripts/episode_task_suite.py`, `artifacts/episode_task_suite/*/metrics.json`, `artifacts/episode_task_suite/*/predictions.*` | Verified for all 12 task definitions | Chronological single-episode split, not cross-episode generalization |
-| Minimal and neural heads use the same task contracts. | `scripts/neural_task_models.py`, `artifacts/episode_task_suite/neural_mlp/`, `assets/task_architectures.png` | Verified for 12 minimal heads and 12 neural MLP heads | Small heads only; not a foundation model |
-| Four Ropedia research directions are mapped honestly as direct, proxy, or diagnostic evidence. | `artifacts/episode_task_suite/research_directions/research_direction_taxonomy.json`, `metrics/research_directions.json` | Verified taxonomy | Some directions remain proxy-only |
-| Four extra direction probes are coded and evaluated. | `artifacts/episode_task_suite/research_direction_extensions/research_direction_extension_results.json`, `metrics/research_direction_extensions.json` | Verified single-episode probes | Not full human modeling, neural rendering, intent modeling, or world modeling solutions |
-| Qwen3-Omni infrastructure has passed technical smoke checks. | Companion GitHub repo: `results/omni_finetune/RUN_REPORT.md`, `results/omni_finetune/dataset_manifest.json`, `results/omni_finetune/metrics_eval.json` | Smoke-only evidence | One episode, 128 train windows; not a 32-episode pilot |
-| The real 32-episode LoRA pilot is blocked on gated data access, not on repo presentation. | Companion GitHub repo: `results/omni_finetune/DATA_BLOCKER_REPORT.md`, `results/omni_finetune/A100_HF_RELAY_STATUS.md`, `results/omni_finetune/source_discovery.json` | Blocker documented | No 32-episode metric should be claimed until the gate passes |
-| Historical `32ep` path strings are not treated as 32-episode results. | `scripts/validate_scope_claims.py`, `metrics/scope_claims_audit.json` | Verified pass | Classifies old run/path identifiers and fails if public presentation claims real 32-episode metrics |
-| Prepared GitHub/Hugging Face mirrors carry matching critical files. | `scripts/validate_mirror_parity.py`, `metrics/mirror_parity.json` | Verified pass | Compares prepared data files, visual assets, website HTML, and validator scripts before upload; live URLs are checked after publishing |
-| The public GitHub and Hugging Face bundles are publication-clean. | `scripts/validate_publication_package.py`, `metrics/publication_audit.json` | Verified pass | Checks public files, HF bundles, and public-card freshness; ignored local scratch outputs are excluded |
-| The public website has checked local references. | `scripts/validate_website_integrity.py`, `metrics/website_integrity.json` | Verified pass | Checks local links, anchors, JSON data, and referenced images; external URLs are not fetched |
-| The core proof artifacts are indexed and grouped for fast review. | `ARTIFACT_GUIDE.md`, `scripts/build_artifact_index.py`, `metrics/artifact_index.json` | Verified guide and index | Selective source-of-truth catalog, not a complete inventory of every output file |
-| The public reproduction path is documented. | `REPRODUCIBILITY.md`, `metrics/reproducibility_matrix.json`, `notes/reproducibility_audit.md` | Verified documentation and prior exact-match audit | Publicly reproduces the single-episode pipeline, not the gated 32-episode Qwen3-Omni pilot |
-| The project is externally citable and machine-readable. | `CITATION.cff`, `codemeta.json`, `metrics/project_manifest.json`, `LICENSE` | Verified metadata files | Code license does not override original Xperience-10M dataset terms |
-| A first-time reviewer has an explicit audit path. | `metrics/reviewer_packet.json`, website reviewer section, README reviewer path | Verified reviewer packet | It guides inspection; it does not add new experimental claims |
 ## Review Order
-1. Read `metrics/reviewer_packet.json` for the shortest audit path and proof
    boundary.
-2. Read `ARTIFACT_GUIDE.md` and `metrics/artifact_index.json` to see grouped
    reviewer artifacts, indexed proof artifacts,
    sizes, and stable-file hashes.
-3. Read `assets/task_suite_infographic.png` and
-   `metrics/modality_atlas.json` for the high-level map and modality atlas.
-4. Read `REPRODUCIBILITY.md` and `metrics/reproducibility_matrix.json` before
    rerunning the public pipeline.
-5. Inspect `artifacts/episode_task_suite/summary_report.json` for the task and
    metric source of truth.
-6. Inspect `artifacts/episode_task_suite/feature_manifest.json` to see which
    modalities enter the current feature vector.
-7. Inspect `artifacts/episode_task_suite/neural_mlp/` to compare minimal and
    neural heads under the same splits.
-8. Inspect `metrics/scope_claims_audit.json` before interpreting historical
    `32ep` strings in Qwen3-Omni smoke artifacts.
-9. Inspect `metrics/mirror_parity.json` before assuming the GitHub and
    Hugging Face mirrors contain the same critical data, visual, HTML, and
    validator files.
-10. Inspect the companion GitHub repo's
-   `results/omni_finetune/DATA_BLOCKER_REPORT.md` before interpreting any
-   Qwen3-Omni artifact.
-11. Inspect `metrics/publication_audit.json` and
-   `metrics/website_integrity.json` before publishing or sharing the project
-   externally.
 12. Inspect `CITATION.cff`, `codemeta.json`, and `LICENSE` before reusing or
    citing the project.

 | Claim | Current evidence | Status | Boundary |
 | --- | --- | --- | --- |
+| The public Xperience-10M sample has been converted into aligned model windows. | `results/episode_task_suite/windows.csv`, `results/episode_task_suite/shared_windows.npz`, `results/episode_task_suite/summary_report.json` | Verified for 5,821 frames and 1,161 windows | One public sample episode only |
+| The current feature contract is explicit and reviewable. | `results/episode_task_suite/feature_manifest.json`, `results/episode_task_suite/available_modalities.json` | Verified for an 8,378-d feature vector | Audio is present in MP4 streams but not yet a feature block |
+| The public sample modalities are inspectable without raw data redistribution. | `docs/data/modality_atlas.json`, `docs/assets/modalities/`, website modality atlas | Verified derived thumbnail atlas | Thumbnails are presentation/review assets, not a replacement for official raw data access |
+| The 12 task heads are real scripts and artifacts, not presentation placeholders. | `scripts/episode_task_suite.py`, `results/episode_task_suite/*/metrics.json`, `results/episode_task_suite/*/predictions.*` | Verified for all 12 task definitions | Chronological single-episode split, not cross-episode generalization |
+| Minimal and neural heads use the same task contracts. | `scripts/neural_task_models.py`, `results/episode_task_suite/neural_mlp/`, `docs/assets/task_architectures.png` | Verified for 12 minimal heads and 12 neural MLP heads | Small heads only; not a foundation model |
+| Four Ropedia research directions are mapped honestly as direct, proxy, or diagnostic evidence. | `results/episode_task_suite/research_directions/research_direction_taxonomy.json`, `docs/data/research_directions.json` | Verified taxonomy | Some directions remain proxy-only |
+| Four extra direction probes are coded and evaluated. | `results/episode_task_suite/research_direction_extensions/research_direction_extension_results.json`, `docs/data/research_direction_extensions.json` | Verified single-episode probes | Not full human modeling, neural rendering, intent modeling, or world modeling solutions |
+| Qwen3-Omni infrastructure has passed technical smoke checks. | `results/omni_finetune/RUN_REPORT.md`, `results/omni_finetune/dataset_manifest.json`, `results/omni_finetune/metrics_eval.json` | Smoke-only evidence | One episode, 128 train windows; not a 32-episode pilot |
+| The real 32-episode LoRA pilot is blocked on gated data access, not on repo presentation. | `results/omni_finetune/DATA_BLOCKER_REPORT.md`, `results/omni_finetune/A100_HF_RELAY_STATUS.md`, `results/omni_finetune/source_discovery.json` | Blocker documented | No 32-episode metric should be claimed until the gate passes |
+| Historical `32ep` path strings are not treated as 32-episode results. | `scripts/validate_scope_claims.py`, `docs/data/scope_claims_audit.json` | Verified pass | Classifies old run/path identifiers and fails if public presentation claims real 32-episode metrics |
+| Prepared GitHub/Hugging Face mirrors carry matching critical files. | `scripts/validate_mirror_parity.py`, `docs/data/mirror_parity.json` | Verified pass | Compares prepared data files, visual assets, website HTML, and validator scripts before upload; live URLs are checked after publishing |
+| The public GitHub and Hugging Face bundles are publication-clean. | `scripts/validate_publication_package.py`, `docs/data/publication_audit.json` | Verified pass | Checks public files, HF bundles, and public-card freshness; ignored local scratch outputs are excluded |
+| The public website has checked local references. | `scripts/validate_website_integrity.py`, `docs/data/website_integrity.json` | Verified pass | Checks local links, anchors, JSON data, and referenced images; external URLs are not fetched |
+| The release gate is explicit and reviewable. | `QUALITY_GATES.md`, `scripts/build_quality_gates.py`, `docs/data/quality_gates.json` | Verified pass | Summarizes packaging and live-mirror checks; it does not prove cross-episode model quality |
+| The core proof artifacts are indexed and grouped for fast review. | `ARTIFACT_GUIDE.md`, `scripts/build_artifact_index.py`, `docs/data/artifact_index.json` | Verified guide and index | Selective source-of-truth catalog, not a complete inventory of every output file |
+| The public reproduction path is documented. | `REPRODUCIBILITY.md`, `docs/data/reproducibility_matrix.json`, `notes/reproducibility_audit.md` | Verified documentation and prior exact-match audit | Publicly reproduces the single-episode pipeline, not the gated 32-episode Qwen3-Omni pilot |
+| The project is externally citable and machine-readable. | `CITATION.cff`, `codemeta.json`, `docs/data/project_manifest.json`, `LICENSE` | Verified metadata files | Code license does not override original Xperience-10M dataset terms |
+| A first-time reviewer has an explicit audit path. | `docs/data/reviewer_packet.json`, website reviewer section, README reviewer path | Verified reviewer packet | It guides inspection; it does not add new experimental claims |
 ## Review Order
+1. Read `docs/data/reviewer_packet.json` for the shortest audit path and proof
    boundary.
+2. Read `ARTIFACT_GUIDE.md` and `docs/data/artifact_index.json` to see grouped
    reviewer artifacts, indexed proof artifacts,
    sizes, and stable-file hashes.
+3. Read `docs/assets/task_suite_infographic.png` and
+   `docs/data/modality_atlas.json` for the high-level map and modality atlas.
+4. Read `REPRODUCIBILITY.md` and `docs/data/reproducibility_matrix.json` before
    rerunning the public pipeline.
+5. Inspect `results/episode_task_suite/summary_report.json` for the task and
    metric source of truth.
+6. Inspect `results/episode_task_suite/feature_manifest.json` to see which
    modalities enter the current feature vector.
+7. Inspect `results/episode_task_suite/neural_mlp/` to compare minimal and
    neural heads under the same splits.
+8. Inspect `docs/data/scope_claims_audit.json` before interpreting historical
    `32ep` strings in Qwen3-Omni smoke artifacts.
+9. Inspect `docs/data/mirror_parity.json` before assuming the GitHub and
    Hugging Face mirrors contain the same critical data, visual, HTML, and
    validator files.
+10. Inspect `results/omni_finetune/DATA_BLOCKER_REPORT.md` before interpreting
+   any Qwen3-Omni artifact.
+11. Inspect `QUALITY_GATES.md`, `docs/data/quality_gates.json`,
+   `docs/data/publication_audit.json`, and `docs/data/website_integrity.json`
+   before publishing or sharing the project externally.
 12. Inspect `CITATION.cff`, `codemeta.json`, and `LICENSE` before reusing or
    citing the project.

QUALITY_GATES.md ADDED Viewed

	@@ -0,0 +1,41 @@

+# Publication Quality Gates
+This file is the reviewer-facing release checklist for the Ropedia Xperience-10M Task Suite.
+Current gate status: **pass**
+Do not present a release as current unless every automated gate passes, then verify live GitHub/HF mirrors after publishing.
+These gates validate public packaging, claim boundaries, mirror parity, and website integrity. They do not prove cross-episode model quality; the 32-episode Qwen3-Omni pilot remains gated on data access.
+## Automated Gates
+| Gate | Command | Report | Current report status | Blocks publication if |
+| --- | --- | --- | --- | --- |
+| Scope claims guard | `python scripts/validate_scope_claims.py` | `docs/data/scope_claims_audit.json` | `pass` | Historical 32ep smoke/provenance strings are presented as real 32-episode metrics. |
+| Website integrity | `python scripts/validate_website_integrity.py` | `docs/data/website_integrity.json` | `pass` | Local links, anchors, JSON bundles, or referenced image assets are missing or invalid. |
+| Quality-gate manifest | `python scripts/build_quality_gates.py` | `docs/data/quality_gates.json` | `pass` | A public reviewer cannot see the current packaging gates in one place. |
+| Artifact index | `python scripts/build_artifact_index.py` | `docs/data/artifact_index.json` | `pass` | Reviewer-critical evidence files are missing from the indexed proof layer. |
+| Publication hygiene | `python scripts/validate_publication_package.py` | `docs/data/publication_audit.json` | `pass` | Raw data, caches, heavy archives, token strings, missing required assets, or stale public-card figure references enter public bundles. |
+| Prepared mirror parity | `python scripts/validate_mirror_parity.py` | `docs/data/mirror_parity.json` | `pass` | Prepared HF Space, artifact dataset, or model bundle diverges from the repo for critical files. |
+## Post-Publish Checks
+| Check | Evidence | Required result |
+| --- | --- | --- |
+| GitHub Pages deployment | `gh run list --repo ChaoYue0307/ropedia-xperience-10m-task-suite --limit 5` | latest pages-build-deployment run succeeds |
+| Live figure hash parity | `download live GitHub/HF task_suite_infographic.png and compare SHA-256 to docs/assets/task_suite_infographic.png` | all live hashes match the repo asset |
+| Rendered browser smoke | `Browser/Playwright page identity, nonblank render, console health, and one local interaction` | no relevant console warnings/errors and target links work |
+## Rerun Order
+```bash
+python scripts/validate_scope_claims.py
+python scripts/validate_website_integrity.py
+python scripts/build_quality_gates.py
+python scripts/build_artifact_index.py
+python scripts/validate_publication_package.py
+python scripts/validate_mirror_parity.py
+```
+After Hugging Face bundle sync, rerun `validate_publication_package.py` and `validate_mirror_parity.py` once more before upload.

README.md CHANGED Viewed

@@ -91,13 +91,14 @@ Their purpose is to make every input/output contract auditable before scaling to
 | Step | Question | Primary artifacts |
 | --- | --- | --- |
-| 1 | What is actually claimed? | `EVIDENCE_CONTRACT.md`, `ARTIFACT_GUIDE.md`, `metrics/artifact_index.json`, `metrics/mirror_parity.json`, `metrics/scope_claims_audit.json`, `metrics/publication_audit.json`, `metrics/website_integrity.json`, `metrics/project_manifest.json` |
 | 2 | How do I reproduce it? | `REPRODUCIBILITY.md`, `metrics/reproducibility_matrix.json`, companion GitHub `notes/reproducibility_audit.md` |
 | 3 | What is one model input? | `artifacts/episode_task_suite/feature_manifest.json`, `artifacts/episode_task_suite/available_modalities.json`, companion artifact dataset `windows.csv` |
 | 4 | Are the task results backed by files? | `artifacts/episode_task_suite/summary_report.json`, `artifacts/episode_task_suite/neural_mlp/`, `metrics/summary_metrics.json` |
 | 5 | What is still pending? | companion GitHub `results/omni_finetune/DATA_BLOCKER_REPORT.md` and `A100_HF_RELAY_STATUS.md` |
 Human-readable artifact guide mirror: `ARTIFACT_GUIDE.md`.
 Machine-readable reviewer packet mirror: `metrics/reviewer_packet.json`.
 Source-of-truth artifact index mirror: `metrics/artifact_index.json`.
@@ -114,6 +115,7 @@ Source-of-truth artifact index mirror: `metrics/artifact_index.json`.
 | Mirror parity | `metrics/mirror_parity.json` and `scripts/validate_mirror_parity.py` | prepared repo/HF mirrors carry matching critical data, figures, website HTML, and validator files |
 | Publication hygiene | `metrics/publication_audit.json` and validator script mirror | public bundles contain no raw data, generated caches, heavy archives, token strings, or stale public-card figure references |
 | Website integrity | `metrics/website_integrity.json` and validator script mirror | local links, anchors, JSON bundles, and referenced images only |
 | Artifact index | `metrics/artifact_index.json` and `scripts/build_artifact_index.py` | compact catalog of the reviewer-critical proof artifacts |
 | Artifact guide | `ARTIFACT_GUIDE.md` | human-readable map of proof boundary, task evidence, mirrors, and scale-up status |
 | Reproducibility | `REPRODUCIBILITY.md`, `metrics/reproducibility_matrix.json` | public commands, expected outputs, exact-match audit evidence, and non-reproducible boundaries |
@@ -148,6 +150,7 @@ transfers them to H20 for manifest building, training, and evaluation.
 | `metrics/artifact_index.json` | indexes proof artifacts with existence, size, and stable-file hashes |
 | `metrics/mirror_parity.json` | verifies prepared repo/HF mirrors have matching critical data, figures, website HTML, and validator files before upload |
 | `metrics/scope_claims_audit.json` | verifies historical `32ep` smoke-run identifiers are not presented as real 32-episode results |
 | `metrics/publication_audit.json` | records the latest public-bundle hygiene and public-card freshness check |
 | `metrics/website_integrity.json` | records the latest local website link, anchor, JSON, and image integrity check |
 | `metrics/project_manifest.json` | mirrors the public URL and citation metadata bundle |

 | Step | Question | Primary artifacts |
 | --- | --- | --- |
+| 1 | What is actually claimed? | `EVIDENCE_CONTRACT.md`, `ARTIFACT_GUIDE.md`, `QUALITY_GATES.md`, `metrics/artifact_index.json`, `metrics/quality_gates.json`, `metrics/mirror_parity.json`, `metrics/scope_claims_audit.json`, `metrics/publication_audit.json`, `metrics/website_integrity.json`, `metrics/project_manifest.json` |
 | 2 | How do I reproduce it? | `REPRODUCIBILITY.md`, `metrics/reproducibility_matrix.json`, companion GitHub `notes/reproducibility_audit.md` |
 | 3 | What is one model input? | `artifacts/episode_task_suite/feature_manifest.json`, `artifacts/episode_task_suite/available_modalities.json`, companion artifact dataset `windows.csv` |
 | 4 | Are the task results backed by files? | `artifacts/episode_task_suite/summary_report.json`, `artifacts/episode_task_suite/neural_mlp/`, `metrics/summary_metrics.json` |
 | 5 | What is still pending? | companion GitHub `results/omni_finetune/DATA_BLOCKER_REPORT.md` and `A100_HF_RELAY_STATUS.md` |
 Human-readable artifact guide mirror: `ARTIFACT_GUIDE.md`.
+Publication quality gates mirror: `QUALITY_GATES.md` and `metrics/quality_gates.json`.
 Machine-readable reviewer packet mirror: `metrics/reviewer_packet.json`.
 Source-of-truth artifact index mirror: `metrics/artifact_index.json`.
 | Mirror parity | `metrics/mirror_parity.json` and `scripts/validate_mirror_parity.py` | prepared repo/HF mirrors carry matching critical data, figures, website HTML, and validator files |
 | Publication hygiene | `metrics/publication_audit.json` and validator script mirror | public bundles contain no raw data, generated caches, heavy archives, token strings, or stale public-card figure references |
 | Website integrity | `metrics/website_integrity.json` and validator script mirror | local links, anchors, JSON bundles, and referenced images only |
+| Quality gates | `QUALITY_GATES.md`, `metrics/quality_gates.json`, and `scripts/build_quality_gates.py` | automated release gates plus live post-publish checks |
 | Artifact index | `metrics/artifact_index.json` and `scripts/build_artifact_index.py` | compact catalog of the reviewer-critical proof artifacts |
 | Artifact guide | `ARTIFACT_GUIDE.md` | human-readable map of proof boundary, task evidence, mirrors, and scale-up status |
 | Reproducibility | `REPRODUCIBILITY.md`, `metrics/reproducibility_matrix.json` | public commands, expected outputs, exact-match audit evidence, and non-reproducible boundaries |
 | `metrics/artifact_index.json` | indexes proof artifacts with existence, size, and stable-file hashes |
 | `metrics/mirror_parity.json` | verifies prepared repo/HF mirrors have matching critical data, figures, website HTML, and validator files before upload |
 | `metrics/scope_claims_audit.json` | verifies historical `32ep` smoke-run identifiers are not presented as real 32-episode results |
+| `QUALITY_GATES.md`, `metrics/quality_gates.json` | summarizes the automated and post-publish release checks |
 | `metrics/publication_audit.json` | records the latest public-bundle hygiene and public-card freshness check |
 | `metrics/website_integrity.json` | records the latest local website link, anchor, JSON, and image integrity check |
 | `metrics/project_manifest.json` | mirrors the public URL and citation metadata bundle |

metrics/artifact_index.json CHANGED Viewed

@@ -1,12 +1,13 @@
 {
   "title": "Ropedia Xperience-10M Task Suite Artifact Index",
-  "generated_at_utc": "2026-06-01T05:58:15+00:00",
   "status": "pass",
-  "artifact_count": 29,
   "missing": [],
   "by_kind": {
     "claim_boundary": 1,
     "review_path": 3,
     "reproducibility": 2,
     "hygiene_report": 1,
     "scope_guard": 1,
@@ -35,8 +36,8 @@
       "surface": "repo",
       "proves": "Defines what is verified, what is smoke-only, and what must not be inferred.",
       "exists": true,
-      "bytes": 6520,
-      "sha256": "d6b8d74a53b49778d38bff6f6857f79d481d451f938c6a4177a50374f541d219"
     },
     {
       "id": "reviewer_packet",
@@ -57,8 +58,30 @@
       "surface": "repo_hf",
       "proves": "Gives the human-readable map from proof boundary to data, tasks, platform mirrors, and scale-up status.",
       "exists": true,
-      "bytes": 6520,
-      "sha256": "0a8740e19d56c9c7e1c3964d3abf838a8e33af140128a4fb95a69bdca0b45173"
     },
     {
       "id": "reproducibility_contract",
@@ -90,8 +113,8 @@
       "surface": "repo_hf",
       "proves": "Generates the selective proof-artifact catalog from local files.",
       "exists": true,
-      "bytes": 11579,
-      "sha256": "874a3813fb3a19d79be9ea4c0177f5922adf9e667760f927dd49163784eb6b48"
     },
     {
       "id": "publication_audit",
@@ -102,7 +125,7 @@
       "volatile": true,
       "proves": "Confirms public bundles pass raw-data, cache, archive, and token-string checks.",
       "exists": true,
-      "bytes": 5292,
       "hash_policy": "existence_and_size_only"
     },
     {
@@ -126,7 +149,7 @@
       "volatile": true,
       "proves": "Confirms prepared GitHub/HF Space/artifact/model mirrors share the same critical data, figure, website HTML, and validator files.",
       "exists": true,
-      "bytes": 42567,
       "hash_policy": "existence_and_size_only"
     },
     {
@@ -138,7 +161,7 @@
       "volatile": true,
       "proves": "Confirms local website links, anchors, JSON data files, and referenced images resolve.",
       "exists": true,
-      "bytes": 5936,
       "hash_policy": "existence_and_size_only"
     },
     {

 {
   "title": "Ropedia Xperience-10M Task Suite Artifact Index",
+  "generated_at_utc": "2026-06-01T06:24:20+00:00",
   "status": "pass",
+  "artifact_count": 31,
   "missing": [],
   "by_kind": {
     "claim_boundary": 1,
     "review_path": 3,
+    "quality_gate": 2,
     "reproducibility": 2,
     "hygiene_report": 1,
     "scope_guard": 1,
       "surface": "repo",
       "proves": "Defines what is verified, what is smoke-only, and what must not be inferred.",
       "exists": true,
+      "bytes": 6818,
+      "sha256": "a6d184b6bab5c0bad50e85f9b899a3c9f90741660130c20843bbf53c17d44713"
     },
     {
       "id": "reviewer_packet",
       "surface": "repo_hf",
       "proves": "Gives the human-readable map from proof boundary to data, tasks, platform mirrors, and scale-up status.",
       "exists": true,
+      "bytes": 6807,
+      "sha256": "208a173c7805e6dc61c7c243c24fb69de93af4883de13fa51b451b02f374e847"
+    },
+    {
+      "id": "quality_gates",
+      "title": "Publication quality gates",
+      "path": "QUALITY_GATES.md",
+      "kind": "quality_gate",
+      "surface": "repo_hf",
+      "proves": "Lists the automated and post-publish gates required before presenting a release as current.",
+      "exists": true,
+      "bytes": 2865,
+      "sha256": "f3482dbc310d2ade60aa2b480211a9ee0cad1c814779a8b1d63d96432222897a"
+    },
+    {
+      "id": "quality_gate_manifest",
+      "title": "Quality-gate manifest",
+      "path": "docs/data/quality_gates.json",
+      "kind": "quality_gate",
+      "surface": "website_hf",
+      "proves": "Machine-readable release-gate summary for validators, mirrors, and reviewer surfaces.",
+      "exists": true,
+      "bytes": 4222,
+      "sha256": "274dd753853ea843b5413bbce68b371e4a664853924c9745a4163c1b68a54cf9"
     },
     {
       "id": "reproducibility_contract",
       "surface": "repo_hf",
       "proves": "Generates the selective proof-artifact catalog from local files.",
       "exists": true,
+      "bytes": 12194,
+      "sha256": "04083feaa7cd486e94fa4f313b54b5b04b588edcb1376234a7b279060e0b4058"
     },
     {
       "id": "publication_audit",
       "volatile": true,
       "proves": "Confirms public bundles pass raw-data, cache, archive, and token-string checks.",
       "exists": true,
+      "bytes": 5408,
       "hash_policy": "existence_and_size_only"
     },
     {
       "volatile": true,
       "proves": "Confirms prepared GitHub/HF Space/artifact/model mirrors share the same critical data, figure, website HTML, and validator files.",
       "exists": true,
+      "bytes": 46406,
       "hash_policy": "existence_and_size_only"
     },
     {
       "volatile": true,
       "proves": "Confirms local website links, anchors, JSON data files, and referenced images resolve.",
       "exists": true,
+      "bytes": 6042,
       "hash_policy": "existence_and_size_only"
     },
     {

metrics/evidence_contract.json CHANGED Viewed

@@ -138,6 +138,17 @@
       ],
       "boundary": "checks local links, anchors, JSON data, and referenced images; external URLs are not fetched"
     },
     {
       "id": "citation_metadata",
       "claim": "The project is externally citable and machine-readable.",

       ],
       "boundary": "checks local links, anchors, JSON data, and referenced images; external URLs are not fetched"
     },
+    {
+      "id": "quality_gates",
+      "claim": "The release gate is explicit and reviewable.",
+      "status": "verified",
+      "evidence": [
+        "QUALITY_GATES.md",
+        "scripts/build_quality_gates.py",
+        "docs/data/quality_gates.json"
+      ],
+      "boundary": "summarizes packaging and live-mirror checks; it does not prove cross-episode model quality"
+    },
     {
       "id": "citation_metadata",
       "claim": "The project is externally citable and machine-readable.",

metrics/mirror_parity.json CHANGED Viewed

@@ -1,9 +1,9 @@
 {
   "status": "pass",
-  "generated_at_utc": "2026-06-01T05:59:09+00:00",
   "hf_root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish",
   "summary": {
-    "group_count": 29,
     "failure_count": 0,
     "failures_by_surface": {}
   },
@@ -23,6 +23,10 @@
     {
       "name": "repo_hf_website_html_parity",
       "status": "pass"
     }
   ],
   "groups": [
@@ -32,27 +36,27 @@
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/artifact_index.json",
         "exists": true,
-        "bytes": 12916,
-        "sha256": "0d12a2e36db90b4c6ae4205aab074e6c6b00083fa6c4cc18ea51342f5f1a05df"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/artifact_index.json",
           "exists": true,
-          "bytes": 12916,
-          "sha256": "0d12a2e36db90b4c6ae4205aab074e6c6b00083fa6c4cc18ea51342f5f1a05df"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/artifact_index.json",
           "exists": true,
-          "bytes": 12916,
-          "sha256": "0d12a2e36db90b4c6ae4205aab074e6c6b00083fa6c4cc18ea51342f5f1a05df"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/artifact_index.json",
           "exists": true,
-          "bytes": 12916,
-          "sha256": "0d12a2e36db90b4c6ae4205aab074e6c6b00083fa6c4cc18ea51342f5f1a05df"
         }
       },
       "failures": []
@@ -63,27 +67,27 @@
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/evidence_contract.json",
         "exists": true,
-        "bytes": 7205,
-        "sha256": "78f2a0db49dc00f52f79dc4d3448d01a47bd28019458bd61000304e119a018e8"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/evidence_contract.json",
           "exists": true,
-          "bytes": 7205,
-          "sha256": "78f2a0db49dc00f52f79dc4d3448d01a47bd28019458bd61000304e119a018e8"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/evidence_contract.json",
           "exists": true,
-          "bytes": 7205,
-          "sha256": "78f2a0db49dc00f52f79dc4d3448d01a47bd28019458bd61000304e119a018e8"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/evidence_contract.json",
           "exists": true,
-          "bytes": 7205,
-          "sha256": "78f2a0db49dc00f52f79dc4d3448d01a47bd28019458bd61000304e119a018e8"
         }
       },
       "failures": []
@@ -156,27 +160,58 @@
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/publication_audit.json",
         "exists": true,
-        "bytes": 5292,
-        "sha256": "c273917a673c41fed0b1498ffef2d1e64ebd9691e7b870cbcd15b194785397ed"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/publication_audit.json",
           "exists": true,
-          "bytes": 5292,
-          "sha256": "c273917a673c41fed0b1498ffef2d1e64ebd9691e7b870cbcd15b194785397ed"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/publication_audit.json",
           "exists": true,
-          "bytes": 5292,
-          "sha256": "c273917a673c41fed0b1498ffef2d1e64ebd9691e7b870cbcd15b194785397ed"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/publication_audit.json",
           "exists": true,
-          "bytes": 5292,
-          "sha256": "c273917a673c41fed0b1498ffef2d1e64ebd9691e7b870cbcd15b194785397ed"
         }
       },
       "failures": []
@@ -312,26 +347,26 @@
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/scope_claims_audit.json",
         "exists": true,
         "bytes": 19964,
-        "sha256": "a74d89124ddbecd1af7a6054fcaa0d905c639dec56a9fd455140817220fd91a1"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/scope_claims_audit.json",
           "exists": true,
           "bytes": 19964,
-          "sha256": "a74d89124ddbecd1af7a6054fcaa0d905c639dec56a9fd455140817220fd91a1"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/scope_claims_audit.json",
           "exists": true,
           "bytes": 19964,
-          "sha256": "a74d89124ddbecd1af7a6054fcaa0d905c639dec56a9fd455140817220fd91a1"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/scope_claims_audit.json",
           "exists": true,
           "bytes": 19964,
-          "sha256": "a74d89124ddbecd1af7a6054fcaa0d905c639dec56a9fd455140817220fd91a1"
         }
       },
       "failures": []
@@ -404,27 +439,27 @@
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/website_integrity.json",
         "exists": true,
-        "bytes": 5936,
-        "sha256": "0b9a1d8d3bbf953f86aed99fe22882dbee4ed2813f2cae91cbf15a2002752cc2"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/website_integrity.json",
           "exists": true,
-          "bytes": 5936,
-          "sha256": "0b9a1d8d3bbf953f86aed99fe22882dbee4ed2813f2cae91cbf15a2002752cc2"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/website_integrity.json",
           "exists": true,
-          "bytes": 5936,
-          "sha256": "0b9a1d8d3bbf953f86aed99fe22882dbee4ed2813f2cae91cbf15a2002752cc2"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/website_integrity.json",
           "exists": true,
-          "bytes": 5936,
-          "sha256": "0b9a1d8d3bbf953f86aed99fe22882dbee4ed2813f2cae91cbf15a2002752cc2"
         }
       },
       "failures": []
@@ -805,21 +840,46 @@
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/scripts/build_artifact_index.py",
         "exists": true,
-        "bytes": 11579,
-        "sha256": "874a3813fb3a19d79be9ea4c0177f5922adf9e667760f927dd49163784eb6b48"
       },
       "mirrors": {
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/scripts/build_artifact_index.py",
           "exists": true,
-          "bytes": 11579,
-          "sha256": "874a3813fb3a19d79be9ea4c0177f5922adf9e667760f927dd49163784eb6b48"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/scripts/build_artifact_index.py",
           "exists": true,
-          "bytes": 11579,
-          "sha256": "874a3813fb3a19d79be9ea4c0177f5922adf9e667760f927dd49163784eb6b48"
         }
       },
       "failures": []
@@ -830,21 +890,21 @@
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/scripts/validate_mirror_parity.py",
         "exists": true,
-        "bytes": 7617,
-        "sha256": "0a74954e50fbf7bff661c9499244fc9be704764b701431fc2035ab4cc29d43d0"
       },
       "mirrors": {
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/scripts/validate_mirror_parity.py",
           "exists": true,
-          "bytes": 7617,
-          "sha256": "0a74954e50fbf7bff661c9499244fc9be704764b701431fc2035ab4cc29d43d0"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/scripts/validate_mirror_parity.py",
           "exists": true,
-          "bytes": 7617,
-          "sha256": "0a74954e50fbf7bff661c9499244fc9be704764b701431fc2035ab4cc29d43d0"
         }
       },
       "failures": []
@@ -855,21 +915,21 @@
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/scripts/validate_publication_package.py",
         "exists": true,
-        "bytes": 12444,
-        "sha256": "f8fc86b66a1fde0755004897dd307eb5c80f84bdaf917158b43c423ff6e7e9e7"
       },
       "mirrors": {
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/scripts/validate_publication_package.py",
           "exists": true,
-          "bytes": 12444,
-          "sha256": "f8fc86b66a1fde0755004897dd307eb5c80f84bdaf917158b43c423ff6e7e9e7"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/scripts/validate_publication_package.py",
           "exists": true,
-          "bytes": 12444,
-          "sha256": "f8fc86b66a1fde0755004897dd307eb5c80f84bdaf917158b43c423ff6e7e9e7"
         }
       },
       "failures": []
@@ -930,21 +990,52 @@
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/index.html",
         "exists": true,
-        "bytes": 89772,
-        "sha256": "3544638ab8dc809e126f347d942b4f7303674edd79858cd039a5b18b95500fcb"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/index.html",
           "exists": true,
-          "bytes": 89772,
-          "sha256": "3544638ab8dc809e126f347d942b4f7303674edd79858cd039a5b18b95500fcb"
         },
         "hf_artifacts_docs": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/index.html",
           "exists": true,
-          "bytes": 89772,
-          "sha256": "3544638ab8dc809e126f347d942b4f7303674edd79858cd039a5b18b95500fcb"
         }
       },
       "failures": []

 {
   "status": "pass",
+  "generated_at_utc": "2026-06-01T06:25:42+00:00",
   "hf_root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish",
   "summary": {
+    "group_count": 32,
     "failure_count": 0,
     "failures_by_surface": {}
   },
     {
       "name": "repo_hf_website_html_parity",
       "status": "pass"
+    },
+    {
+      "name": "repo_hf_quality_doc_parity",
+      "status": "pass"
     }
   ],
   "groups": [
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/artifact_index.json",
         "exists": true,
+        "bytes": 13782,
+        "sha256": "499a9373836244474fe0db51f38d9ecb2211ae36a22e76a0ae4c323b0d45e05a"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/artifact_index.json",
           "exists": true,
+          "bytes": 13782,
+          "sha256": "499a9373836244474fe0db51f38d9ecb2211ae36a22e76a0ae4c323b0d45e05a"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/artifact_index.json",
           "exists": true,
+          "bytes": 13782,
+          "sha256": "499a9373836244474fe0db51f38d9ecb2211ae36a22e76a0ae4c323b0d45e05a"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/artifact_index.json",
           "exists": true,
+          "bytes": 13782,
+          "sha256": "499a9373836244474fe0db51f38d9ecb2211ae36a22e76a0ae4c323b0d45e05a"
         }
       },
       "failures": []
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/evidence_contract.json",
         "exists": true,
+        "bytes": 7587,
+        "sha256": "bb9172140a526b78523cbc5507ed6340bd7b439a51fd68ad9f4728fee721a766"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/evidence_contract.json",
           "exists": true,
+          "bytes": 7587,
+          "sha256": "bb9172140a526b78523cbc5507ed6340bd7b439a51fd68ad9f4728fee721a766"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/evidence_contract.json",
           "exists": true,
+          "bytes": 7587,
+          "sha256": "bb9172140a526b78523cbc5507ed6340bd7b439a51fd68ad9f4728fee721a766"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/evidence_contract.json",
           "exists": true,
+          "bytes": 7587,
+          "sha256": "bb9172140a526b78523cbc5507ed6340bd7b439a51fd68ad9f4728fee721a766"
         }
       },
       "failures": []
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/publication_audit.json",
         "exists": true,
+        "bytes": 5408,
+        "sha256": "9d59186e18321215c55c8cddbf518b5d19fb19428f119a541c3b50ae54c4af4f"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/publication_audit.json",
           "exists": true,
+          "bytes": 5408,
+          "sha256": "9d59186e18321215c55c8cddbf518b5d19fb19428f119a541c3b50ae54c4af4f"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/publication_audit.json",
           "exists": true,
+          "bytes": 5408,
+          "sha256": "9d59186e18321215c55c8cddbf518b5d19fb19428f119a541c3b50ae54c4af4f"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/publication_audit.json",
           "exists": true,
+          "bytes": 5408,
+          "sha256": "9d59186e18321215c55c8cddbf518b5d19fb19428f119a541c3b50ae54c4af4f"
+        }
+      },
+      "failures": []
+    },
+    {
+      "name": "data/quality_gates.json",
+      "status": "pass",
+      "local": {
+        "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/quality_gates.json",
+        "exists": true,
+        "bytes": 4222,
+        "sha256": "274dd753853ea843b5413bbce68b371e4a664853924c9745a4163c1b68a54cf9"
+      },
+      "mirrors": {
+        "hf_space": {
+          "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/quality_gates.json",
+          "exists": true,
+          "bytes": 4222,
+          "sha256": "274dd753853ea843b5413bbce68b371e4a664853924c9745a4163c1b68a54cf9"
+        },
+        "hf_artifacts": {
+          "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/quality_gates.json",
+          "exists": true,
+          "bytes": 4222,
+          "sha256": "274dd753853ea843b5413bbce68b371e4a664853924c9745a4163c1b68a54cf9"
+        },
+        "hf_model": {
+          "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/quality_gates.json",
+          "exists": true,
+          "bytes": 4222,
+          "sha256": "274dd753853ea843b5413bbce68b371e4a664853924c9745a4163c1b68a54cf9"
         }
       },
       "failures": []
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/scope_claims_audit.json",
         "exists": true,
         "bytes": 19964,
+        "sha256": "83ed49035d1ca96dc28351f9d76f9249319d59d9647f0c58cffe0243d5687f9c"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/scope_claims_audit.json",
           "exists": true,
           "bytes": 19964,
+          "sha256": "83ed49035d1ca96dc28351f9d76f9249319d59d9647f0c58cffe0243d5687f9c"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/scope_claims_audit.json",
           "exists": true,
           "bytes": 19964,
+          "sha256": "83ed49035d1ca96dc28351f9d76f9249319d59d9647f0c58cffe0243d5687f9c"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/scope_claims_audit.json",
           "exists": true,
           "bytes": 19964,
+          "sha256": "83ed49035d1ca96dc28351f9d76f9249319d59d9647f0c58cffe0243d5687f9c"
         }
       },
       "failures": []
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/data/website_integrity.json",
         "exists": true,
+        "bytes": 6042,
+        "sha256": "b4b98a25ca5095c92f84cd3d945ce4f89228fb4d4f7812922f86007a4095ba20"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/data/website_integrity.json",
           "exists": true,
+          "bytes": 6042,
+          "sha256": "b4b98a25ca5095c92f84cd3d945ce4f89228fb4d4f7812922f86007a4095ba20"
         },
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/data/website_integrity.json",
           "exists": true,
+          "bytes": 6042,
+          "sha256": "b4b98a25ca5095c92f84cd3d945ce4f89228fb4d4f7812922f86007a4095ba20"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/metrics/website_integrity.json",
           "exists": true,
+          "bytes": 6042,
+          "sha256": "b4b98a25ca5095c92f84cd3d945ce4f89228fb4d4f7812922f86007a4095ba20"
         }
       },
       "failures": []
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/scripts/build_artifact_index.py",
         "exists": true,
+        "bytes": 12194,
+        "sha256": "04083feaa7cd486e94fa4f313b54b5b04b588edcb1376234a7b279060e0b4058"
       },
       "mirrors": {
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/scripts/build_artifact_index.py",
           "exists": true,
+          "bytes": 12194,
+          "sha256": "04083feaa7cd486e94fa4f313b54b5b04b588edcb1376234a7b279060e0b4058"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/scripts/build_artifact_index.py",
           "exists": true,
+          "bytes": 12194,
+          "sha256": "04083feaa7cd486e94fa4f313b54b5b04b588edcb1376234a7b279060e0b4058"
+        }
+      },
+      "failures": []
+    },
+    {
+      "name": "scripts/build_quality_gates.py",
+      "status": "pass",
+      "local": {
+        "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/scripts/build_quality_gates.py",
+        "exists": true,
+        "bytes": 7757,
+        "sha256": "e38c9b27836d694a4cd6ff03de1b10d20347bc7f7bb176bdc7e7d5ba3cbe7fba"
+      },
+      "mirrors": {
+        "hf_artifacts": {
+          "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/scripts/build_quality_gates.py",
+          "exists": true,
+          "bytes": 7757,
+          "sha256": "e38c9b27836d694a4cd6ff03de1b10d20347bc7f7bb176bdc7e7d5ba3cbe7fba"
+        },
+        "hf_model": {
+          "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/scripts/build_quality_gates.py",
+          "exists": true,
+          "bytes": 7757,
+          "sha256": "e38c9b27836d694a4cd6ff03de1b10d20347bc7f7bb176bdc7e7d5ba3cbe7fba"
         }
       },
       "failures": []
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/scripts/validate_mirror_parity.py",
         "exists": true,
+        "bytes": 8353,
+        "sha256": "9673da079fdeb780b6d0767591ffb77d074f4958e511a4384fb9bd9a735af2ca"
       },
       "mirrors": {
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/scripts/validate_mirror_parity.py",
           "exists": true,
+          "bytes": 8353,
+          "sha256": "9673da079fdeb780b6d0767591ffb77d074f4958e511a4384fb9bd9a735af2ca"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/scripts/validate_mirror_parity.py",
           "exists": true,
+          "bytes": 8353,
+          "sha256": "9673da079fdeb780b6d0767591ffb77d074f4958e511a4384fb9bd9a735af2ca"
         }
       },
       "failures": []
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/scripts/validate_publication_package.py",
         "exists": true,
+        "bytes": 12554,
+        "sha256": "0546124d5319fc5cc96881090049e5fcda301e6726f5e42b31141e599ab81711"
       },
       "mirrors": {
         "hf_artifacts": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/scripts/validate_publication_package.py",
           "exists": true,
+          "bytes": 12554,
+          "sha256": "0546124d5319fc5cc96881090049e5fcda301e6726f5e42b31141e599ab81711"
         },
         "hf_model": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/scripts/validate_publication_package.py",
           "exists": true,
+          "bytes": 12554,
+          "sha256": "0546124d5319fc5cc96881090049e5fcda301e6726f5e42b31141e599ab81711"
         }
       },
       "failures": []
       "local": {
         "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs/index.html",
         "exists": true,
+        "bytes": 90729,
+        "sha256": "d1f35486ca171b6b6fcd3b4fd43263a54e4ff8e43379a20b57f6b62896e51fe8"
       },
       "mirrors": {
         "hf_space": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/index.html",
           "exists": true,
+          "bytes": 90729,
+          "sha256": "d1f35486ca171b6b6fcd3b4fd43263a54e4ff8e43379a20b57f6b62896e51fe8"
         },
         "hf_artifacts_docs": {
           "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/docs/index.html",
           "exists": true,
+          "bytes": 90729,
+          "sha256": "d1f35486ca171b6b6fcd3b4fd43263a54e4ff8e43379a20b57f6b62896e51fe8"
+        }
+      },
+      "failures": []
+    },
+    {
+      "name": "docs/QUALITY_GATES.md",
+      "status": "pass",
+      "local": {
+        "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/QUALITY_GATES.md",
+        "exists": true,
+        "bytes": 2865,
+        "sha256": "f3482dbc310d2ade60aa2b480211a9ee0cad1c814779a8b1d63d96432222897a"
+      },
+      "mirrors": {
+        "hf_space": {
+          "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space/QUALITY_GATES.md",
+          "exists": true,
+          "bytes": 2865,
+          "sha256": "f3482dbc310d2ade60aa2b480211a9ee0cad1c814779a8b1d63d96432222897a"
+        },
+        "hf_artifacts": {
+          "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts/QUALITY_GATES.md",
+          "exists": true,
+          "bytes": 2865,
+          "sha256": "f3482dbc310d2ade60aa2b480211a9ee0cad1c814779a8b1d63d96432222897a"
+        },
+        "hf_model": {
+          "path": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model/QUALITY_GATES.md",
+          "exists": true,
+          "bytes": 2865,
+          "sha256": "f3482dbc310d2ade60aa2b480211a9ee0cad1c814779a8b1d63d96432222897a"
         }
       },
       "failures": []

metrics/publication_audit.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "status": "pass",
-  "generated_at_utc": "2026-06-01T05:58:39+00:00",
   "checks": [
     {
       "name": "required_publication_assets_present",
@@ -44,6 +44,7 @@
     "LICENSE": true,
     "codemeta.json": true,
     "ARTIFACT_GUIDE.md": true,
     "REPRODUCIBILITY.md": true,
     "EVIDENCE_CONTRACT.md": true,
     "DATA_NOTICE.md": true,
@@ -54,6 +55,7 @@
     "docs/sitemap.xml": true,
     "docs/data/evidence_contract.json": true,
     "docs/data/artifact_index.json": true,
     "docs/data/project_manifest.json": true,
     "docs/data/reviewer_packet.json": true,
     "docs/data/reproducibility_matrix.json": true,
@@ -80,6 +82,7 @@
     "scripts/episode_task_suite.py": true,
     "scripts/neural_task_models.py": true,
     "scripts/build_artifact_index.py": true,
     "scripts/validate_mirror_parity.py": true,
     "scripts/validate_scope_claims.py": true,
     "scripts/validate_website_integrity.py": true,
@@ -131,8 +134,8 @@
     "github_repo": {
       "root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy",
       "exists": true,
-      "file_count": 286,
-      "text_file_count": 231,
       "largest_file": {
         "path": "results/episode_task_suite/modality_reconstruction/predictions.npz",
         "bytes": 52601010
@@ -142,8 +145,8 @@
     "hf_space_bundle": {
       "root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space",
       "exists": true,
-      "file_count": 51,
-      "text_file_count": 38,
       "largest_file": {
         "path": "assets/task_suite_infographic.png",
         "bytes": 2322389
@@ -153,8 +156,8 @@
     "hf_artifact_bundle": {
       "root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts",
       "exists": true,
-      "file_count": 266,
-      "text_file_count": 224,
       "largest_file": {
         "path": "results/episode_task_suite/neural_mlp/temporal_order/model.pt",
         "bytes": 13406129
@@ -164,8 +167,8 @@
     "hf_model_bundle": {
       "root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model",
       "exists": true,
-      "file_count": 198,
-      "text_file_count": 155,
       "largest_file": {
         "path": "artifacts/episode_task_suite/cross_modal_retrieval/model.npz",
         "bytes": 41310574

 {
   "status": "pass",
+  "generated_at_utc": "2026-06-01T06:25:11+00:00",
   "checks": [
     {
       "name": "required_publication_assets_present",
     "LICENSE": true,
     "codemeta.json": true,
     "ARTIFACT_GUIDE.md": true,
+    "QUALITY_GATES.md": true,
     "REPRODUCIBILITY.md": true,
     "EVIDENCE_CONTRACT.md": true,
     "DATA_NOTICE.md": true,
     "docs/sitemap.xml": true,
     "docs/data/evidence_contract.json": true,
     "docs/data/artifact_index.json": true,
+    "docs/data/quality_gates.json": true,
     "docs/data/project_manifest.json": true,
     "docs/data/reviewer_packet.json": true,
     "docs/data/reproducibility_matrix.json": true,
     "scripts/episode_task_suite.py": true,
     "scripts/neural_task_models.py": true,
     "scripts/build_artifact_index.py": true,
+    "scripts/build_quality_gates.py": true,
     "scripts/validate_mirror_parity.py": true,
     "scripts/validate_scope_claims.py": true,
     "scripts/validate_website_integrity.py": true,
     "github_repo": {
       "root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy",
       "exists": true,
+      "file_count": 289,
+      "text_file_count": 234,
       "largest_file": {
         "path": "results/episode_task_suite/modality_reconstruction/predictions.npz",
         "bytes": 52601010
     "hf_space_bundle": {
       "root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/space",
       "exists": true,
+      "file_count": 53,
+      "text_file_count": 40,
       "largest_file": {
         "path": "assets/task_suite_infographic.png",
         "bytes": 2322389
     "hf_artifact_bundle": {
       "root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/artifacts",
       "exists": true,
+      "file_count": 269,
+      "text_file_count": 227,
       "largest_file": {
         "path": "results/episode_task_suite/neural_mlp/temporal_order/model.pt",
         "bytes": 13406129
     "hf_model_bundle": {
       "root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/hf_publish/model",
       "exists": true,
+      "file_count": 201,
+      "text_file_count": 158,
       "largest_file": {
         "path": "artifacts/episode_task_suite/cross_modal_retrieval/model.npz",
         "bytes": 41310574

metrics/quality_gates.json ADDED Viewed

	@@ -0,0 +1,101 @@

+{
+  "title": "Ropedia Xperience-10M Publication Quality Gates",
+  "status": "pass",
+  "generated_at_utc": "2026-06-01T06:24:04+00:00",
+  "rule": "Do not present a release as current unless every automated gate passes, then verify live GitHub/HF mirrors after publishing.",
+  "automated_gates": [
+    {
+      "id": "scope_claims",
+      "title": "Scope claims guard",
+      "command": "python scripts/validate_scope_claims.py",
+      "report": "docs/data/scope_claims_audit.json",
+      "blocks_if": "Historical 32ep smoke/provenance strings are presented as real 32-episode metrics.",
+      "proves": "The public narrative does not overclaim the Qwen3-Omni smoke artifacts.",
+      "current_report": {
+        "exists": true,
+        "status": "pass"
+      }
+    },
+    {
+      "id": "website_integrity",
+      "title": "Website integrity",
+      "command": "python scripts/validate_website_integrity.py",
+      "report": "docs/data/website_integrity.json",
+      "blocks_if": "Local links, anchors, JSON bundles, or referenced image assets are missing or invalid.",
+      "proves": "The GitHub Pages / HF static surface is internally coherent before upload.",
+      "current_report": {
+        "exists": true,
+        "status": "pass"
+      }
+    },
+    {
+      "id": "quality_gate_manifest",
+      "title": "Quality-gate manifest",
+      "command": "python scripts/build_quality_gates.py",
+      "report": "docs/data/quality_gates.json",
+      "blocks_if": "A public reviewer cannot see the current packaging gates in one place.",
+      "proves": "The publication checklist is explicit, versioned, and mirrored with the repo.",
+      "current_report": {
+        "exists": true,
+        "status": "pass"
+      }
+    },
+    {
+      "id": "artifact_index",
+      "title": "Artifact index",
+      "command": "python scripts/build_artifact_index.py",
+      "report": "docs/data/artifact_index.json",
+      "blocks_if": "Reviewer-critical evidence files are missing from the indexed proof layer.",
+      "proves": "Core proof artifacts exist and stable files have SHA-256 hashes.",
+      "current_report": {
+        "exists": true,
+        "status": "pass"
+      }
+    },
+    {
+      "id": "publication_hygiene",
+      "title": "Publication hygiene",
+      "command": "python scripts/validate_publication_package.py",
+      "report": "docs/data/publication_audit.json",
+      "blocks_if": "Raw data, caches, heavy archives, token strings, missing required assets, or stale public-card figure references enter public bundles.",
+      "proves": "The repo and prepared HF bundles are clean enough to publish.",
+      "current_report": {
+        "exists": true,
+        "status": "pass"
+      }
+    },
+    {
+      "id": "mirror_parity",
+      "title": "Prepared mirror parity",
+      "command": "python scripts/validate_mirror_parity.py",
+      "report": "docs/data/mirror_parity.json",
+      "blocks_if": "Prepared HF Space, artifact dataset, or model bundle diverges from the repo for critical files.",
+      "proves": "The files staged for GitHub and Hugging Face are synchronized before upload.",
+      "current_report": {
+        "exists": true,
+        "status": "pass"
+      }
+    }
+  ],
+  "post_publish_checks": [
+    {
+      "id": "github_pages_deploy",
+      "title": "GitHub Pages deployment",
+      "evidence": "gh run list --repo ChaoYue0307/ropedia-xperience-10m-task-suite --limit 5",
+      "required_result": "latest pages-build-deployment run succeeds"
+    },
+    {
+      "id": "live_figure_hash_parity",
+      "title": "Live figure hash parity",
+      "evidence": "download live GitHub/HF task_suite_infographic.png and compare SHA-256 to docs/assets/task_suite_infographic.png",
+      "required_result": "all live hashes match the repo asset"
+    },
+    {
+      "id": "rendered_browser_smoke",
+      "title": "Rendered browser smoke",
+      "evidence": "Browser/Playwright page identity, nonblank render, console health, and one local interaction",
+      "required_result": "no relevant console warnings/errors and target links work"
+    }
+  ],
+  "scope_boundary": "These gates validate public packaging, claim boundaries, mirror parity, and website integrity. They do not prove cross-episode model quality."
+}

metrics/scope_claims_audit.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "status": "pass",
-  "generated_at_utc": "2026-06-01T05:46:47+00:00",
   "summary": {
     "qwen3_omni_32_episode_claim": false,
     "dataset_manifest_num_episodes": 1,

 {
   "status": "pass",
+  "generated_at_utc": "2026-06-01T06:18:18+00:00",
   "summary": {
     "qwen3_omni_32_episode_claim": false,
     "dataset_manifest_num_episodes": 1,

metrics/website_integrity.json CHANGED Viewed

@@ -1,13 +1,13 @@
 {
   "status": "pass",
-  "generated_at_utc": "2026-06-01T05:57:48+00:00",
   "docs_root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs",
   "site_base": "/ropedia-xperience-10m-task-suite/",
   "summary": {
     "html_pages": 2,
-    "local_references": 57,
-    "external_reference_count": 56,
-    "json_files": 14,
     "image_assets_referenced": 18,
     "failure_count": 0
   },
@@ -44,7 +44,7 @@
     {
       "path": "index.html",
       "id_count": 31,
-      "reference_count": 56,
       "image_count": 20
     }
   ],
@@ -56,7 +56,7 @@
     },
     {
       "path": "data/evidence_contract.json",
-      "bytes": 7205,
       "top_level_type": "dict"
     },
     {
@@ -79,6 +79,11 @@
       "bytes": 5292,
       "top_level_type": "dict"
     },
     {
       "path": "data/reproducibility_matrix.json",
       "bytes": 4033,

 {
   "status": "pass",
+  "generated_at_utc": "2026-06-01T06:18:19+00:00",
   "docs_root": "/Users/chaoyue/Documents/Codex/2026-05-29/i-am-learning-this-dataset-https/working_repo_copy/docs",
   "site_base": "/ropedia-xperience-10m-task-suite/",
   "summary": {
     "html_pages": 2,
+    "local_references": 59,
+    "external_reference_count": 58,
+    "json_files": 15,
     "image_assets_referenced": 18,
     "failure_count": 0
   },
     {
       "path": "index.html",
       "id_count": 31,
+      "reference_count": 58,
       "image_count": 20
     }
   ],
     },
     {
       "path": "data/evidence_contract.json",
+      "bytes": 7587,
       "top_level_type": "dict"
     },
     {
       "bytes": 5292,
       "top_level_type": "dict"
     },
+    {
+      "path": "data/quality_gates.json",
+      "bytes": 4564,
+      "top_level_type": "dict"
+    },
     {
       "path": "data/reproducibility_matrix.json",
       "bytes": 4033,

scripts/build_artifact_index.py CHANGED Viewed

@@ -41,6 +41,22 @@ ARTIFACTS = [
         "surface": "repo_hf",
         "proves": "Gives the human-readable map from proof boundary to data, tasks, platform mirrors, and scale-up status.",
     },
     {
         "id": "reproducibility_contract",
         "title": "Reproducibility contract",

         "surface": "repo_hf",
         "proves": "Gives the human-readable map from proof boundary to data, tasks, platform mirrors, and scale-up status.",
     },
+    {
+        "id": "quality_gates",
+        "title": "Publication quality gates",
+        "path": "QUALITY_GATES.md",
+        "kind": "quality_gate",
+        "surface": "repo_hf",
+        "proves": "Lists the automated and post-publish gates required before presenting a release as current.",
+    },
+    {
+        "id": "quality_gate_manifest",
+        "title": "Quality-gate manifest",
+        "path": "docs/data/quality_gates.json",
+        "kind": "quality_gate",
+        "surface": "website_hf",
+        "proves": "Machine-readable release-gate summary for validators, mirrors, and reviewer surfaces.",
+    },
     {
         "id": "reproducibility_contract",
         "title": "Reproducibility contract",

scripts/build_quality_gates.py ADDED Viewed

	@@ -0,0 +1,189 @@

+#!/usr/bin/env python3
+"""Build the public quality-gate summary.
+This is a presentation artifact over the existing validators. It does not
+replace the validators; it makes the release gate readable in one file and one
+machine-readable JSON bundle.
+"""
+from __future__ import annotations
+import json
+from datetime import datetime, timezone
+from pathlib import Path
+ROOT = Path(__file__).resolve().parents[1]
+OUTPUT_JSON = ROOT / "docs/data/quality_gates.json"
+OUTPUT_MD = ROOT / "QUALITY_GATES.md"
+GATES = [
+    {
+        "id": "scope_claims",
+        "title": "Scope claims guard",
+        "command": "python scripts/validate_scope_claims.py",
+        "report": "docs/data/scope_claims_audit.json",
+        "blocks_if": "Historical 32ep smoke/provenance strings are presented as real 32-episode metrics.",
+        "proves": "The public narrative does not overclaim the Qwen3-Omni smoke artifacts.",
+    },
+    {
+        "id": "website_integrity",
+        "title": "Website integrity",
+        "command": "python scripts/validate_website_integrity.py",
+        "report": "docs/data/website_integrity.json",
+        "blocks_if": "Local links, anchors, JSON bundles, or referenced image assets are missing or invalid.",
+        "proves": "The GitHub Pages / HF static surface is internally coherent before upload.",
+    },
+    {
+        "id": "quality_gate_manifest",
+        "title": "Quality-gate manifest",
+        "command": "python scripts/build_quality_gates.py",
+        "report": "docs/data/quality_gates.json",
+        "blocks_if": "A public reviewer cannot see the current packaging gates in one place.",
+        "proves": "The publication checklist is explicit, versioned, and mirrored with the repo.",
+    },
+    {
+        "id": "artifact_index",
+        "title": "Artifact index",
+        "command": "python scripts/build_artifact_index.py",
+        "report": "docs/data/artifact_index.json",
+        "blocks_if": "Reviewer-critical evidence files are missing from the indexed proof layer.",
+        "proves": "Core proof artifacts exist and stable files have SHA-256 hashes.",
+    },
+    {
+        "id": "publication_hygiene",
+        "title": "Publication hygiene",
+        "command": "python scripts/validate_publication_package.py",
+        "report": "docs/data/publication_audit.json",
+        "blocks_if": "Raw data, caches, heavy archives, token strings, missing required assets, or stale public-card figure references enter public bundles.",
+        "proves": "The repo and prepared HF bundles are clean enough to publish.",
+    },
+    {
+        "id": "mirror_parity",
+        "title": "Prepared mirror parity",
+        "command": "python scripts/validate_mirror_parity.py",
+        "report": "docs/data/mirror_parity.json",
+        "blocks_if": "Prepared HF Space, artifact dataset, or model bundle diverges from the repo for critical files.",
+        "proves": "The files staged for GitHub and Hugging Face are synchronized before upload.",
+    },
+]
+POST_PUBLISH_CHECKS = [
+    {
+        "id": "github_pages_deploy",
+        "title": "GitHub Pages deployment",
+        "evidence": "gh run list --repo ChaoYue0307/ropedia-xperience-10m-task-suite --limit 5",
+        "required_result": "latest pages-build-deployment run succeeds",
+    },
+    {
+        "id": "live_figure_hash_parity",
+        "title": "Live figure hash parity",
+        "evidence": "download live GitHub/HF task_suite_infographic.png and compare SHA-256 to docs/assets/task_suite_infographic.png",
+        "required_result": "all live hashes match the repo asset",
+    },
+    {
+        "id": "rendered_browser_smoke",
+        "title": "Rendered browser smoke",
+        "evidence": "Browser/Playwright page identity, nonblank render, console health, and one local interaction",
+        "required_result": "no relevant console warnings/errors and target links work",
+    },
+]
+def read_status(path: Path) -> dict:
+    if not path.exists():
+        return {"exists": False, "status": "missing"}
+    try:
+        payload = json.loads(path.read_text(encoding="utf-8"))
+    except json.JSONDecodeError as exc:
+        return {"exists": True, "status": "invalid_json", "error": str(exc)}
+    return {
+        "exists": True,
+        "status": str(payload.get("status", "unknown")),
+    }
+def build_payload() -> dict:
+    gate_records = []
+    generated_at = datetime.now(timezone.utc).isoformat(timespec="seconds")
+    for gate in GATES:
+        if gate["id"] == "quality_gate_manifest":
+            status = {"exists": True, "status": "pass"}
+        else:
+            status = read_status(ROOT / gate["report"])
+        gate_records.append({**gate, "current_report": status})
+    overall_status = "pass" if all(item["current_report"]["status"] == "pass" for item in gate_records) else "fail"
+    return {
+        "title": "Ropedia Xperience-10M Publication Quality Gates",
+        "status": overall_status,
+        "generated_at_utc": generated_at,
+        "rule": "Do not present a release as current unless every automated gate passes, then verify live GitHub/HF mirrors after publishing.",
+        "automated_gates": gate_records,
+        "post_publish_checks": POST_PUBLISH_CHECKS,
+        "scope_boundary": "These gates validate public packaging, claim boundaries, mirror parity, and website integrity. They do not prove cross-episode model quality.",
+    }
+def markdown(payload: dict) -> str:
+    lines = [
+        "# Publication Quality Gates",
+        "",
+        "This file is the reviewer-facing release checklist for the Ropedia Xperience-10M Task Suite.",
+        "",
+        f"Current gate status: **{payload['status']}**",
+        "",
+        payload["rule"],
+        "",
+        "These gates validate public packaging, claim boundaries, mirror parity, and website integrity. They do not prove cross-episode model quality; the 32-episode Qwen3-Omni pilot remains gated on data access.",
+        "",
+        "## Automated Gates",
+        "",
+        "| Gate | Command | Report | Current report status | Blocks publication if |",
+        "| --- | --- | --- | --- | --- |",
+    ]
+    for gate in payload["automated_gates"]:
+        report_status = gate["current_report"]["status"]
+        lines.append(
+            f"| {gate['title']} | `{gate['command']}` | `{gate['report']}` | `{report_status}` | {gate['blocks_if']} |"
+        )
+    lines.extend([
+        "",
+        "## Post-Publish Checks",
+        "",
+        "| Check | Evidence | Required result |",
+        "| --- | --- | --- |",
+    ])
+    for check in payload["post_publish_checks"]:
+        lines.append(f"| {check['title']} | `{check['evidence']}` | {check['required_result']} |")
+    lines.extend([
+        "",
+        "## Rerun Order",
+        "",
+        "```bash",
+        "python scripts/validate_scope_claims.py",
+        "python scripts/validate_website_integrity.py",
+        "python scripts/build_quality_gates.py",
+        "python scripts/build_artifact_index.py",
+        "python scripts/validate_publication_package.py",
+        "python scripts/validate_mirror_parity.py",
+        "```",
+        "",
+        "After Hugging Face bundle sync, rerun `validate_publication_package.py` and `validate_mirror_parity.py` once more before upload.",
+        "",
+    ])
+    return "\n".join(lines)
+def main() -> int:
+    payload = build_payload()
+    OUTPUT_JSON.parent.mkdir(parents=True, exist_ok=True)
+    OUTPUT_JSON.write_text(json.dumps(payload, indent=2) + "\n", encoding="utf-8")
+    OUTPUT_MD.write_text(markdown(payload), encoding="utf-8")
+    print(f"{payload['status'].upper()}: wrote {OUTPUT_JSON}")
+    print(f"{payload['status'].upper()}: wrote {OUTPUT_MD}")
+    return 0 if payload["status"] == "pass" else 1
+if __name__ == "__main__":
+    raise SystemExit(main())

scripts/validate_mirror_parity.py CHANGED Viewed

@@ -25,6 +25,7 @@ DATA_FILES = [
     "modality_atlas.json",
     "project_manifest.json",
     "publication_audit.json",
     "reproducibility_matrix.json",
     "research_direction_extensions.json",
     "research_directions.json",
@@ -50,6 +51,7 @@ ASSET_FILES = [
 SCRIPT_FILES = [
     "build_artifact_index.py",
     "validate_mirror_parity.py",
     "validate_publication_package.py",
     "validate_scope_claims.py",
@@ -60,6 +62,10 @@ WEBSITE_FILES = [
     "index.html",
 ]
 def sha256(path: Path) -> str:
     digest = hashlib.sha256()
@@ -166,6 +172,19 @@ def build_report(hf_root: Path) -> dict:
             )
         )
     failures = [
         {"group": group["name"], **failure}
         for group in groups
@@ -209,6 +228,12 @@ def build_report(hf_root: Path) -> dict:
                 if not any(failure["group"].startswith("website/") for failure in failures)
                 else "fail",
             },
         ],
         "groups": groups,
         "failures": failures,

     "modality_atlas.json",
     "project_manifest.json",
     "publication_audit.json",
+    "quality_gates.json",
     "reproducibility_matrix.json",
     "research_direction_extensions.json",
     "research_directions.json",
 SCRIPT_FILES = [
     "build_artifact_index.py",
+    "build_quality_gates.py",
     "validate_mirror_parity.py",
     "validate_publication_package.py",
     "validate_scope_claims.py",
     "index.html",
 ]
+DOC_FILES = [
+    "QUALITY_GATES.md",
+]
 def sha256(path: Path) -> str:
     digest = hashlib.sha256()
             )
         )
+    for filename in DOC_FILES:
+        groups.append(
+            parity_group(
+                f"docs/{filename}",
+                ROOT / filename,
+                {
+                    "hf_space": hf_root / "space" / filename,
+                    "hf_artifacts": hf_root / "artifacts" / filename,
+                    "hf_model": hf_root / "model" / filename,
+                },
+            )
+        )
     failures = [
         {"group": group["name"], **failure}
         for group in groups
                 if not any(failure["group"].startswith("website/") for failure in failures)
                 else "fail",
             },
+            {
+                "name": "repo_hf_quality_doc_parity",
+                "status": "pass"
+                if not any(failure["group"].startswith("docs/") for failure in failures)
+                else "fail",
+            },
         ],
         "groups": groups,
         "failures": failures,

scripts/validate_publication_package.py CHANGED Viewed

@@ -193,6 +193,7 @@ def required_assets(root: Path) -> dict[str, bool]:
         "LICENSE",
         "codemeta.json",
         "ARTIFACT_GUIDE.md",
         "REPRODUCIBILITY.md",
         "EVIDENCE_CONTRACT.md",
         "DATA_NOTICE.md",
@@ -203,6 +204,7 @@ def required_assets(root: Path) -> dict[str, bool]:
         "docs/sitemap.xml",
         "docs/data/evidence_contract.json",
         "docs/data/artifact_index.json",
         "docs/data/project_manifest.json",
         "docs/data/reviewer_packet.json",
         "docs/data/reproducibility_matrix.json",
@@ -229,6 +231,7 @@ def required_assets(root: Path) -> dict[str, bool]:
         "scripts/episode_task_suite.py",
         "scripts/neural_task_models.py",
         "scripts/build_artifact_index.py",
         "scripts/validate_mirror_parity.py",
         "scripts/validate_scope_claims.py",
         "scripts/validate_website_integrity.py",

         "LICENSE",
         "codemeta.json",
         "ARTIFACT_GUIDE.md",
+        "QUALITY_GATES.md",
         "REPRODUCIBILITY.md",
         "EVIDENCE_CONTRACT.md",
         "DATA_NOTICE.md",
         "docs/sitemap.xml",
         "docs/data/evidence_contract.json",
         "docs/data/artifact_index.json",
+        "docs/data/quality_gates.json",
         "docs/data/project_manifest.json",
         "docs/data/reviewer_packet.json",
         "docs/data/reproducibility_matrix.json",
         "scripts/episode_task_suite.py",
         "scripts/neural_task_models.py",
         "scripts/build_artifact_index.py",
+        "scripts/build_quality_gates.py",
         "scripts/validate_mirror_parity.py",
         "scripts/validate_scope_claims.py",
         "scripts/validate_website_integrity.py",