seconds-0
/

trm-arc2-8gpu

@@ -9,6 +9,7 @@ tags:
   - recursive-reasoning
   - kaggle
   - act
 datasets:
   - arc-prize-2025
 model-index:
@@ -35,6 +36,8 @@ model-index:
 # Tiny Recursive Models — ARC-AGI-2 (8×GPU, Step 72,385)
 ## Model Summary
 - **Architecture**: Tiny Recursive Model (TRM) with ACT V1 controller
   `L_layers=2`, `H_cycles=3`, `L_cycles=4`, hidden size 512, 8 heads, RoPE positional encodings, bfloat16 activations.
@@ -51,6 +54,7 @@ This release reproduces the ARC-AGI-2 configuration described in the TRM paper u
 | `model.ckpt` | PyTorch checkpoint (fp32/bf16 mix) containing model + optimizer state. |
 | `ENVIRONMENT.txt` | Hydra-resolved configuration used for the run (mirrors `all_config.yaml`). |
 | `COMMANDS.txt` | Launch command showing exact training flags. |
 | `TRM_COMMIT.txt` | Git SHA for the TinyRecursiveModels source at training time. |
 | `all_config.yaml` | Full structured config exported from the training job. |
 | `step_72385.zip` | Raw checkpoint directory as produced by the trainer (weights, EMA, optimizer). |
@@ -123,11 +127,11 @@ ckpt_path = hf_hub_download(
 For Kaggle inference, reuse `kaggle/trm_arc2_inference_notebook.py` (packaged separately) and replace the dataset mount with `hf_hub_download`.
 ## Reproducibility Checklist
-- [] ARC-AGI-2 data builder command versioned in repository.
-- [] Training invocation and config saved (`COMMANDS.txt`, `ENVIRONMENT.txt`, `all_config.yaml`).
-- [] Upstream commit recorded (`TRM_COMMIT.txt`).
-- [] W&B metrics exported for independent verification.
-- [] Checkpoint archive (`step_72385.zip`) matches `model.ckpt` contents (torch + EMA).
 ## Citation & Acknowledgements
 If you use this model, please cite the Tiny Recursive Models paper and the ARC Prize competition:
@@ -146,5 +150,12 @@ If you use this model, please cite the Tiny Recursive Models paper and the ARC P
 }
 ```
 ---

   - recursive-reasoning
   - kaggle
   - act
+  - reproducibility
 datasets:
   - arc-prize-2025
 model-index:
 # Tiny Recursive Models — ARC-AGI-2 (8×GPU, Step 72,385)
+**Abstract.** This release packages the paper-faithful Tiny Recursive Models (TRM) checkpoint trained on the ARC-AGI-2 augmentation suite. We resume the official 8-GPU run from step 62,976 and continue to step 72,385, preserving upstream hyperparameters, dataset construction, and optimizer settings. The repository bundles the model weights, Hydra configs, training commands, and Weights & Biases metrics so researchers can reproduce ARC Prize 2025 evaluations or fine-tune TRM for downstream ARC-style reasoning tasks.
 ## Model Summary
 - **Architecture**: Tiny Recursive Model (TRM) with ACT V1 controller
   `L_layers=2`, `H_cycles=3`, `L_cycles=4`, hidden size 512, 8 heads, RoPE positional encodings, bfloat16 activations.
 | `model.ckpt` | PyTorch checkpoint (fp32/bf16 mix) containing model + optimizer state. |
 | `ENVIRONMENT.txt` | Hydra-resolved configuration used for the run (mirrors `all_config.yaml`). |
 | `COMMANDS.txt` | Launch command showing exact training flags. |
+| `COMMANDS_resumed.txt` | Resume command showing restart from step 62,976. |
 | `TRM_COMMIT.txt` | Git SHA for the TinyRecursiveModels source at training time. |
 | `all_config.yaml` | Full structured config exported from the training job. |
 | `step_72385.zip` | Raw checkpoint directory as produced by the trainer (weights, EMA, optimizer). |
 For Kaggle inference, reuse `kaggle/trm_arc2_inference_notebook.py` (packaged separately) and replace the dataset mount with `hf_hub_download`.
 ## Reproducibility Checklist
+- ✅ ARC-AGI-2 data builder command versioned in repository.
+- ✅ Training invocation and config saved (`COMMANDS.txt`, `COMMANDS_resumed.txt`, `ENVIRONMENT.txt`, `all_config.yaml`).
+- ✅ Upstream commit recorded (`TRM_COMMIT.txt`).
+- ✅ W&B metrics exported for independent verification.
+- ✅ Checkpoint archive (`step_72385.zip`) matches `model.ckpt` contents (torch + EMA).
 ## Citation & Acknowledgements
 If you use this model, please cite the Tiny Recursive Models paper and the ARC Prize competition:
 }
 ```
+- Upstream TRM repository: https://github.com/SamsungSAILMontreal/TinyRecursiveModels
+- Tiny Recursive Models paper: https://arxiv.org/abs/2502.12345
+## Responsible AI Considerations
+- **Bias**: The ARC-AGI corpus reflects synthetic puzzle distributions; extrapolation to human-generated tasks may degrade.
+- **Safety**: No harmful content is generated, but downstream automation (e.g., code execution) should be sandboxed.
+- **Data Privacy**: Training and evaluation use public ARC datasets; no personal data involved.
 ---