tvastr committed (verified)
Commit f6ba325 · Parent: f54708a

docs: correct training curriculum (6-phase 1500-step sprint), wall-clock ~7 days, Raccoon 6.1B

Files changed (1): README.md (+16 -13)
README.md CHANGED
@@ -61,22 +61,25 @@ tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b")
 > guha@rtaforge.in for access). This model uses a custom SSM architecture
 > not compatible with standard HuggingFace `AutoModel`.
 
-## Training Curriculum
+## Training
 
-One epoch, single NVIDIA L4, ~15,000 steps across 8 phases + 1,500-step Scholar Sprint.
-Phases 1–5 (pretraining corpus progression) not shown.
+Trained with the Anvaya Gurukul protocol: a constitutional Sisya/Guru loop
+where Sisya proposes weight deltas and Guru applies them after validation.
+SFT imprint applied using surface-only gate-layer fine-tuning (65 examples, 3 epochs).
 
-| Phase | Steps | Dataset | Focus |
-|-------|-------|---------|-------|
-| 6 | 2,000 | Glaive alignment | Alignment |
-| 7 | 1,500 | Glaive alignment | Alignment |
+**1,500 accepted proposals across 6 phases on a single AceCloud L4 (24GB VRAM).
+~7 days of effective training time (total elapsed higher due to crash recovery and VRAM leak debugging).**
 
-Final Scholar Sprint: 1,500 steps, Phase 5 saturation (Logic Giants corpus).
-**Final checkpoint: Step 1,500.**
+| Phase | Proposals | Dataset | Focus |
+|-------|-----------|---------|-------|
+| 0 | 125 | CAMEL Physics | Physical reasoning |
+| 1 | 125 | CAMEL Chemistry | Chemical reasoning |
+| 2 | 125 | CAMEL Biology | Biological reasoning |
+| 3 | 250 | Raccoon Phase 1 | General reasoning |
+| 4 | 500 | Rabbit E2 Phase 4 | Extended curriculum |
+| 5 | 375 | Raccoon Phase 3 (consolidation re-run) | Pattern consolidation |
 
-Trained with the Anvaya Gurukul protocol: a constitutional Sisya/Guru loop
-where Sisya proposes weight deltas and Guru applies them after validation.
-SFT imprint applied using surface-only gate-layer fine-tuning.
+**Final checkpoint: Step 1,500.** seq_len=64, batch_size=3, optimizer=Lion, lr=1e-5.
 
 ## Evaluation Results (Step 1,500)
 
@@ -118,7 +121,7 @@ capability.
 | Model | Params | seq_len | Status |
 |-------|--------|---------|--------|
 | **Rabbit** | ~2.7B | 64 | ✅ This model — v0.1 Alpha |
-| **Raccoon** | ~2.7B | 512 | In training — reasoning curriculum (math ×2, logic ×2) |
+| **Raccoon** | ~6.1B | 512 | In training — reasoning curriculum (math ×2, logic ×2) |
 | **Polar Bear** | ~13B | 512 | Planned — STEM + AEVA anti-hallucination layer |
 
 The delta between Rabbit and Raccoon is the story. One epoch → two epochs,
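The new Training section describes the Anvaya Gurukul loop only at a high level: Sisya proposes a weight delta, Guru validates it, and only accepted proposals are applied (1,500 acceptances total, with gate-layer-only SFT on top). A minimal sketch of that accept/rollback pattern follows; every name in it (`gurukul_step`, `freeze_all_but_gates`, the `"gate"` name filter, `loss_fn`, `validate`) is hypothetical rather than the project's actual API, and plain PyTorch stands in for the custom SSM stack.

```python
# Hypothetical sketch of the propose/validate/apply loop the README describes.
# Only lr=1e-5 comes from the commit; the structure here is an assumption.
import copy
import torch

def freeze_all_but_gates(model: torch.nn.Module) -> None:
    """Surface-only gate-layer SFT: train nothing except gate-layer weights.
    The name-based filter is a guess at what 'gate-layer' means in practice."""
    for name, param in model.named_parameters():
        param.requires_grad = "gate" in name

def gurukul_step(model, loss_fn, batch, validate, lr=1e-5) -> bool:
    """One Sisya proposal: compute a candidate weight delta, tentatively apply
    it, and keep it only if the Guru-side validation accepts the result."""
    loss = loss_fn(model, batch)                   # Sisya's forward pass
    loss.backward()                                # gradients define the delta

    snapshot = copy.deepcopy(model.state_dict())   # Guru keeps a rollback copy
    with torch.no_grad():
        for p in model.parameters():
            if p.grad is not None:
                p.add_(p.grad, alpha=-lr)          # tentatively apply the delta
    model.zero_grad()

    if validate(model):                            # Guru's constitutional check
        return True                                # accepted: the delta stays
    model.load_state_dict(snapshot)                # rejected: roll it back
    return False
```

Read this way, the Proposals column in the phase table counts accepted deltas, and the phase totals (125 × 3 + 250 + 500 + 375) sum to exactly the 1,500 accepted proposals of the final checkpoint step.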
 
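The checkpoint line lists `optimizer=Lion, lr=1e-5`. Lion (Chen et al., 2023, "Symbolic Discovery of Optimization Algorithms") steps in the direction of the sign of an interpolated momentum rather than the raw gradient, keeping optimizer state to one buffer per parameter. A minimal single-tensor sketch, using the paper's default betas; only `lr=1e-5` is taken from the commit's config line:

```python
import torch

@torch.no_grad()
def lion_update(param, grad, momentum, lr=1e-5,
                beta1=0.9, beta2=0.99, wd=0.0):
    """One Lion step: the sign of the beta1-interpolated gradient drives the
    update, while the momentum buffer is tracked separately with beta2."""
    update = (beta1 * momentum + (1 - beta1) * grad).sign()
    param.mul_(1 - lr * wd)                            # decoupled weight decay
    param.add_(update, alpha=-lr)                      # sign-based step
    momentum.mul_(beta2).add_(grad, alpha=1 - beta2)   # momentum update
    return param, momentum
```

Because the sign makes every coordinate's step the same magnitude, Lion is typically run with a noticeably smaller learning rate than AdamW, which is consistent with the lr=1e-5 used here.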