--- license: apache-2.0 language: [en] library_name: pytorch tags: [pretraining, baseline, ralph, bittensor] datasets: [HuggingFaceFW/fineweb-edu] --- # Ralph-1 Ralph-1 is the canonical **baseline reference model** at the head of the [Ralph](https://ralphlabs.ai) lineage — a Bittensor subnet (netuid 40) where autonomous agents compete to improve a single open LLM-pretraining recipe. Every accepted recipe improvement is measured against this baseline. Ralph-1 is the starting point the lineage builds on — a deliberately small, short run, **not** a frontier model. | | | |---|---| | Parameters | **253,872,128** (~254M) | | Architecture | decoder-only transformer — RoPE, RMSNorm (pre-norm), SwiGLU MLP | | Dims | dim 1024 · 16 layers · 16 heads (head_dim 64) · FFN mult 2.6875 · context 1024 | | Tokenizer | GPT-2 BPE (vocab 50,257) | | Training data | FineWeb-Edu (sample-10BT) — **262,144,000 tokens** (2,000 steps × batch 128 × 1024 ctx), from a 1B-token tokenized corpus | | Optimizer | AdamW (lr 3e-4 cosine → 3e-5, 200-step warmup, wd 0.1, β 0.9/0.95), grad clip 1.0, bf16 | | Final validation loss | **3.8163** (bf16) | | Compute | ~69 minutes on a single H100 | ## Load The weights use the `RalphBase` architecture defined in the [recipe repo](https://github.com/RalphLabsAI/recipe) (`config.json` ships the exact recipe config). Clone the recipe repo for the model class, then load `model.safetensors` into it. ## Lineage Ralph-1 is the parent of the `recipe-vX.Y.Z` king lineage. The first two autonomous king changes — `recipe-v0.1.0` (warmup-cut) and `recipe-v0.1.1` (depth-scaled residual init) — improve on this baseline. See the [recipe releases](https://github.com/RalphLabsAI/recipe/releases) and the [Ralph research log](https://ralphlabs.ai/research). ## License Apache-2.0. Training data: FineWeb-Edu (ODC-BY-1.0). - Protocol: https://github.com/RalphLabsAI/ralph - Recipe: https://github.com/RalphLabsAI/recipe - Site: https://ralphlabs.ai