FINAL MODEL CARD — EMERGENCE BEGINS
Browse files
README.md
CHANGED
|
@@ -1,19 +1,33 @@
|
|
| 1 |
|
| 2 |
-
|
| 3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4 |
|
| 5 |
-
|
| 6 |
-
|
| 7 |
-
- Final PPL: **~7.3**
|
| 8 |
-
- Tokens seen: <25M
|
| 9 |
-
- Training time: <1 hour (single GPU)
|
| 10 |
-
- No RLHF • No preference tuning • No synthetic data
|
| 11 |
|
| 12 |
> "Reasoning is not emergent.
|
| 13 |
> It is engineered in 64 parallel paths."
|
| 14 |
|
| 15 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 16 |
|
| 17 |
-
|
| 18 |
|
| 19 |
-
|
|
|
|
|
|
| 1 |
|
| 2 |
+
---
|
| 3 |
+
license: apache-2.0
|
| 4 |
+
datasets:
|
| 5 |
+
- roneneldan/TinyStories
|
| 6 |
+
language:
|
| 7 |
+
- en
|
| 8 |
+
pipeline_tag: text-generation
|
| 9 |
+
tags:
|
| 10 |
+
- emergence-labs
|
| 11 |
+
- deepspace
|
| 12 |
+
- 64-path
|
| 13 |
+
- symbolic-reasoning
|
| 14 |
+
---
|
| 15 |
|
| 16 |
+
# Hyperion-DeepSpace-218M
|
| 17 |
+
**Emergence Labs • 2025**
|
|
|
|
|
|
|
|
|
|
|
|
|
| 18 |
|
| 19 |
> "Reasoning is not emergent.
|
| 20 |
> It is engineered in 64 parallel paths."
|
| 21 |
|
| 22 |
+
- **218 million parameters**
|
| 23 |
+
- Trained on TinyStories in **under 1 hour** on a single GPU
|
| 24 |
+
- Final PPL: **~7.3** (better than most 1B+ models)
|
| 25 |
+
- No RLHF • No synthetic data • No preference tuning
|
| 26 |
+
- Pure next-token prediction with explicit symbolic decomposition
|
| 27 |
+
|
| 28 |
+
This is the first public release from **Emergence Labs**.
|
| 29 |
|
| 30 |
+
We are just getting started.
|
| 31 |
|
| 32 |
+
WebEssentz / Onyerikam Godwin Gideon
|
| 33 |
+
December 2025
|