Update README.md
README.md
CHANGED
@@ -38,7 +38,8 @@ model-index:
 
 # ATLAS-8B-Thinking
 
-
+
+
 
 **ATLAS-8B-Thinking** is a specialized teacher model developed by Arc Intelligence, designed to solve the core reliability problem in reinforcement learning for LLMs. Standard RL fine-tuning is often brittle, leading to performance degradation where new skills are learned at the expense of old ones.
 
...
 
 ## Model Performance
 
+
+
+
 The ATLAS framework, using this teacher model, produces the following improvements in a student model (Qwen3-4B) compared to the student baseline. The results highlight a rare combination of increased performance, higher efficiency, and fundamental reliability.
 
 | Metric | Improvement | Notes |