Update README.md
Browse files
README.md
CHANGED
|
@@ -26,13 +26,14 @@ The model emphasizes **reasoning quality, instruction robustness, and stability
|
|
| 26 |
- 16K context length
|
| 27 |
- Optimized for reasoning-centric tasks
|
| 28 |
- Designed for modern GPU inference runtimes
|
| 29 |
-
- **Architecture:** Qwen3-compatible (DeepBrainz-R series, optimized
|
| 30 |
|
| 31 |
---
|
| 32 |
|
| 33 |
## Intended Use
|
| 34 |
|
| 35 |
-
- Advanced reasoning systems
|
|
|
|
| 36 |
- Research and evaluation
|
| 37 |
- Agentic workflows
|
| 38 |
- Inference-time scaling and test-time compute experiments
|
|
@@ -69,7 +70,7 @@ print(tok.decode(out[0], skip_special_tokens=True))
|
|
| 69 |
|
| 70 |
## Training Summary
|
| 71 |
|
| 72 |
-
The model was produced using a **multi-stage optimization process** involving large-scale
|
| 73 |
|
| 74 |
Specific training details are intentionally abstracted in this public release.
|
| 75 |
|
|
|
|
| 26 |
- 16K context length
|
| 27 |
- Optimized for reasoning-centric tasks
|
| 28 |
- Designed for modern GPU inference runtimes
|
| 29 |
+
- **Architecture:** Qwen3-compatible (DeepBrainz-R series, post-trained, and optimized for math and coding)
|
| 30 |
|
| 31 |
---
|
| 32 |
|
| 33 |
## Intended Use
|
| 34 |
|
| 35 |
+
- Advanced reasoning systems
|
| 36 |
+
- Math and Coding
|
| 37 |
- Research and evaluation
|
| 38 |
- Agentic workflows
|
| 39 |
- Inference-time scaling and test-time compute experiments
|
|
|
|
| 70 |
|
| 71 |
## Training Summary
|
| 72 |
|
| 73 |
+
The model was produced using a **multi-stage optimization process** involving large-scale on-policy optimization and **iterative refinement** to improve reasoning quality and robustness.
|
| 74 |
|
| 75 |
Specific training details are intentionally abstracted in this public release.
|
| 76 |
|