ArunkumarVR commited on
Commit
01981d4
·
verified ·
1 Parent(s): b02b2e2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -3
README.md CHANGED
@@ -26,13 +26,14 @@ The model emphasizes **reasoning quality, instruction robustness, and stability
26
  - 16K context length
27
  - Optimized for reasoning-centric tasks
28
  - Designed for modern GPU inference runtimes
29
- - **Architecture:** Qwen3-compatible (DeepBrainz-R series, optimized via OPD)
30
 
31
  ---
32
 
33
  ## Intended Use
34
 
35
- - Advanced reasoning systems
 
36
  - Research and evaluation
37
  - Agentic workflows
38
  - Inference-time scaling and test-time compute experiments
@@ -69,7 +70,7 @@ print(tok.decode(out[0], skip_special_tokens=True))
69
 
70
  ## Training Summary
71
 
72
- The model was produced using a **multi-stage optimization process** involving large-scale supervision and **iterative refinement** to improve reasoning quality and robustness.
73
 
74
  Specific training details are intentionally abstracted in this public release.
75
 
 
26
  - 16K context length
27
  - Optimized for reasoning-centric tasks
28
  - Designed for modern GPU inference runtimes
29
+ - **Architecture:** Qwen3-compatible (DeepBrainz-R series, post-trained, and optimized for math and coding)
30
 
31
  ---
32
 
33
  ## Intended Use
34
 
35
+ - Advanced reasoning systems
36
+ - Math and Coding
37
  - Research and evaluation
38
  - Agentic workflows
39
  - Inference-time scaling and test-time compute experiments
 
70
 
71
  ## Training Summary
72
 
73
+ The model was produced using a **multi-stage optimization process** involving large-scale on-policy optimization and **iterative refinement** to improve reasoning quality and robustness.
74
 
75
  Specific training details are intentionally abstracted in this public release.
76