Update README.md
README.md
CHANGED
@@ -12,8 +12,7 @@ base_model:
 - Nanbeige/Nanbeige4-3B-Base
 ---
 <div align="center">
-
-<img src="figures/nbg.png" width="220" alt="Nanbeige Logo">
+<img src="https://huggingface.co/Nanbeige/Nanbeige4.1-3B/resolve/main/figures/nbg.png" width="220" alt="Nanbeige Logo">
 </div>
 
 
@@ -23,8 +22,7 @@ base_model:
 Nanbeige4.1-3B is built upon Nanbeige4-3B-Base and represents an enhanced iteration of our previous reasoning model, Nanbeige4-3B-Thinking-2511, achieved through further post-training optimization with supervised fine-tuning (SFT) and reinforcement learning (RL). As a highly competitive open-source model at a small parameter scale, Nanbeige4.1-3B illustrates that compact models can simultaneously achieve robust **reasoning**, **preference alignment**, and **effective agentic behaviors**.
 
 <div align="center">
-
-<img src="figures/model_performance_comparison.png">
+<img src="https://huggingface.co/Nanbeige/Nanbeige4.1-3B/resolve/main/figures/model_performance_comparison.png" width="800" alt="Model Performance Comparison">
 </div>
 
 Specifically, Nanbeige4.1-3B exhibits the following key strengths: