Update README.md
README.md
CHANGED
@@ -34,6 +34,11 @@ This repository hosts the **open-source training and evaluation pipeline** as we
 **Repo purpose:** host the open-source training/eval pipeline and release artifacts.
 
 
+## Performance Snapshot
+<img src="figures/model_comparison_chart.png" width="720" />
+
+
+*Figure 1. DMind-3-nano significantly outperforms both the untuned base model and a similarly sized general-purpose model (Qwen3-0.6B), especially in multi-turn success.*
 
 ## Highlights
 
@@ -156,14 +161,6 @@ User: "查一下 Solana 上的 SOL"
 Model: <start_function_call>call:SEARCH_TOKEN{symbol:"SOL",chain:"solana"}<end_function_call>
 ```
 
-## Performance Snapshot
-
-- Function recognition: ~98.8% on validated set
-- Argument extraction: ~97.4%
-- Protocol adherence: SEARCH_TOKEN 98.5%, EXECUTE_SWAP 97.3%
-- Multi-turn success: ~93.7%
-
-Scope: tokens/chains listed in **Model Overview**; outside that set may be lower.
 
 ## License & Governance
 