Commit 17d4b90 (verified) by yuzhe · 1 parent: 28a25de

Update README.md

Files changed (1)
  1. README.md +5 -8
README.md CHANGED
@@ -34,6 +34,11 @@ This repository hosts the **open-source training and evaluation pipeline** as we
 **Repo purpose:** host the open-source training/eval pipeline and release artifacts.
 
 
+## Performance Snapshot
+<img src="figures/model_comparison_chart.png" width="720" />
+
+
+*Figure 1. DMind-3-nano significantly outperforms both the untuned base model and a similarly sized general-purpose model (Qwen3-0.6B), especially in multi-turn success.*
 
 ## Highlights
 
@@ -156,14 +161,6 @@ User: "查一下 Solana 上的 SOL"
 Model: <start_function_call>call:SEARCH_TOKEN{symbol:"SOL",chain:"solana"}<end_function_call>
 ```
 
-## Performance Snapshot
-
-- Function recognition: ~98.8% on validated set
-- Argument extraction: ~97.4%
-- Protocol adherence: SEARCH_TOKEN 98.5%, EXECUTE_SWAP 97.3%
-- Multi-turn success: ~93.7%
-
-Scope: tokens/chains listed in **Model Overview**; outside that set may be lower.
 
 ## License & Governance
 
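
The `Model:` context line in the second hunk shows the function-call output format the README documents. As a reading aid, here is a minimal Python sketch of how such an output might be parsed. It is not part of the repository: the helper name `parse_function_call` and both regular expressions are hypothetical, and the grammar (uppercase function name, unquoted keys, double-quoted string values without escaped quotes) is an assumption inferred from that single example line.

```python
import re

# Assumed wrapper format, inferred from the one example line in the diff:
#   <start_function_call>call:NAME{key:"value",...}<end_function_call>
CALL_RE = re.compile(
    r"<start_function_call>call:(?P<name>[A-Z_]+)\{(?P<args>.*?)\}<end_function_call>"
)
# Assumes unquoted keys and double-quoted values with no escaped quotes.
ARG_RE = re.compile(r'(\w+):"([^"]*)"')


def parse_function_call(text):
    """Return (name, args) for the first call found in text, or None."""
    match = CALL_RE.search(text)
    if match is None:
        return None
    args = dict(ARG_RE.findall(match.group("args")))
    return match.group("name"), args


example = '<start_function_call>call:SEARCH_TOKEN{symbol:"SOL",chain:"solana"}<end_function_call>'
print(parse_function_call(example))
# → ('SEARCH_TOKEN', {'symbol': 'SOL', 'chain': 'solana'})
```

A real parser would need to follow whatever grammar the model's training pipeline actually defines, including non-string argument types if any exist.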