yuzhe commited on
Commit
d983c7a
·
verified ·
1 Parent(s): d260b32

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -126,6 +126,8 @@ The model was fine-tuned on **82,000** high-value private samples:
126
 
127
  Evaluated on three key benchmarks: DMind Benchmark (Web3 Native Logic), FinanceQA (Financial Domain Knowledge), and AIME 2025 (Advanced Mathematical Reasoning).
128
 
 
 
129
  The evaluation compares DMind-3-mini (4B) against top-tier frontier models (GPT-5.1, Claude Sonnet 4.5) and other efficient models. Despite its compact size, the Mini model demonstrates exceptional efficiency, particularly in specialized domain tasks where it outperforms significantly larger generalist models.
130
 
131
  ## 7. ⚖️ Limitations & Disclaimer
 
126
 
127
  Evaluated on three key benchmarks: DMind Benchmark (Web3 Native Logic), FinanceQA (Financial Domain Knowledge), and AIME 2025 (Advanced Mathematical Reasoning).
128
 
129
+ ![Figure 3: Performance Benchmarks](./Figures/Figure3.png)
130
+
131
  The evaluation compares DMind-3-mini (4B) against top-tier frontier models (GPT-5.1, Claude Sonnet 4.5) and other efficient models. Despite its compact size, the Mini model demonstrates exceptional efficiency, particularly in specialized domain tasks where it outperforms significantly larger generalist models.
132
 
133
  ## 7. ⚖️ Limitations & Disclaimer