yuzhe commited on
Commit
90afe36
·
verified ·
1 Parent(s): 78dd4ce

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -9
README.md CHANGED
@@ -124,16 +124,9 @@ The model was fine-tuned on **82,000** high-value private samples:
124
 
125
  ## 6. 🏆 Performance Benchmarks
126
 
127
- Evaluated on **Web3-Finance-Eval-2026**:
128
 
129
- | Metric | DMind-3-mini (4B) | Llama-3-70B | GPT-4o (General) |
130
- | :--- | :---: | :---: | :---: |
131
- | **Ponzi Logic Detection** | **96.4%** | 78.2% | 85.5% |
132
- | **Impermanent Loss Calc** | **99.1%** | 85.0% | 92.3% |
133
- | **Contract Exploit ID** | **92.3%** | 88.5% | 89.1% |
134
- | **Inference Cost** | **Local (Free)** | High | High |
135
-
136
- *DMind-3-mini outperforms generalist models 15x its size in specific vertical tasks, validating the C³-SFT approach.*
137
 
138
  ## 7. ⚖️ Limitations & Disclaimer
139
 
 
124
 
125
  ## 6. 🏆 Performance Benchmarks
126
 
127
+ Evaluated on three key benchmarks: DMind Benchmark (Web3 Native Logic), FinanceQA (Financial Domain Knowledge), and AIME 2025 (Advanced Mathematical Reasoning).
128
 
129
+ The evaluation compares DMind-3-mini (4B) against top-tier frontier models (GPT-5.1, Claude Sonnet 4.5) and other efficient models. Despite its compact size, the Mini model demonstrates exceptional efficiency, particularly in specialized domain tasks where it outperforms significantly larger generalist models.
 
 
 
 
 
 
 
130
 
131
  ## 7. ⚖️ Limitations & Disclaimer
132