Update README.md
Browse files
README.md
CHANGED
|
@@ -124,16 +124,9 @@ The model was fine-tuned on **82,000** high-value private samples:
|
|
| 124 |
|
| 125 |
## 6. 🏆 Performance Benchmarks
|
| 126 |
|
| 127 |
-
Evaluated on
|
| 128 |
|
| 129 |
-
|
| 130 |
-
| :--- | :---: | :---: | :---: |
|
| 131 |
-
| **Ponzi Logic Detection** | **96.4%** | 78.2% | 85.5% |
|
| 132 |
-
| **Impermanent Loss Calc** | **99.1%** | 85.0% | 92.3% |
|
| 133 |
-
| **Contract Exploit ID** | **92.3%** | 88.5% | 89.1% |
|
| 134 |
-
| **Inference Cost** | **Local (Free)** | High | High |
|
| 135 |
-
|
| 136 |
-
*DMind-3-mini outperforms generalist models 15x its size in specific vertical tasks, validating the C³-SFT approach.*
|
| 137 |
|
| 138 |
## 7. ⚖️ Limitations & Disclaimer
|
| 139 |
|
|
|
|
| 124 |
|
| 125 |
## 6. 🏆 Performance Benchmarks
|
| 126 |
|
| 127 |
+
Evaluated on three key benchmarks: DMind Benchmark (Web3 Native Logic), FinanceQA (Financial Domain Knowledge), and AIME 2025 (Advanced Mathematical Reasoning).
|
| 128 |
|
| 129 |
+
The evaluation compares DMind-3-mini (4B) against top-tier frontier models (GPT-5.1, Claude Sonnet 4.5) and other efficient models. Despite its compact size, the Mini model demonstrates exceptional efficiency, particularly in specialized domain tasks where it outperforms significantly larger generalist models.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 130 |
|
| 131 |
## 7. ⚖️ Limitations & Disclaimer
|
| 132 |
|