Update README.md
Browse files
README.md
CHANGED
|
@@ -20,27 +20,22 @@ pipeline_tag: text-generation
|
|
| 20 |
inference: false
|
| 21 |
---
|
| 22 |
|
| 23 |
-
#
|
| 24 |
|
| 25 |
> *"Not your weights, not your intelligence."*
|
| 26 |
> *In the dark forest of Web3, an individual without private AI is merely prey.*
|
| 27 |
|
| 28 |
-
|
|
|
|
| 29 |
|
| 30 |
-
|
| 31 |
-
The DMind lineage was born from a singular conviction: **that decentralized finance deserves decentralized intelligence.** This journey began with **DMind-1**, which shattered the monopoly of closed-source AI by releasing the world's first Web3-native LLM, democratizing access to Solidity comprehension. It continued with **DMind-2**, which proved that domain-specific fine-tuning could outmaneuver trillion-parameter giants in vertical benchmarks, establishing a new standard for specialized AI.
|
| 32 |
|
| 33 |
-
###
|
| 34 |
-
However, **DMind-3** represents our most significant evolutionary leap yet. We recognized that in the high-stakes, adversarial "dark forest" of DeFi, standard knowledge retrieval is a liability. A model must do more than recite facts; it must possess **Reflective Intelligence (System 2 Thinking)** to navigate risk.
|
| 35 |
-
|
| 36 |
-
**DMind-3-mini** embodies this philosophy. Positioned as the "Brain" within our local ecosystem, it bridges the gap between the real-time reflexes of the edge-side *DMind-3-nano* and the macroscopic foresight of the cloud-native *DMind-3-max*. It is engineered not as a chatbot, but as a **"Computational Financial Actuary"**—a privacy-first, offline-capable engine designed to bring institutional-grade logic to the individual sovereign user.
|
| 37 |
-
|
| 38 |
-
### 2. From Copilot to Species
|
| 39 |
Traditional "Copilots" are merely assistants. In the 24/7 PVP environment of DeFi, you need an **Agent** capable of independent risk assessment. Humans—carbon-based lifeforms with limited processing speed—are increasingly outmatched by MEV bots and algorithmic predators.
|
| 40 |
|
| 41 |
We need an evolutionary tool: **An "Exo-Cortex" that runs locally, remains absolutely loyal, and deeply understands the dark logic of finance.**
|
| 42 |
|
| 43 |
-
###
|
| 44 |
**DMind-3-mini** was not built to score points on general benchmarks. It was engineered to arm the individual against institutional extraction. We refuse to upload your Alpha strategies to the cloud. True Web3 AI must be **Private**, **Local**, and **Antifragile**.
|
| 45 |
|
| 46 |
* **DMind-3-nano** is your Shield.
|
|
@@ -50,23 +45,16 @@ Welcome to the era of **Sovereign Intelligence**.
|
|
| 50 |
|
| 51 |
---
|
| 52 |
|
| 53 |
-
#
|
| 54 |
|
| 55 |
-
## 1.
|
| 56 |
|
| 57 |
-
|
| 58 |
-
The DMind series represents a continuous evolution in specialized Web3 artificial intelligence:
|
| 59 |
-
* **DMind-1 Series:** Pioneered the industry by open-sourcing the world's first LLM dedicated to Web3/Solidity contexts.
|
| 60 |
-
* **DMind-2 Series:** Achieved SOTA performance in vertical benchmarks, proving that domain-specific fine-tuning can outperform larger generalist models.
|
| 61 |
-
* **DMind-3 Series (Current):** Marks a paradigm shift from "Knowledge Retrieval" to **"Reflective Intelligence" (System 2 Thinking)**. We introduce the **C³-SFT** paradigm to solve the hallucination and logic deficit problems in high-stakes financial environments.
|
| 62 |
|
| 63 |
-
|
| 64 |
-
|
| 65 |
-
* **DMind-3-nano (270M):** **The Shield.** An edge-side Intent Recognition model. Optimized for <500MB memory footprint, delivering >98% accuracy in identifying malicious transaction signatures locally.
|
| 66 |
-
* **DMind-3-mini (4B) [THIS MODEL]:** **The Brain.** A "Local Financial Actuary". The core reasoning engine designed for high-end consumer workstations, balancing deep logic inference with privacy-first offline deployment.
|
| 67 |
-
* **DMind-3-max (MoE):** **The Oracle.** A cloud-native Mixture-of-Experts model designed for cross-chain macroeconomic simulation and ecosystem-wide governance.
|
| 68 |
|
| 69 |
-
|
| 70 |
|
| 71 |
## 2. Model Details
|
| 72 |
|
|
@@ -79,31 +67,29 @@ The DMind-3 lineup is architected for specific computational environments:
|
|
| 79 |
* **Context Window:** 128k tokens
|
| 80 |
* **Hardware Requirement:** GPU with $\ge$ **12GB VRAM** (Recommended: NVIDIA RTX 4070Ti+, Apple M3/M4 Pro/Max).
|
| 81 |
|
| 82 |
-
---
|
| 83 |
-
|
| 84 |
## 3. Methodology: C³-SFT
|
| 85 |
|
| 86 |
**DMind-3-mini** introduces **Contrastive Chain-of-Correction Supervised Fine-Tuning (C³-SFT)**.
|
| 87 |
-
Unlike standard SFT which models a direct mapping
|
| 88 |
|
| 89 |

|
| 90 |
*(Figure 1: The C³-SFT training pipeline, illustrating the Triplet Data Structure and Contextual Loss Masking)*
|
| 91 |
|
| 92 |
### 3.1 Mathematical Formalization
|
| 93 |
-
|
| 94 |
-
|
| 95 |
-
The optimization objective $\mathcal{L}_{C^3}$ is defined as maximizing the conditional probability of the correction path, given the error context:
|
| 96 |
|
| 97 |
$$
|
| 98 |
\mathcal{L}_{C^3}(\theta) = - \mathbb{E}_{\mathcal{D}} \left[ \sum_{t=1}^{T} \alpha_t \cdot \log P_\theta(y^+_{t} \mid x, y^-, y^+_{<t}) \right] + \lambda \mathbb{KL}[\pi_\theta || \pi_{\text{ref}}]
|
| 99 |
$$
|
| 100 |
|
| 101 |
-
|
| 102 |
-
*
|
| 103 |
-
*
|
|
|
|
|
|
|
| 104 |
|
| 105 |
### 3.2 Dual-State Inference Mechanism
|
| 106 |
-
During inference, DMind-3-mini operates in two distinct topological modes based on the presence of a trigger token
|
| 107 |
|
| 108 |
$$
|
| 109 |
\hat{y} =
|
|
@@ -113,10 +99,9 @@ $$
|
|
| 113 |
\end{cases}
|
| 114 |
$$
|
| 115 |
|
| 116 |
-
*
|
| 117 |
-
*
|
| 118 |
-
|
| 119 |
-
---
|
| 120 |
|
| 121 |
## 4. Intended Use: Web3 Financial Know-How
|
| 122 |
|
|
@@ -136,8 +121,6 @@ For maximum security, we recommend the **DMind Local Stack**:
|
|
| 136 |
* **The Brain (DMind-3-mini):** Runs on your high-performance laptop. Handles complex strategy formulation, deep research, and "System 2" logic.
|
| 137 |
* **The Shield (DMind-3-nano):** Runs in your browser/wallet background. Handles real-time transaction signing safety checks and "System 1" intuition.
|
| 138 |
|
| 139 |
-
---
|
| 140 |
-
|
| 141 |
## 5. Training Data
|
| 142 |
|
| 143 |
The model was fine-tuned on **82,000** high-value private samples:
|
|
@@ -146,11 +129,9 @@ The model was fine-tuned on **82,000** high-value private samples:
|
|
| 146 |
| :--- | :---: | :--- |
|
| 147 |
| **Institutional Alpha Reports** | 40% | Deep dive reports from top-tier firms (e.g., Paradigm, Delphi), structured into logic chains. |
|
| 148 |
| **Financial Post-Mortems** | 30% | Historical analysis of collapses (Luna, FTX, Euler Hack), focusing on pre-crash indicators. |
|
| 149 |
-
| **Smart Contract Audits** | 20% | C³-SFT formatted pairs: Vulnerable Code
|
| 150 |
| **On-Chain Behavior Logs** | 10% | Parsed intent analysis of "Smart Money" wallet operations during high volatility events. |
|
| 151 |
|
| 152 |
-
---
|
| 153 |
-
|
| 154 |
## 6. Performance Benchmarks
|
| 155 |
|
| 156 |
Evaluated on **Web3-Finance-Eval-2026**:
|
|
@@ -164,10 +145,20 @@ Evaluated on **Web3-Finance-Eval-2026**:
|
|
| 164 |
|
| 165 |
*DMind-3-mini outperforms generalist models 15x its size in specific vertical tasks, validating the C³-SFT approach.*
|
| 166 |
|
| 167 |
-
---
|
| 168 |
-
|
| 169 |
## 7. Limitations & Disclaimer
|
| 170 |
|
| 171 |
* **High Hardware Barrier:** Due to the decision to retain BF16 precision for financial accuracy, this model **requires >= 12GB VRAM**. It is not suitable for standard office laptops.
|
| 172 |
* **Knowledge Cutoff:** While the *logic* is robust, specific protocol data (TVL, Price) is limited to the training cutoff. Use with RAG for real-time data.
|
| 173 |
* **Legal Disclaimer:** This model is an **analytical tool**, not a financial advisor. The output (NFA) should never be the sole basis for investment decisions. The developers assume no liability for financial losses.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 20 |
inference: false
|
| 21 |
---
|
| 22 |
|
| 23 |
+
# The Era of Sovereign Intelligence
|
| 24 |
|
| 25 |
> *"Not your weights, not your intelligence."*
|
| 26 |
> *In the dark forest of Web3, an individual without private AI is merely prey.*
|
| 27 |
|
| 28 |
+
### Entropy & Value
|
| 29 |
+
We stand at the intersection of history's two most significant exponential curves: **Artificial Intelligence**, which seeks to solve **Knowledge & Logic**, and **Blockchain**, which seeks to solve **Trust & Value**.
|
| 30 |
|
| 31 |
+
However, the current convergence is flawed. Mainstream AI remains a "Computational Leviathan"—centralized, opaque, and running on servers that ingest your data but cannot comprehend the concept of "Ownership." When a Web2 AI reads a smart contract, it sees code. When we look, we see **Assets, Risks, and Game Theory**.
|
|
|
|
| 32 |
|
| 33 |
+
### From Copilot to Species
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 34 |
Traditional "Copilots" are merely assistants. In the 24/7 PVP environment of DeFi, you need an **Agent** capable of independent risk assessment. Humans—carbon-based lifeforms with limited processing speed—are increasingly outmatched by MEV bots and algorithmic predators.
|
| 35 |
|
| 36 |
We need an evolutionary tool: **An "Exo-Cortex" that runs locally, remains absolutely loyal, and deeply understands the dark logic of finance.**
|
| 37 |
|
| 38 |
+
### The Mission
|
| 39 |
**DMind-3-mini** was not built to score points on general benchmarks. It was engineered to arm the individual against institutional extraction. We refuse to upload your Alpha strategies to the cloud. True Web3 AI must be **Private**, **Local**, and **Antifragile**.
|
| 40 |
|
| 41 |
* **DMind-3-nano** is your Shield.
|
|
|
|
| 45 |
|
| 46 |
---
|
| 47 |
|
| 48 |
+
# DMind-3-mini: The Computational Financial Actuary
|
| 49 |
|
| 50 |
+
## 1. Evolution & Legacy
|
| 51 |
|
| 52 |
+
**The DMind lineage was born from a singular conviction: that decentralized finance deserves decentralized intelligence.** This journey began with **DMind-1**, which shattered the monopoly of closed-source AI by releasing the world's first Web3-native LLM, democratizing access to Solidity comprehension. It continued with **DMind-2**, which proved that domain-specific fine-tuning could outmaneuver trillion-parameter giants in vertical benchmarks, establishing a new standard for specialized AI.
|
|
|
|
|
|
|
|
|
|
|
|
|
| 53 |
|
| 54 |
+
**A Paradigm Shift: From Retrieval to Reflection**
|
| 55 |
+
**DMind-3** represents our most significant evolutionary leap yet. We recognized that in the high-stakes, adversarial "dark forest" of DeFi, standard knowledge retrieval is a liability. A model must do more than recite facts; it must possess **Reflective Intelligence (System 2 Thinking)** to navigate risk.
|
|
|
|
|
|
|
|
|
|
| 56 |
|
| 57 |
+
**DMind-3-mini** embodies this philosophy. Positioned as the "Brain" within our local ecosystem, it bridges the gap between the real-time reflexes of the edge-side *DMind-3-nano* and the macroscopic foresight of the cloud-native *DMind-3-max*. It is engineered not as a chatbot, but as a **"Computational Financial Actuary"**—a privacy-first, offline-capable engine designed to bring institutional-grade logic to the individual sovereign user.
|
| 58 |
|
| 59 |
## 2. Model Details
|
| 60 |
|
|
|
|
| 67 |
* **Context Window:** 128k tokens
|
| 68 |
* **Hardware Requirement:** GPU with $\ge$ **12GB VRAM** (Recommended: NVIDIA RTX 4070Ti+, Apple M3/M4 Pro/Max).
|
| 69 |
|
|
|
|
|
|
|
| 70 |
## 3. Methodology: C³-SFT
|
| 71 |
|
| 72 |
**DMind-3-mini** introduces **Contrastive Chain-of-Correction Supervised Fine-Tuning (C³-SFT)**.
|
| 73 |
+
Unlike standard SFT which models a direct mapping \\(P(y|x)\\), C³-SFT forces the model to navigate a "Correction Trajectory" by contrasting against plausible but flawed reasoning (Negative Samples).
|
| 74 |
|
| 75 |

|
| 76 |
*(Figure 1: The C³-SFT training pipeline, illustrating the Triplet Data Structure and Contextual Loss Masking)*
|
| 77 |
|
| 78 |
### 3.1 Mathematical Formalization
|
| 79 |
+
The optimization objective \\(\mathcal{L}_{C^3}\\) is defined as maximizing the conditional probability of the correction path, given the error context:
|
|
|
|
|
|
|
| 80 |
|
| 81 |
$$
|
| 82 |
\mathcal{L}_{C^3}(\theta) = - \mathbb{E}_{\mathcal{D}} \left[ \sum_{t=1}^{T} \alpha_t \cdot \log P_\theta(y^+_{t} \mid x, y^-, y^+_{<t}) \right] + \lambda \mathbb{KL}[\pi_\theta || \pi_{\text{ref}}]
|
| 83 |
$$
|
| 84 |
|
| 85 |
+
* \\(\mathcal{D} = \{(x, y^-, y^+_{cot})\}_{i=1}^N\\) represents the training dataset containing financial query triplets.
|
| 86 |
+
* \\(y^-\\) denotes the "Negative Sample" containing common logical fallacies (e.g., overlooking Impermanent Loss).
|
| 87 |
+
* \\(y^+_{cot}\\) is the corrective Chain-of-Thought that the model aims to generate.
|
| 88 |
+
* \\(\alpha_t\\) is a dynamic attention weight that penalizes logical discontinuities during the reasoning steps.
|
| 89 |
+
* \\(\pi_{\text{ref}}\\) is the reference policy used for KL-divergence regularization to prevent mode collapse.
|
| 90 |
|
| 91 |
### 3.2 Dual-State Inference Mechanism
|
| 92 |
+
During inference, DMind-3-mini operates in two distinct topological modes based on the presence of a trigger token \\(\tau\\). Let \\(\mathcal{I}(x)\\) denote the inference function:
|
| 93 |
|
| 94 |
$$
|
| 95 |
\hat{y} =
|
|
|
|
| 99 |
\end{cases}
|
| 100 |
$$
|
| 101 |
|
| 102 |
+
* \\(\tau\\) represents the control token (e.g., `<REFLECT>`) injected into the system prompt.
|
| 103 |
+
* \\(\mathcal{G}_{neg}(x)\\) is the latent negative hypothesis generated internally by the model to simulate a potential error.
|
| 104 |
+
* \\(\mathcal{H}_{crit}\\) is the critique operator that filters the latent hypothesis to derive a rigorously verified conclusion.
|
|
|
|
| 105 |
|
| 106 |
## 4. Intended Use: Web3 Financial Know-How
|
| 107 |
|
|
|
|
| 121 |
* **The Brain (DMind-3-mini):** Runs on your high-performance laptop. Handles complex strategy formulation, deep research, and "System 2" logic.
|
| 122 |
* **The Shield (DMind-3-nano):** Runs in your browser/wallet background. Handles real-time transaction signing safety checks and "System 1" intuition.
|
| 123 |
|
|
|
|
|
|
|
| 124 |
## 5. Training Data
|
| 125 |
|
| 126 |
The model was fine-tuned on **82,000** high-value private samples:
|
|
|
|
| 129 |
| :--- | :---: | :--- |
|
| 130 |
| **Institutional Alpha Reports** | 40% | Deep dive reports from top-tier firms (e.g., Paradigm, Delphi), structured into logic chains. |
|
| 131 |
| **Financial Post-Mortems** | 30% | Historical analysis of collapses (Luna, FTX, Euler Hack), focusing on pre-crash indicators. |
|
| 132 |
+
| **Smart Contract Audits** | 20% | C³-SFT formatted pairs: \\(\text{Vulnerable Code} \to \text{Exploit Analysis} \to \text{Fix}\\). |
|
| 133 |
| **On-Chain Behavior Logs** | 10% | Parsed intent analysis of "Smart Money" wallet operations during high volatility events. |
|
| 134 |
|
|
|
|
|
|
|
| 135 |
## 6. Performance Benchmarks
|
| 136 |
|
| 137 |
Evaluated on **Web3-Finance-Eval-2026**:
|
|
|
|
| 145 |
|
| 146 |
*DMind-3-mini outperforms generalist models 15x its size in specific vertical tasks, validating the C³-SFT approach.*
|
| 147 |
|
|
|
|
|
|
|
| 148 |
## 7. Limitations & Disclaimer
|
| 149 |
|
| 150 |
* **High Hardware Barrier:** Due to the decision to retain BF16 precision for financial accuracy, this model **requires >= 12GB VRAM**. It is not suitable for standard office laptops.
|
| 151 |
* **Knowledge Cutoff:** While the *logic* is robust, specific protocol data (TVL, Price) is limited to the training cutoff. Use with RAG for real-time data.
|
| 152 |
* **Legal Disclaimer:** This model is an **analytical tool**, not a financial advisor. The output (NFA) should never be the sole basis for investment decisions. The developers assume no liability for financial losses.
|
| 153 |
+
|
| 154 |
+
---
|
| 155 |
+
|
| 156 |
+
## Citation
|
| 157 |
+
|
| 158 |
+
```bibtex
|
| 159 |
+
@model{dmind2026mini,
|
| 160 |
+
title={DMind-3-mini: A 4B Parameter LLM for Web3 Financial Intelligence},
|
| 161 |
+
author={Huang, Enhao and Zhulong AI Research Team},
|
| 162 |
+
year={2026},
|
| 163 |
+
url={[https://huggingface.co/zhulong-ai/dmind-3-mini](https://huggingface.co/zhulong-ai/dmind-3-mini)}
|
| 164 |
+
}
|