yuzhe commited on
Commit
6b54c9c
·
verified ·
1 Parent(s): 175239b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +37 -59
README.md CHANGED
@@ -20,75 +20,66 @@ pipeline_tag: text-generation
20
  inference: false
21
  ---
22
 
23
- # The Era of Sovereign Intelligence
24
 
25
- > *"Not your weights, not your intelligence."*
26
  > *In the dark forest of Web3, an individual without private AI is merely prey.*
27
 
28
- ### Entropy & Value
29
- We stand at the intersection of history's two most significant exponential curves: **Artificial Intelligence**, which seeks to solve **Knowledge & Logic**, and **Blockchain**, which seeks to solve **Trust & Value**.
30
 
31
- However, the current convergence is flawed. Mainstream AI remains a "Computational Leviathan"—centralized, opaque, and running on servers that ingest your data but cannot comprehend the concept of "Ownership." When a Web2 AI reads a smart contract, it sees code. When we look, we see **Assets, Risks, and Game Theory**.
32
 
33
- ### From Copilot to Species
34
- Traditional "Copilots" are merely assistants. In the 24/7 PVP environment of DeFi, you need an **Agent** capable of independent risk assessment. Humans—carbon-based lifeforms with limited processing speed—are increasingly outmatched by MEV bots and algorithmic predators.
35
 
36
- We need an evolutionary tool: **An "Exo-Cortex" that runs locally, remains absolutely loyal, and deeply understands the dark logic of finance.**
37
-
38
- ### The Mission
39
- **DMind-3-mini** was not built to score points on general benchmarks. It was engineered to arm the individual against institutional extraction. We refuse to upload your Alpha strategies to the cloud. True Web3 AI must be **Private**, **Local**, and **Antifragile**.
40
-
41
- * **DMind-3-nano** is your Shield.
42
- * **DMind-3-mini** is your Spear.
43
 
44
  Welcome to the era of **Sovereign Intelligence**.
45
 
46
  ---
47
 
48
- # DMind-3-mini: The Computational Financial Actuary
49
 
50
  ## 1. Evolution & Legacy
51
 
52
- **The DMind lineage was born from a singular conviction: that decentralized finance deserves decentralized intelligence.** This journey began with **DMind-1**, which shattered the monopoly of closed-source AI by releasing the world's first Web3-native LLM, democratizing access to Solidity comprehension. It continued with **DMind-2**, which proved that domain-specific fine-tuning could outmaneuver trillion-parameter giants in vertical benchmarks, establishing a new standard for specialized AI.
53
 
54
- **A Paradigm Shift: From Retrieval to Reflection**
55
- **DMind-3** represents our most significant evolutionary leap yet. We recognized that in the high-stakes, adversarial "dark forest" of DeFi, standard knowledge retrieval is a liability. A model must do more than recite facts; it must possess **Reflective Intelligence (System 2 Thinking)** to navigate risk.
56
 
57
- **DMind-3-mini** embodies this philosophy. Positioned as the "Brain" within our local ecosystem, it bridges the gap between the real-time reflexes of the edge-side *DMind-3-nano* and the macroscopic foresight of the cloud-native *DMind-3-max*. It is engineered not as a chatbot, but as a **"Computational Financial Actuary"**—a privacy-first, offline-capable engine designed to bring institutional-grade logic to the individual sovereign user.
58
 
59
- ## 2. Model Details
60
 
61
  * **Model Name:** DMind-3-mini
62
  * **Organization:** DMind & Zhulong AI (Hangzhou, China)
63
  * **Base Architecture:** Qwen3-4B-Thinking-2507 (Customized Transformer w/ RoPE)
64
  * **Parameter Count:** 4.2 Billion
65
  * **Precision:** **BF16 (Native)**
66
- * *Note: We strictly advise against 4-bit quantization for financial logic tasks, as it degrades numerical precision in APY/IL calculations.*
67
  * **Context Window:** 128k tokens
68
  * **Hardware Requirement:** GPU with $\ge$ **12GB VRAM** (Recommended: NVIDIA RTX 4070Ti+, Apple M3/M4 Pro/Max).
69
 
70
- ## 3. Methodology: C³-SFT
71
 
72
- **DMind-3-mini** introduces **Contrastive Chain-of-Correction Supervised Fine-Tuning (C³-SFT)**.
73
- Unlike standard SFT which models a direct mapping \\(P(y|x)\\), C³-SFT forces the model to navigate a "Correction Trajectory" by contrasting against plausible but flawed reasoning (Negative Samples).
74
 
75
  ![Figure 1: C3-SFT Paradigm](https://your-image-host-url/figure1-c3sft.png)
76
  *(Figure 1: The C³-SFT training pipeline, illustrating the Triplet Data Structure and Contextual Loss Masking)*
77
 
78
- ### 3.1 Mathematical Formalization
79
- The optimization objective \\(\mathcal{L}_{C^3}\\) is defined as maximizing the conditional probability of the correction path, given the error context:
80
 
81
  $$
82
  \mathcal{L}_{C^3}(\theta) = - \mathbb{E}_{\mathcal{D}} \left[ \sum_{t=1}^{T} \alpha_t \cdot \log P_\theta(y^+_{t} \mid x, y^-, y^+_{<t}) \right] + \lambda \mathbb{KL}[\pi_\theta || \pi_{\text{ref}}]
83
  $$
84
 
85
  * \\(\mathcal{D} = \{(x, y^-, y^+_{cot})\}_{i=1}^N\\) represents the training dataset containing financial query triplets.
86
- * \\(y^-\\) denotes the "Negative Sample" containing common logical fallacies (e.g., overlooking Impermanent Loss).
87
  * \\(y^+_{cot}\\) is the corrective Chain-of-Thought that the model aims to generate.
88
- * \\(\alpha_t\\) is a dynamic attention weight that penalizes logical discontinuities during the reasoning steps.
89
- * \\(\pi_{\text{ref}}\\) is the reference policy used for KL-divergence regularization to prevent mode collapse.
90
 
91
- ### 3.2 Dual-State Inference Mechanism
92
  During inference, DMind-3-mini operates in two distinct topological modes based on the presence of a trigger token \\(\tau\\). Let \\(\mathcal{I}(x)\\) denote the inference function:
93
 
94
  $$
@@ -99,29 +90,28 @@ $$
99
  \end{cases}
100
  $$
101
 
102
- * \\(\tau\\) represents the control token (e.g., `<REFLECT>`) injected into the system prompt.
103
- * \\(\mathcal{G}_{neg}(x)\\) is the latent negative hypothesis generated internally by the model to simulate a potential error.
104
- * \\(\mathcal{H}_{crit}\\) is the critique operator that filters the latent hypothesis to derive a rigorously verified conclusion.
105
 
106
- ## 4. Intended Use: Web3 Financial Know-How
107
 
108
- DMind-3-mini is not just a coder; it is a **Financial Risk Assessor**.
109
 
110
  ### Key Capabilities
111
- 1. **Yield Attribution Analysis:** Deconstructs APY sources to distinguish between "Real Yield" (Protocol Revenue) and "Inflationary Yield" (Token Emissions).
112
- 2. **Liquidity Provisioning (LP) Simulation:** Calculates optimal tick ranges for Uniswap V3 positions by modeling volatility surfaces locally.
113
- 3. **Risk-Adjusted Code Auditing:** Beyond syntax errors, it identifies economic exploits (e.g., Flash Loan attack vectors based on price manipulation).
114
 
115
- ### The "Brain & Shield" Ecosystem
116
  For maximum security, we recommend the **DMind Local Stack**:
117
 
118
  ![Figure 2: Brain and Shield Ecosystem](https://your-image-host-url/figure2-ecosystem.png)
119
- *(Figure 2: The On-Device Inference Ecosystem showing the synergy between Nano (Shield) and Mini (Brain))*
120
 
121
- * **The Brain (DMind-3-mini):** Runs on your high-performance laptop. Handles complex strategy formulation, deep research, and "System 2" logic.
122
- * **The Shield (DMind-3-nano):** Runs in your browser/wallet background. Handles real-time transaction signing safety checks and "System 1" intuition.
123
 
124
- ## 5. Training Data
125
 
126
  The model was fine-tuned on **82,000** high-value private samples:
127
 
@@ -132,7 +122,7 @@ The model was fine-tuned on **82,000** high-value private samples:
132
  | **Smart Contract Audits** | 20% | C³-SFT formatted pairs: \\(\text{Vulnerable Code} \to \text{Exploit Analysis} \to \text{Fix}\\). |
133
  | **On-Chain Behavior Logs** | 10% | Parsed intent analysis of "Smart Money" wallet operations during high volatility events. |
134
 
135
- ## 6. Performance Benchmarks
136
 
137
  Evaluated on **Web3-Finance-Eval-2026**:
138
 
@@ -145,20 +135,8 @@ Evaluated on **Web3-Finance-Eval-2026**:
145
 
146
  *DMind-3-mini outperforms generalist models 15x its size in specific vertical tasks, validating the C³-SFT approach.*
147
 
148
- ## 7. Limitations & Disclaimer
149
 
150
  * **High Hardware Barrier:** Due to the decision to retain BF16 precision for financial accuracy, this model **requires >= 12GB VRAM**. It is not suitable for standard office laptops.
151
- * **Knowledge Cutoff:** While the *logic* is robust, specific protocol data (TVL, Price) is limited to the training cutoff. Use with RAG for real-time data.
152
- * **Legal Disclaimer:** This model is an **analytical tool**, not a financial advisor. The output (NFA) should never be the sole basis for investment decisions. The developers assume no liability for financial losses.
153
-
154
- ---
155
-
156
- ## Citation
157
-
158
- ```bibtex
159
- @model{dmind2026mini,
160
- title={DMind-3-mini: A 4B Parameter LLM for Web3 Financial Intelligence},
161
- author={Huang, Enhao and Zhulong AI Research Team},
162
- year={2026},
163
- url={[https://huggingface.co/zhulong-ai/dmind-3-mini](https://huggingface.co/zhulong-ai/dmind-3-mini)}
164
- }
 
20
  inference: false
21
  ---
22
 
23
+ # 🌌 The Era of Sovereign Intelligence
24
 
25
+ > *Not your weights, not your intelligence.*
26
  > *In the dark forest of Web3, an individual without private AI is merely prey.*
27
 
28
+ We stand at the intersection of history's two most significant exponential curves. **Artificial Intelligence** seeks to solve Knowledge & Logic, while **Blockchain** seeks to solve Trust & Value. However, the current convergence is flawed. Mainstream AI remains a centralized Computational Leviathan that runs on opaque servers, ingesting your data without comprehending the concept of Ownership. When a Web2 AI reads a smart contract, it sees code. When we look, we see Assets, Risks, and Game Theory.
 
29
 
30
+ In the 24/7 PVP environment of DeFi, traditional Copilots are merely assistants. You need an **Agent** capable of independent risk assessment. Humans are carbon-based lifeforms with limited processing speed and are increasingly outmatched by MEV bots and algorithmic predators. We need an evolutionary tool, an **Exo-Cortex** that runs locally, remains absolutely loyal, and deeply understands the dark logic of finance.
31
 
32
+ **DMind-3-mini** was not built to score points on general benchmarks. It was engineered to arm the individual against institutional extraction. We refuse to upload your Alpha strategies to the cloud. True Web3 AI must be Private, Local, and Antifragile.
 
33
 
34
+ 🛡️ **DMind-3-nano** is your Shield.
35
+ ⚔️ **DMind-3-mini** is your Spear.
 
 
 
 
 
36
 
37
  Welcome to the era of **Sovereign Intelligence**.
38
 
39
  ---
40
 
41
+ # 🧠 DMind-3-mini: The Computational Financial Actuary
42
 
43
  ## 1. Evolution & Legacy
44
 
45
+ The DMind lineage was born from a singular conviction that decentralized finance deserves decentralized intelligence. This journey began with **DMind-1**, which shattered the monopoly of closed-source AI by releasing the world's first Web3-native LLM. It continued with **DMind-2**, which proved that domain-specific fine-tuning could outmaneuver trillion-parameter giants in vertical benchmarks.
46
 
47
+ 🚀 **A Paradigm Shift**
48
+ **DMind-3** represents our most significant evolutionary leap yet. We recognized that in the high-stakes environment of DeFi, standard knowledge retrieval is a liability. A model must do more than recite facts. It must possess **Reflective Intelligence** (System 2 Thinking) to navigate risk.
49
 
50
+ **DMind-3-mini** embodies this philosophy. Positioned as the Brain within our local ecosystem, it bridges the gap between the real-time reflexes of the edge-side *DMind-3-nano* and the macroscopic foresight of the cloud-native *DMind-3-max*. It is engineered not as a chatbot, but as a **Computational Financial Actuary** designed to bring institutional-grade logic to the individual sovereign user within a privacy-first, offline-capable engine.
51
 
52
+ ## 2. ⚙️ Model Details
53
 
54
  * **Model Name:** DMind-3-mini
55
  * **Organization:** DMind & Zhulong AI (Hangzhou, China)
56
  * **Base Architecture:** Qwen3-4B-Thinking-2507 (Customized Transformer w/ RoPE)
57
  * **Parameter Count:** 4.2 Billion
58
  * **Precision:** **BF16 (Native)**
59
+ * *⚠️ Note: We strictly advise against 4-bit quantization for financial logic tasks to preserve numerical precision in APY/IL calculations.*
60
  * **Context Window:** 128k tokens
61
  * **Hardware Requirement:** GPU with $\ge$ **12GB VRAM** (Recommended: NVIDIA RTX 4070Ti+, Apple M3/M4 Pro/Max).
62
 
63
+ ## 3. 🔬 Methodology: C³-SFT
64
 
65
+ **DMind-3-mini** introduces **Contrastive Chain-of-Correction Supervised Fine-Tuning (C³-SFT)**. Unlike standard SFT which models a direct mapping \\(P(y|x)\\), C³-SFT forces the model to navigate a Correction Trajectory by contrasting against plausible but flawed reasoning.
 
66
 
67
  ![Figure 1: C3-SFT Paradigm](https://your-image-host-url/figure1-c3sft.png)
68
  *(Figure 1: The C³-SFT training pipeline, illustrating the Triplet Data Structure and Contextual Loss Masking)*
69
 
70
+ ### Mathematical Formalization
71
+ The optimization objective \\(\mathcal{L}_{C^3}\\) is defined as maximizing the conditional probability of the correction path given the error context:
72
 
73
  $$
74
  \mathcal{L}_{C^3}(\theta) = - \mathbb{E}_{\mathcal{D}} \left[ \sum_{t=1}^{T} \alpha_t \cdot \log P_\theta(y^+_{t} \mid x, y^-, y^+_{<t}) \right] + \lambda \mathbb{KL}[\pi_\theta || \pi_{\text{ref}}]
75
  $$
76
 
77
  * \\(\mathcal{D} = \{(x, y^-, y^+_{cot})\}_{i=1}^N\\) represents the training dataset containing financial query triplets.
78
+ * \\(y^-\\) denotes the Negative Sample containing common logical fallacies.
79
  * \\(y^+_{cot}\\) is the corrective Chain-of-Thought that the model aims to generate.
80
+ * \\(\alpha_t\\) is a dynamic attention weight that penalizes logical discontinuities.
 
81
 
82
+ ### Dual-State Inference Mechanism
83
  During inference, DMind-3-mini operates in two distinct topological modes based on the presence of a trigger token \\(\tau\\). Let \\(\mathcal{I}(x)\\) denote the inference function:
84
 
85
  $$
 
90
  \end{cases}
91
  $$
92
 
93
+ * **Standard Mode:** Optimized for latency.
94
+ * **Audit Mode:** The model internally generates a latent negative hypothesis \\(\mathcal{G}_{neg}(x)\\) and applies the critique operator \\(\mathcal{H}_{crit}\\) to derive a rigorously verified conclusion.
 
95
 
96
+ ## 4. 💡 Intended Use: Web3 Financial Know-How
97
 
98
+ DMind-3-mini is not just a coder. It is a **Financial Risk Assessor**.
99
 
100
  ### Key Capabilities
101
+ * 📊 **Yield Attribution Analysis:** Deconstructs APY sources to distinguish between Real Yield (Protocol Revenue) and Inflationary Yield (Token Emissions).
102
+ * 🌊 **Liquidity Provisioning (LP) Simulation:** Calculates optimal tick ranges for Uniswap V3 positions by modeling volatility surfaces locally.
103
+ * 🕵️‍♂️ **Risk-Adjusted Code Auditing:** Beyond syntax errors, it identifies economic exploits such as Flash Loan attack vectors based on price manipulation.
104
 
105
+ ### The Brain & Shield Ecosystem
106
  For maximum security, we recommend the **DMind Local Stack**:
107
 
108
  ![Figure 2: Brain and Shield Ecosystem](https://your-image-host-url/figure2-ecosystem.png)
109
+ *(Figure 2: The On-Device Inference Ecosystem showing the synergy between Nano and Mini)*
110
 
111
+ * **The Brain (DMind-3-mini):** Runs on your high-performance laptop. Handles complex strategy formulation, deep research, and System 2 logic.
112
+ * **The Shield (DMind-3-nano):** Runs in your browser/wallet background. Handles real-time transaction signing safety checks and System 1 intuition.
113
 
114
+ ## 5. 📚 Training Data
115
 
116
  The model was fine-tuned on **82,000** high-value private samples:
117
 
 
122
  | **Smart Contract Audits** | 20% | C³-SFT formatted pairs: \\(\text{Vulnerable Code} \to \text{Exploit Analysis} \to \text{Fix}\\). |
123
  | **On-Chain Behavior Logs** | 10% | Parsed intent analysis of "Smart Money" wallet operations during high volatility events. |
124
 
125
+ ## 6. 🏆 Performance Benchmarks
126
 
127
  Evaluated on **Web3-Finance-Eval-2026**:
128
 
 
135
 
136
  *DMind-3-mini outperforms generalist models 15x its size in specific vertical tasks, validating the C³-SFT approach.*
137
 
138
+ ## 7. ⚖️ Limitations & Disclaimer
139
 
140
  * **High Hardware Barrier:** Due to the decision to retain BF16 precision for financial accuracy, this model **requires >= 12GB VRAM**. It is not suitable for standard office laptops.
141
+ * **Knowledge Cutoff:** While the logic is robust, specific protocol data is limited to the training cutoff. Use with RAG for real-time data.
142
+ * **Legal Disclaimer:** This model is an **analytical tool**, not a financial advisor. The output (NFA) should never be the sole basis for investment decisions. The developers assume no liability for financial losses.