File size: 4,643 Bytes
22f8db5
6687019
b1ffa84
6687019
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
22f8db5
 
6687019
 
22f8db5
289a418
 
22f8db5
289a418
 
22f8db5
6687019
22f8db5
289a418
22f8db5
6687019
 
 
 
 
22f8db5
289a418
 
 
6687019
 
 
22f8db5
6687019
b1ffa84
289a418
22f8db5
6687019
289a418
 
22f8db5
6687019
289a418
 
22f8db5
289a418
22f8db5
6687019
b1ffa84
289a418
b1ffa84
6687019
 
 
 
289a418
 
b1ffa84
6687019
 
289a418
 
b1ffa84
6687019
22f8db5
6687019
22f8db5
289a418
22f8db5
289a418
 
 
 
 
 
 
 
22f8db5
6687019
 
 
289a418
6687019
289a418
22f8db5
6687019
22f8db5
289a418
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
22f8db5
6687019
22f8db5
289a418
22f8db5
289a418
 
6687019
289a418
 
22f8db5
289a418
 
 
 
6687019
71e2140
6687019
71e2140
289a418
71e2140
6687019
71e2140
b1ffa84
6687019
 
b1ffa84
71e2140
6687019
22f8db5
289a418
22f8db5
6687019
 
 
22f8db5
b1ffa84
22f8db5
289a418
 
 
22f8db5
6687019
289a418
 
 
6687019
289a418
22f8db5
6687019
22f8db5
289a418
22f8db5
289a418
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
---
license: mit
language:
  - en
tags:
  - finance
  - financial-reasoning
  - adversarial
  - critique-model
  - lora
  - safety
  - trading
datasets:
  - wmaousley/minicrit-training-12k
  - wmaousley/finrebut-600
model-index:
  - name: MiniCrit-1.5B
    results:
      - task:
          type: text-classification
          name: Financial Reasoning Critique
        dataset:
          name: MiniCrit-Training-12k
          type: wmaousley/minicrit-training-12k
        metrics:
          - name: Weak Reasoning F1
            type: f1
            value: 0.82
          - name: Hallucination Detection F1
            type: f1
            value: 0.76
---

# 🧠 MiniCrit-1.5B  
**Adversarial Financial Critic LLM for Trading-Rationale Evaluation**

MiniCrit-1.5B is an adversarial financial-critic LLM trained to evaluate, stress-test, and rebut trading rationales produced by other LLMs.  
It serves as a **validator layer** for autonomous or semi-autonomous trading systems where hallucinated logic or weak reasoning may create financial risk.

The model **does not** generate trades.  
It **only** critiques reasoning quality.

---

## πŸ“¦ Model Description

**Base Model:** 1.5B-parameter transformer  
**Tuning Method:** ATAC-LoRA  
**Training Data:**  
- **MiniCrit-Training-12k** (12,132 rationale β†’ critique pairs)  
- **FinRebut-600** curated evaluation set  

**Primary Abilities**
- Detect flawed or risky trading logic  
- Identify hallucinated financial statistics  
- Flag improper use of indicators  
- Provide adversarial rebuttals  
- Validate rationales before execution  

---

## πŸ“š Datasets

### **1. MiniCrit-Training-12k**  
Large-scale dataset of institutional rationale/critique pairs.  
➑ https://huggingface.co/datasets/wmaousley/minicrit-training-12k

### **2. FinRebut-600**  
Curated, high-quality adversarial rebuttal set.  
➑ https://huggingface.co/datasets/wmaousley/finrebut-600

Both datasets are available under **CC-BY-4.0**.

---

## πŸš€ Intended Use

### βœ” Recommended:
- Validating LLM-generated trading rationales  
- Hallucination detection in financial explanations  
- Model-to-model critique pipelines  
- AI-safety analysis for financial agents  
- Research in adversarial financial reasoning  

### ❌ Not Recommended:
- Generating trades  
- Investment decision-making  
- Fully autonomous trading without human review  

This model is for **research** and **evaluation** only.

---

## πŸ“ˆ Performance

### Forward-Test (Paper Trading)
| Metric | Value |
|--------|-------|
| Sharpe (baseline) | +0.20 |
| Sharpe (MiniCrit-validated) | **+0.80** |
| Hallucination reduction | **–48%** |
| Weak-reasoning detection F1 | **0.82** |
| Hallucination F1 | **0.76** |

### Qualitative Strengths
- Detects regime mismatch  
- Identifies liquidity illusions  
- Flags circular or self-justifying logic  
- Highlights data-mining  
- Generates strong evidence-demanding rebuttals  

---

## πŸ”§ Usage

> This example works after the full model is uploaded to this repository.

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

model_name = "wmaousley/MiniCrit-1.5B"

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

prompt = """Rationale:
'NVDA is oversold so I will long because RSI is below 30.'

Provide a critique.
"""

inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(
    **inputs,
    max_new_tokens=200,
    do_sample=False,
    temperature=0.0,
)

print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

---

## πŸ›‘οΈ Safety & Limitations

### Model Risks
- May produce overly forceful critiques  
- Sensitive to prompt phrasing  
- Limited deep macroeconomic understanding  
- Not a trading or financial-advice model  

### Mitigations
- Does not produce trade signals  
- Outputs critique only  
- Warns about high-risk reasoning patterns  
- Datasets avoid target-label leakage  

---

## πŸ“„ Citation

If you use MiniCrit-1.5B, please cite:

```
Ousley, W. A. (2025). MiniCrit-1.5B: Adversarial Financial Critic Model.
Zenodo. https://doi.org/10.5281/zenodo.17594497
```

---

## πŸ‘€ Author

**William Alexander Ousley**  
AI/ML Researcher β€” Autonomous Trading Systems  
ORCID: https://orcid.org/0009-0009-2503-2010  

---

## 🀝 Contributions

Pull requests welcome.

Ideal contributions include:  
- Dataset expansions  
- Adversarial-evaluation benchmarks  
- Safety improvements  
- ATAC-LoRA optimization  
- Forward-test research  

---

## πŸ“¬ Contact

πŸ“§ Email: **founders@antagon.ai**  
πŸ”— GitHub: https://github.com/wmaousley