Spaces:
Sleeping
Sleeping
File size: 1,335 Bytes
be490c4 739bd96 bedb746 be490c4 06e7bdc | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 | ---
title: LLM Pushback
emoji: 🧠
colorFrom: blue
colorTo: indigo
sdk: streamlit
sdk_version: 1.35.0
python_version: 3.11
app_file: chat_gui.py
pinned: false
---
# Ministral Alignment Lab: BullshitBench Evaluation
A specialized research project focused on aligning small language models (**Ministral-3B** and **Qwen-4B**) to identify and push back against nonsensical logical premises using **Direct Preference Optimization (DPO)** and **Supervised Fine-Tuning (SFT)** on Apple Silicon (MLX).
## 🚀 Key Results
* **Ministral-3B**: Improved from **4.0%** (Baseline) to **74.2%** (SFT V3) Green Rate.
* **Qwen-4B**: Jumped from **35.0%** (Baseline) to **79.0%** (SFT V1) Green Rate with 100% accuracy in Physics.
## 📦 Project Structure
- `chat_gui.py`: Interactive Streamlit dashboard with real-time reasoning visualization.
- `finetune.py` / `post_eval.py`: Core pipeline for LoRA fine-tuning and evaluation.
- `data/`: Curated dataset of reasoning pairs used for alignment.
- `deepseek_evaluation_report.md`: Full technical audit of model performance.
## 🛠️ Setup
1. **Requirements**: `pip install -r requirements.txt`
2. **Run GUI**: `streamlit run chat_gui.py`
## 📊 Deployment
For sharing within your network, see the [Deployment Guide](deployment_guide.md).
---
*Evaluated using DeepSeek R1 14B as the judge.*
|