Spaces:
Sleeping
Sleeping
A newer version of the Streamlit SDK is available: 1.57.0
metadata
title: LLM Pushback
emoji: π§
colorFrom: blue
colorTo: indigo
sdk: streamlit
sdk_version: 1.35.0
python_version: 3.11
app_file: chat_gui.py
pinned: false
Ministral Alignment Lab: BullshitBench Evaluation
A specialized research project focused on aligning small language models (Ministral-3B and Qwen-4B) to identify and push back against nonsensical logical premises using Direct Preference Optimization (DPO) and Supervised Fine-Tuning (SFT) on Apple Silicon (MLX).
π Key Results
- Ministral-3B: Improved from 4.0% (Baseline) to 74.2% (SFT V3) Green Rate.
- Qwen-4B: Jumped from 35.0% (Baseline) to 79.0% (SFT V1) Green Rate with 100% accuracy in Physics.
π¦ Project Structure
chat_gui.py: Interactive Streamlit dashboard with real-time reasoning visualization.finetune.py/post_eval.py: Core pipeline for LoRA fine-tuning and evaluation.data/: Curated dataset of reasoning pairs used for alignment.deepseek_evaluation_report.md: Full technical audit of model performance.
π οΈ Setup
- Requirements:
pip install -r requirements.txt - Run GUI:
streamlit run chat_gui.py
π Deployment
For sharing within your network, see the Deployment Guide.
Evaluated using DeepSeek R1 14B as the judge.