Spaces:
Running
Running
Phase 2 Quick Fix: Switch to clean Qwen2.5-0.5B-Instruct base model
Browse filesCRITICAL FIX: Replaced corrupted anktechsol/anki-2.5 model with clean Qwen/Qwen2.5-0.5B-Instruct base model.
Issue: Previous model (checkpoint-1000) was producing gibberish outputs with hallucinations and token artifacts.
Solution: Switching to official Qwen2.5-0.5B-Instruct model which is properly instruction-tuned and should provide coherent responses.
This is a temporary fix. Next step: Proper LoRA fine-tuning on quality chat data from scratch.
app.py
CHANGED
|
@@ -5,7 +5,7 @@ from transformers import TextIteratorStreamer
|
|
| 5 |
from threading import Thread
|
| 6 |
|
| 7 |
# Load model and tokenizer at startup
|
| 8 |
-
model_name = "
|
| 9 |
print(f"Loading model {model_name}...")
|
| 10 |
tokenizer = AutoTokenizer.from_pretrained(model_name)
|
| 11 |
tokenizer.pad_token = tokenizer.eos_token
|
|
|
|
| 5 |
from threading import Thread
|
| 6 |
|
| 7 |
# Load model and tokenizer at startup
|
| 8 |
+
model_name = "Qwen/Qwen2.5-0.5B-Instruct"
|
| 9 |
print(f"Loading model {model_name}...")
|
| 10 |
tokenizer = AutoTokenizer.from_pretrained(model_name)
|
| 11 |
tokenizer.pad_token = tokenizer.eos_token
|