anktechsol commited on
Commit
e89521f
·
verified ·
1 Parent(s): 9c878f1

Phase 2 Quick Fix: Switch to clean Qwen2.5-0.5B-Instruct base model

Browse files

CRITICAL FIX: Replaced corrupted anktechsol/anki-2.5 model with clean Qwen/Qwen2.5-0.5B-Instruct base model.

Issue: Previous model (checkpoint-1000) was producing gibberish outputs with hallucinations and token artifacts.

Solution: Switching to official Qwen2.5-0.5B-Instruct model which is properly instruction-tuned and should provide coherent responses.

This is a temporary fix. Next step: Proper LoRA fine-tuning on quality chat data from scratch.

Files changed (1) hide show
  1. app.py +1 -1
app.py CHANGED
@@ -5,7 +5,7 @@ from transformers import TextIteratorStreamer
5
  from threading import Thread
6
 
7
  # Load model and tokenizer at startup
8
- model_name = "anktechsol/anki-2.5"
9
  print(f"Loading model {model_name}...")
10
  tokenizer = AutoTokenizer.from_pretrained(model_name)
11
  tokenizer.pad_token = tokenizer.eos_token
 
5
  from threading import Thread
6
 
7
  # Load model and tokenizer at startup
8
+ model_name = "Qwen/Qwen2.5-0.5B-Instruct"
9
  print(f"Loading model {model_name}...")
10
  tokenizer = AutoTokenizer.from_pretrained(model_name)
11
  tokenizer.pad_token = tokenizer.eos_token