swap: Qwen2.5-0.5B-Instruct (faster, reliable #### format) + fix dp2 detection bug 80dbe0c Mustafa Tag Eldeen commited on 5 days ago
feat: pre-load model at startup to avoid 200s first-request wait 075304e Mustafa Tag Eldeen commited on 5 days ago
fix: handle None gold_prob/prod_prob in steps display cba93cb Mustafa Tag Eldeen commited on 5 days ago
fix: context-aware Step token detection for TinyLlama (handle multiple token IDs) 4b00778 Mustafa Tag Eldeen commited on 5 days ago
Switch to TinyLlama-1.1B for CPU, remove bitsandbytes f688a6e Mustafa Tag Eldeen commited on 5 days ago