chore: suppress Qwen config warnings in greedy generation 54c5680 Mustafa Tag Eldeen committed on 5 days ago
swap: Qwen2.5-0.5B-Instruct (faster, reliable #### format) + fix dp2 detection bug 80dbe0c Mustafa Tag Eldeen committed on 5 days ago
feat: pre-load model at startup to avoid 200s first-request wait 075304e Mustafa Tag Eldeen committed on 5 days ago
fix: handle None gold_prob/prod_prob in steps display cba93cb Mustafa Tag Eldeen committed on 5 days ago
fix: context-aware Step token detection for TinyLlama (handle multiple token IDs) 4b00778 Mustafa Tag Eldeen committed on 5 days ago
Restore top_p_presence_next field in TimestepArtifacts to match research code 963de15 Mustafa Tag Eldeen committed on 5 days ago
Fix: remove top_p_presence_next arg not in TimestepArtifacts 84c3309 Mustafa Tag Eldeen committed on 5 days ago
Fix Python 3.13 compat: add audioop-lts for gradio, bump to gradio 5 e90989e Mustafa Tag Eldeen committed on 5 days ago
Switch to TinyLlama-1.1B for CPU, remove bitsandbytes f688a6e Mustafa Tag Eldeen committed on 5 days ago