Fix: pin gradio==5.12.0 to avoid API schema bug b52235a verified AshkanTaghipour commited on 26 days ago
Switch to transformers CPU inference (Qwen3.5 not yet supported by llama.cpp) 106b2e5 verified AshkanTaghipour commited on 26 days ago
Initial Space: Gradio demo with GGUF CPU inference cb3a879 verified AshkanTaghipour commited on 26 days ago