Re-deploy llama.cpp + GGUF CPU serving (default Q4_K_M); fast CPU inference e44cdab verified Bhuvandesai committed 10 days ago
Revert "Make torch/transformers/peft lazy imports so CPU Space boots without them" 2f03b0e Bhuvandesai committed 17 days ago
Make torch/transformers/peft lazy imports so CPU Space boots without them eac26c7 Bhuvandesai committed 17 days ago
Fix inference device bug, auto-load model, remove fine-tuning console, UI cleanup f5ce94d Bhuvandesai committed 17 days ago