Commit History

Re-deploy llama.cpp + GGUF CPU serving (default Q4_K_M); fast CPU inference
e44cdab
verified

Bhuvandesai commited on

Revert "Make torch/transformers/peft lazy imports so CPU Space boots without them"
2f03b0e

Bhuvandesai commited on

Make torch/transformers/peft lazy imports so CPU Space boots without them
eac26c7

Bhuvandesai commited on

Fix inference device bug, auto-load model, remove fine-tuning console, UI cleanup
f5ce94d

Bhuvandesai commited on

initial deployment
55159b1

Bhuvandesai commited on