Redesign UI: flat minimal layout, no gradients/glass; add project stats panel and schema row counts 61c1901 verified Bhuvandesai commited on 13 days ago
Rewrite CPU banner (T4 now paid; 5-8min load, 2-4min query) and add on-page FAQ c935f9c verified Bhuvandesai commited on 13 days ago
Update Space README: GGUF/llama.cpp CPU serving, highlights, accurate timing 9748758 verified Bhuvandesai commited on 13 days ago
Remove unreliable base-model comparison card; single fine-tuned result + wall-clock timer + accurate CPU banner 9497393 verified Bhuvandesai commited on 13 days ago
Fix CPU thread oversubscription: cap n_threads to 2 (cpu-basic vCPUs) for faster generation 4853f12 verified Bhuvandesai commited on 13 days ago
Fix build OOM (exit 137): CMAKE_BUILD_PARALLEL_LEVEL=1 (single-threaded compile) bf1a6c4 verified Bhuvandesai commited on 13 days ago
Fix inference SIGSEGV: pin llama-cpp-python==0.3.19 (0.3.2 crashed on this GGUF); verified locally 2ee2903 verified Bhuvandesai commited on 13 days ago
Build llama-cpp-python from source (glibc); prebuilt wheels are musl-only for phi3-capable versions f438c10 verified Bhuvandesai commited on 13 days ago
Pin llama-cpp-python==0.3.19 + --only-binary (use prebuilt CPU wheel, no source build) 2641362 verified Bhuvandesai commited on 13 days ago
Re-deploy llama.cpp + GGUF CPU serving (default Q4_K_M); fast CPU inference e44cdab verified Bhuvandesai commited on 13 days ago
Revert "Migrate CPU serving to llama.cpp + Q5_K_M GGUF (was bf16 transformers)" d031aeb Bhuvandesai commited on 20 days ago
Revert "Make torch/transformers/peft lazy imports so CPU Space boots without them" 2f03b0e Bhuvandesai commited on 20 days ago
Make torch/transformers/peft lazy imports so CPU Space boots without them eac26c7 Bhuvandesai commited on 20 days ago
Migrate CPU serving to llama.cpp + Q5_K_M GGUF (was bf16 transformers) 564ad28 Bhuvandesai commited on 20 days ago
Update CPU warning banner with accurate load and query times e4ff54d Bhuvandesai commited on 21 days ago
Fix inference device bug, auto-load model, remove fine-tuning console, UI cleanup f5ce94d Bhuvandesai commited on 21 days ago