Commit History

Redesign UI: flat minimal layout, no gradients/glass; add project stats panel and schema row counts
61c1901
verified

Bhuvandesai committed on

Rewrite CPU banner (T4 now paid; 5-8min load, 2-4min query) and add on-page FAQ
c935f9c
verified

Bhuvandesai committed on

Update Space README: GGUF/llama.cpp CPU serving, highlights, accurate timing
9748758
verified

Bhuvandesai committed on

Remove unreliable base-model comparison card; single fine-tuned result + wall-clock timer + accurate CPU banner
9497393
verified

Bhuvandesai committed on

Fix CPU thread oversubscription: cap n_threads to 2 (cpu-basic vCPUs) for faster generation
4853f12
verified

Bhuvandesai committed on

Fix build OOM (exit 137): CMAKE_BUILD_PARALLEL_LEVEL=1 (single-threaded compile)
bf1a6c4
verified

Bhuvandesai committed on

Fix inference SIGSEGV: pin llama-cpp-python==0.3.19 (0.3.2 crashed on this GGUF); verified locally
2ee2903
verified

Bhuvandesai committed on

Build llama-cpp-python from source (glibc); prebuilt wheels are musl-only for phi3-capable versions
f438c10
verified

Bhuvandesai committed on

Pin llama-cpp-python==0.3.19 + --only-binary (use prebuilt CPU wheel, no source build)
2641362
verified

Bhuvandesai committed on

Re-deploy llama.cpp + GGUF CPU serving (default Q4_K_M); fast CPU inference
e44cdab
verified

Bhuvandesai committed on

Revert "Migrate CPU serving to llama.cpp + Q5_K_M GGUF (was bf16 transformers)"
d031aeb

Bhuvandesai committed on

Revert "Make torch/transformers/peft lazy imports so CPU Space boots without them"
2f03b0e

Bhuvandesai committed on

Make torch/transformers/peft lazy imports so CPU Space boots without them
eac26c7

Bhuvandesai committed on

Migrate CPU serving to llama.cpp + Q5_K_M GGUF (was bf16 transformers)
564ad28

Bhuvandesai committed on

Update CPU warning banner with accurate load and query times
e4ff54d

Bhuvandesai committed on

Fix inference device bug, auto-load model, remove fine-tuning console, UI cleanup
f5ce94d

Bhuvandesai committed on

Add README with HF Spaces config
6227517

Bhuvandesai committed on

initial deployment
55159b1

Bhuvandesai committed on