Bhuvandesai/phi3-text-to-sql-studio

Commit History

Redesign UI: flat minimal layout, no gradients/glass; add project stats panel and schema row counts

61c1901
verified

Bhuvandesai committed 13 days ago

Rewrite CPU banner (T4 now paid; 5-8min load, 2-4min query) and add on-page FAQ

c935f9c
verified

Bhuvandesai committed 13 days ago

Update Space README: GGUF/llama.cpp CPU serving, highlights, accurate timing

9748758
verified

Bhuvandesai committed 13 days ago

Remove unreliable base-model comparison card; single fine-tuned result + wall-clock timer + accurate CPU banner

9497393
verified

Bhuvandesai committed 13 days ago

Fix CPU thread oversubscription: cap n_threads to 2 (cpu-basic vCPUs) for faster generation

4853f12
verified

Bhuvandesai committed 13 days ago

Fix build OOM (exit 137): CMAKE_BUILD_PARALLEL_LEVEL=1 (single-threaded compile)

bf1a6c4
verified

Bhuvandesai committed 13 days ago

Fix inference SIGSEGV: pin llama-cpp-python==0.3.19 (0.3.2 crashed on this GGUF); verified locally

2ee2903
verified

Bhuvandesai committed 13 days ago

Build llama-cpp-python from source (glibc); prebuilt wheels are musl-only for phi3-capable versions

f438c10
verified

Bhuvandesai committed 13 days ago

Pin llama-cpp-python==0.3.19 + --only-binary (use prebuilt CPU wheel, no source build)

2641362
verified

Bhuvandesai committed 13 days ago

Re-deploy llama.cpp + GGUF CPU serving (default Q4_K_M); fast CPU inference

e44cdab
verified

Bhuvandesai committed 13 days ago

Revert "Migrate CPU serving to llama.cpp + Q5_K_M GGUF (was bf16 transformers)"

d031aeb

Bhuvandesai committed 20 days ago

Revert "Make torch/transformers/peft lazy imports so CPU Space boots without them"

2f03b0e

Bhuvandesai committed 20 days ago

Make torch/transformers/peft lazy imports so CPU Space boots without them

eac26c7

Bhuvandesai committed 20 days ago

Migrate CPU serving to llama.cpp + Q5_K_M GGUF (was bf16 transformers)

564ad28

Bhuvandesai committed 20 days ago

Fix bugs

a7bb6b3

Bhuvandesai committed 21 days ago

Fix bugs

52e3716

Bhuvandesai committed 21 days ago

Fix bugs

19cfcb8

Bhuvandesai committed 21 days ago

Update CPU warning banner with accurate load and query times

e4ff54d

Bhuvandesai committed 21 days ago

Fix inference device bug, auto-load model, remove fine-tuning console, UI cleanup

f5ce94d

Bhuvandesai committed 21 days ago

Add README with HF Spaces config

6227517

Bhuvandesai committed 21 days ago

initial deployment

55159b1

Bhuvandesai committed 21 days ago
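
The thread-oversubscription fix in 4853f12 (cap n_threads to 2 on cpu-basic) can be illustrated with a minimal sketch. This is not the Space's actual code: the helper names `pick_n_threads` and `llama_kwargs` and the constant `CPU_BASIC_VCPUS` are hypothetical. The sketch assumes the reason for a hard cap is that `os.cpu_count()` inside a container may report the host's cores rather than the vCPUs actually allotted, so the detected count is only an upper bound.

```python
import os

# Assumption taken from the commit message: cpu-basic exposes 2 vCPUs.
CPU_BASIC_VCPUS = 2


def pick_n_threads(cap: int = CPU_BASIC_VCPUS) -> int:
    """Return a thread count that never exceeds the cap or the visible CPUs."""
    visible = os.cpu_count() or 1  # cpu_count() can return None
    return max(1, min(visible, cap))


def llama_kwargs(model_path: str) -> dict:
    """Build keyword arguments for llama_cpp.Llama (hypothetical helper)."""
    # n_threads is the generation thread count accepted by llama_cpp.Llama.
    return {"model_path": model_path, "n_threads": pick_n_threads()}


if __name__ == "__main__":
    # Placeholder path; the real GGUF filename is not given in the history.
    print(llama_kwargs("model.gguf"))
```

The resulting dictionary would be passed as `Llama(**llama_kwargs(path))`. Keeping the cap in one constant makes it easy to raise if the Space moves to hardware with more vCPUs.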
