Commit History

compile llama-cpp-python with -j1 for low-mem build
9ce5e68
verified

SuhaibAtef commited on

add requests (gradio cli imports it)
e342c9a
verified

SuhaibAtef commited on

install llama-cpp-python 0.3.19 wheel directly (no compile)
5941e33
verified

SuhaibAtef commited on

switch to Docker SDK + abetlen prebuilt llama-cpp-python wheel
bef2ed3
verified

SuhaibAtef commited on

switch to transformers + merged fp16 (avoid llama-cpp build OOM)
12d0c59
verified

SuhaibAtef commited on

fix: pin python 3.11 (llama-cpp-python wheels available, avoids py3.13 build)
af116f6
verified

SuhaibAtef commited on

fix: use prebuilt CPU wheels for llama-cpp-python (avoid OOM on builder)
08816ee
verified

SuhaibAtef commited on

fix: bump llama-cpp-python 0.3.2 -> 0.3.20 for Qwen3.5 GGUF support
c3e2d13
verified

SuhaibAtef commited on

fix: bump gradio 4.44 -> 5.9 + audioop-lts for py3.13
d140e3d
verified

SuhaibAtef commited on

initial demo: Qwen3.5-0.8B LoRA q4_k_m GGUF on CPU
5be427a
verified

SuhaibAtef commited on

initial commit
624d00f
verified

SuhaibAtef commited on