private-ai-backend / model.py

Commit History

increased n_ctx
6cf1dc4

adeebjamal commited on

Updated LLM to Gemma 4 E4B
f3f9730

adeebjamal commited on

Auto-clear model_cache when model changes to save disk space
0e0d121

adeebjamal commited on

Fix severe CPU thread thrashing by hardcoding n_threads to 2
43330e2

adeebjamal commited on

Change default model to Llama-3.2-3B-Instruct for much faster CPU inference
92b500d

adeebjamal commited on

Fix HF Space timeout
608c93d

adeebjamal commited on

first commit
64c8865

adeebjamal commited on