fix(docker): resolve llama-cpp-python module import error fd4e818 CrazyMonkey0 committed on Dec 14, 2025
refactor(chat): migrate from transformers to llama-cpp-python using Qwen 3B 6151d5f CrazyMonkey0 committed on Dec 14, 2025
chore(docker): reintroduce llama-cpp-python pre-built wheel for faster build 23187e2 CrazyMonkey0 committed on Dec 14, 2025
feat(nlp): Optimize CPU usage for Hugging Face Spaces Free Tier 75451ba CrazyMonkey0 committed on Dec 12, 2025
feat(nlp): reintroduce Qwen2.5-1.5B-Instruct model and migrate back to Transformers 94cf754 CrazyMonkey0 committed on Dec 12, 2025
feat(llama): another attempt to integrate llama-cpp with the Qwen3-8B-Q4_K_M.gguf model 9bb78b3 CrazyMonkey0 committed on Dec 12, 2025
fix(nlp): update Llama loading to use from_pretrained() f7ec4f4 CrazyMonkey0 committed on Dec 11, 2025
feat(python): Change Python version to 3.11-bullseye for llama-cpp-python prebuilt wheel f45e402 CrazyMonkey0 committed on Dec 11, 2025
fix(docker): start of the model when building the Docker image 9c29a0e CrazyMonkey0 committed on Dec 11, 2025
feat(nlp): add llama.cpp support for Qwen3-8B-Q5_K_M.gguf and download models b2565e9 CrazyMonkey0 committed on Dec 11, 2025