test(tts): checking whether the tts model is working correctly 65952f6 CrazyMonkey0 committed on Dec 16, 2025
fix: optimize FastAPI + Qwen2.5-1.5B for CPU, reduce max_tokens, increase timeout 88284a4 CrazyMonkey0 committed on Dec 16, 2025
feat(docker): install llama-cpp-python directly with OpenBLAS 5adcb69 CrazyMonkey0 committed on Dec 15, 2025
fix(docker): switch to prebuilt llama-cpp-python image on Debian 33c584f CrazyMonkey0 committed on Dec 15, 2025
fix(docker): replacing python:3.12-slim with python:3.12-alpine ad141ba CrazyMonkey0 committed on Dec 15, 2025
fix(docker): add --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/cpu ef2c272 CrazyMonkey0 committed on Dec 15, 2025
fix(docker): replacing python:3.12-alpine with python:3.12-slim 5c256f5 CrazyMonkey0 committed on Dec 15, 2025
fix(docker): replacing python:3.12-slim with python:3.12-alpine bf92c5c CrazyMonkey0 committed on Dec 15, 2025
fix(docker): replace llama-cpp-python extra index with pinned cp312 linux wheel 2a2937c CrazyMonkey0 committed on Dec 15, 2025
fix(docker): changing the option --only-binary=llama-cpp-python to --prefer-binary 8757f21 CrazyMonkey0 committed on Dec 15, 2025
fix(docker): add build dependencies for llama-cpp-python c0cf206 CrazyMonkey0 committed on Dec 14, 2025
fix(docker): resolve llama-cpp-python module import error fd4e818 CrazyMonkey0 committed on Dec 14, 2025
chore(docker): reintroduce llama-cpp-python pre-built wheel for faster build 23187e2 CrazyMonkey0 committed on Dec 14, 2025
feat(nlp): reintroduce Qwen2.5-1.5B-Instruct model and migrate back to Transformers 94cf754 CrazyMonkey0 committed on Dec 12, 2025
fix(nlp): update Llama loading to use from_pretrained() f7ec4f4 CrazyMonkey0 committed on Dec 11, 2025
feat(python): Change Python version to 3.11-bullseye for llama-cpp-python prebuilt wheel f45e402 CrazyMonkey0 committed on Dec 11, 2025
Fix(docker): start of the model when building the Docker image 9c29a0e CrazyMonkey0 committed on Dec 11, 2025