APi_English / Dockerfile

Commit History

test(tts): checking whether the tts model is working correctly
65952f6

CrazyMonkey0 committed on

fix: optimize FastAPI + Qwen2.5-1.5B for CPU, reduce max_tokens, increase timeout
88284a4

CrazyMonkey0 committed on

perf: implement lazy loading to fix startup timeouts
8f110eb

CrazyMonkey0 committed on

build: use custom Docker base image from Docker Hub
fc8b522

CrazyMonkey0 committed on

feat(docker): install llama-cpp-python directly with OpenBLAS
5adcb69

CrazyMonkey0 committed on

fix(docker): Add python alpine for llama-cpp-python
0442de3

CrazyMonkey0 committed on

fix(requirements): adapting packages for Python 3.13
f42f941

CrazyMonkey0 committed on

fix(docker): switch version python to 3.12
482015f

CrazyMonkey0 committed on

fix(docker): switch to prebuilt llama-cpp-python image on Debian
33c584f

CrazyMonkey0 committed on

fix(docker): repair version dependencies
5d2827d

CrazyMonkey0 committed on

fix(docker): adding a permanent version of llama
06eacd3

CrazyMonkey0 committed on

fix(docker): replacing python:3.12-slim with python:3.12-alpine
ad141ba

CrazyMonkey0 committed on

fix(docker): add --prefer-binary
347b2fa

CrazyMonkey0 committed on

fix(docker): add --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/cpu
ef2c272

CrazyMonkey0 committed on

fix(docker): replacing python:3.12-alpine with python:3.12-slim
5c256f5

CrazyMonkey0 committed on

fix(docker): replacing apt-get with apk
38fe107

CrazyMonkey0 committed on

fix(docker): replacing python:3.12-slim with python:3.12-alpine
bf92c5c

CrazyMonkey0 committed on

fix(docker): replace llama-cpp-python extra index with pinned cp312 linux wheel
2a2937c

CrazyMonkey0 committed on

fix(docker): changing the option --only-binary=llama-cpp-python to --prefer-binary
8757f21

CrazyMonkey0 committed on

fix(docker): add build dependencies for llama-cpp-python
c0cf206

CrazyMonkey0 committed on

fix(docker): resolve llama-cpp-python module import error
fd4e818

CrazyMonkey0 committed on

chore(docker): reintroduce llama-cpp-python pre-built wheel for faster build
23187e2

CrazyMonkey0 committed on

fix(docker): increase gunicorn worker timeout to 120s
101988f

CrazyMonkey0 committed on

feat(nlp): reintroduce Qwen2.5-1.5B-Instruct model and migrate back to Transformers
94cf754

CrazyMonkey0 committed on

fix(llama): fixed a bug with building llama
19edf1b

CrazyMonkey0 committed on

fix(llama): clear cache after llama
ca4e6e3

CrazyMonkey0 committed on

test(models): downloading models from transformers
fe8b413

CrazyMonkey0 committed on

fix(llama.cpp): Add prebuilt llama.cpp to docker
ad5570a

CrazyMonkey0 committed on

fix(nlp): update Llama loading to use from_pretrained()
f7ec4f4

CrazyMonkey0 committed on

feat(python): Change Python version to 3.11-bullseye for llama-cpp-python prebuilt wheel
f45e402

CrazyMonkey0 committed on

Fix(nlp): NLP model download and build fix
89865a6

CrazyMonkey0 committed on

Fix(docker): repair building llama.cpp
74de35c

CrazyMonkey0 committed on

Fix(docker): start of the model when building the Docker image
9c29a0e

CrazyMonkey0 committed on

Fix command to run in dockerfile
0b8f94b

CrazyMonkey0 committed on

Fix python version
420a9d9

CrazyMonkey0 committed on

Fix dockerfile
385c547

CrazyMonkey0 committed on

Initial APP
7eb3110

CrazyMonkey0 committed on