Commit History

Update Dockerfile with main.py copy
47bf7ec

Andrew McCracken Claude committed on

Copy main.py to Docker image to pick up syntax fixes
7c501a5

Andrew McCracken Claude committed on

Fix remaining f-string syntax errors
8f62d83

Andrew McCracken Claude committed on

Fix f-string syntax error in streaming endpoint
c53e66f

Andrew McCracken Claude committed on

Switch to GPU-enabled Docker image
2f6841c

Andrew McCracken Claude committed on

Add thread-safety for concurrent users
cfc97b4

Andrew McCracken Claude committed on

Configure uvicorn for concurrent request handling
6b0a701

Andrew McCracken Claude committed on

Add concurrent request handling with model pool
efd4459

Andrew McCracken Claude committed on

Add GPU support
bfa102d

Andrew McCracken Claude committed on

Revert to simpler configuration - optimizations caused slowdown
457c9e1

Andrew McCracken Claude committed on

Optimize for faster inference
8cfe5b7

Andrew McCracken Claude committed on

Increase threads to 8 for faster inference
3f2ee19

Andrew McCracken Claude committed on

Disable RAG for faster inference
2a55dc3

Andrew McCracken Claude committed on

Optimize for 8vCPU/32GB instance
b7fb901

Andrew McCracken Claude committed on

Optimize model parameters for faster CPU inference
6e83384

Andrew McCracken Claude committed on

Fix API endpoint to use relative URL instead of localhost
1b98923

Andrew McCracken Claude committed on

Add test interface HTML to Docker image
403caa4

Andrew McCracken Claude committed on

Add knowledge_db directory and remove deprecated env var
7942e87

Andrew McCracken Claude committed on

Fix Hugging Face cache permissions
b721fc6

Andrew McCracken Claude committed on

Fix /app/models directory permissions
3b792bc

Andrew McCracken Claude committed on

Fix /data directory permissions for HF Spaces
28a4990

Andrew McCracken Claude committed on

Use pre-built Docker image from Docker Hub
cf74856

Andrew McCracken Claude committed on

Use pre-built Docker image from Docker Hub
ccc0289

Andrew McCracken committed on

Fix: Install llama-cpp-python at startup to /tmp to avoid build timeout
0a1ff0d

Andrew McCracken committed on

Fix: Install cmake and build-essential for llama-cpp-python build
8e854ef

Andrew McCracken committed on

Fix: Force use of pre-built llama-cpp-python wheels
a922ca8

Andrew McCracken committed on

Fix: Install llama-cpp-python at runtime (HF Spaces workaround)
3fad655

Andrew McCracken committed on

Optimize: Use pre-built llama-cpp-python wheels for faster builds
6e61395

Andrew McCracken committed on

Initial deployment to Spaces
2fb680d

Andrew McCracken committed on

initial commit
7ba4b03
verified

tech-daskalos committed on