Copy main.py to Docker image to pick up syntax fixes 7c501a5 Andrew McCracken Claude committed on Oct 14, 2025
Fix f-string syntax error in streaming endpoint c53e66f Andrew McCracken Claude committed on Oct 14, 2025
Configure uvicorn for concurrent request handling 6b0a701 Andrew McCracken Claude committed on Oct 14, 2025
Add concurrent request handling with model pool efd4459 Andrew McCracken Claude committed on Oct 14, 2025
Revert to simpler configuration - optimizations caused slowdown 457c9e1 Andrew McCracken Claude committed on Oct 13, 2025
Optimize model parameters for faster CPU inference 6e83384 Andrew McCracken Claude committed on Oct 13, 2025
Fix API endpoint to use relative URL instead of localhost 1b98923 Andrew McCracken Claude committed on Oct 13, 2025
Add knowledge_db directory and remove deprecated env var 7942e87 Andrew McCracken Claude committed on Oct 13, 2025
Fix /data directory permissions for HF Spaces 28a4990 Andrew McCracken Claude committed on Oct 13, 2025
Fix: Install llama-cpp-python at startup to /tmp to avoid build timeout 0a1ff0d Andrew McCracken committed on Oct 13, 2025
Fix: Install cmake and build-essential for llama-cpp-python build 8e854ef Andrew McCracken committed on Oct 13, 2025
Fix: Force use of pre-built llama-cpp-python wheels a922ca8 Andrew McCracken committed on Oct 13, 2025
Fix: Install llama-cpp-python at runtime (HF Spaces workaround) 3fad655 Andrew McCracken committed on Oct 13, 2025
Optimize: Use pre-built llama-cpp-python wheels for faster builds 6e61395 Andrew McCracken committed on Oct 13, 2025