fix(inference): correct format of chat history and streaming response b2faa29 Kajlid committed on Dec 2, 2025
chore(requirements): remove llama_cpp requirement from requirements.txt 9a07ae6 Kajlid committed on Dec 2, 2025
fix(requirements): install llama_cpp_python via subprocess to fix HuggingFace error 00ed5b0 Kajlid committed on Dec 2, 2025
fix(requirements): install llama_cpp_python via subprocess to fix HuggingFace error 0c0ee51 Kajlid committed on Dec 2, 2025
fix(llm): download model from huggingface_hub before inference 364f0e9 Kajlid committed on Dec 1, 2025