Implement local in-process inference backend for transformers models c6cdf25 agharsallah commited on 21 days ago
feat: Implement llama.cpp backend for local inference with GGUF models 0f11b49 agharsallah commited on 21 days ago
feat: update HF model catalogue to prioritize chat-capable model and adjust tests for new routing logic 66f0e23 agharsallah commited on 25 days ago