Implement local in-process inference backend for transformers models c6cdf25 agharsallah commited on 14 days ago
feat: Implement llama.cpp backend for local inference with GGUF models 0f11b49 agharsallah commited on 14 days ago