feat(media): introduce MediaRouter and stubs for image and speech generation 8400d8c agharsallah commited on 18 days ago
feat: Update model references to MiniCPM5 and adjust related documentation and tests for transformers 5.x compatibility cc8e9f2 agharsallah commited on 19 days ago
Refactor local provider to support on-device loading with accelerate 3bcd1af agharsallah commited on 19 days ago
feat: Add use_cache parameter to support KV cache handling in models and update tests 169b620 agharsallah commited on 20 days ago
feat: Update model routing and local provider to support multi-sponsor tiers and auto-class resolution 7a13e4e agharsallah commited on 20 days ago
feat: Ensure full materialization of model weights to prevent meta tensor issues in ZeroGPU 97f641c agharsallah commited on 20 days ago
fix: Ensure proper unpacking of BatchEncoding in model generation to prevent AttributeError 0dfc318 agharsallah commited on 20 days ago
feat: Replace llama.cpp backend with in-process transformers backend for local GPU inference 7d636f8 agharsallah commited on 20 days ago
Implement local in-process inference backend for transformers models c6cdf25 agharsallah commited on 20 days ago