feat(commentary): refine rafters-critic persona and improve commentary prompt for humor 26bc5b9 agharsallah committed 14 days ago
feat(media): enhance media request handling with timeout configuration and improved serialization 846eb30 agharsallah committed 14 days ago
feat(media): update media configuration for FLUX.2 and VoxCPM2 models with enhanced details 3fd114a agharsallah committed 14 days ago
feat(media): introduce MediaRouter and stubs for image and speech generation 8400d8c agharsallah committed 14 days ago
fix: Update GPU configuration for NVIDIA models to use A100 instead of H200 bd9568a agharsallah committed 15 days ago
fix: Update Python version to 3.13 for compatibility with local deploy environment 965bf0f agharsallah committed 16 days ago
Refactor code structure for improved readability and maintainability a464e67 agharsallah committed 16 days ago
feat: Handle FP8 KV cache incompatibility with snapshot models in build command e334e95 agharsallah committed 16 days ago
feat: Add FP8 quantization support for model serving with environment overrides e3dfec9 agharsallah committed 16 days ago
feat: Enhance deployment instructions and update web server label formatting 1bc1435 agharsallah committed 16 days ago
feat: Implement well-known typed fields for verdicts in output models ce159dc agharsallah committed 16 days ago
fix(modal): shorten cascade slug to fit DNS limit; add endpoint health tooling f1bec25 agharsallah Codex committed 19 days ago
feat: refine structured output handling with guided decoding and mode adjustments 330d790 agharsallah committed 20 days ago
feat(logging): implement structured JSON logging for vLLM and enhance observability 6ca7a5f agharsallah committed 20 days ago
feat: enhance agent personas and improve model performance tuning 40a30b6 agharsallah committed 21 days ago
fix: update Google Gemma model configuration to use instruction-tuned version c1a916b agharsallah committed 21 days ago
feat: add Nemotron-Cascade-14B-Thinking model configuration to catalogue and README dfae9ee agharsallah committed 21 days ago
feat: unify model catalogue and self-hosted routing 9dd6dab agharsallah Codex committed 21 days ago
fix: update endpoint URLs to reflect new app naming conventions 7cedfb2 agharsallah committed 21 days ago
feat: add OpenAPI docs and opt-in bearer auth via env-var secret 57b8237 agharsallah Codex committed 22 days ago
feat: add Modal model-serving layer, one app per provider 8a801e8 agharsallah Codex committed 22 days ago