Commit History

feat(commentary): refine rafters-critic persona and improve commentary prompt for humor
26bc5b9

agharsallah commited on

feat(media): enhance media request handling with timeout configuration and improved serialization
846eb30

agharsallah commited on

feat(media): update media configuration for FLUX.2 and VoxCPM2 models with enhanced details
3fd114a

agharsallah commited on

feat(media): introduce MediaRouter and stubs for image and speech generation
8400d8c

agharsallah commited on

fix: Update GPU configuration for NVIDIA models to use A100 instead of H200
bd9568a

agharsallah commited on

fix: Update Python version to 3.13 for compatibility with local deploy environment
965bf0f

agharsallah commited on

Refactor modal service and logging setup
5d4ef87

agharsallah commited on

Refactor code structure for improved readability and maintainability
a464e67

agharsallah commited on

feat: Handle FP8 KV cache incompatibility with snapshot models in build command
e334e95

agharsallah commited on

feat: Add FP8 quantization support for model serving with environment overrides
e3dfec9

agharsallah commited on

feat: Enhance deployment instructions and update web server label formatting
1bc1435

agharsallah commited on

feat: Implement well-known typed fields for verdicts in output models
ce159dc

agharsallah commited on

fix(modal): shorten cascade slug to fit DNS limit; add endpoint health tooling
f1bec25

agharsallah Codex commited on

feat: refine structured output handling with guided decoding and mode adjustments
330d790

agharsallah commited on

feat(logging): implement structured JSON logging for vLLM and enhance observability
6ca7a5f

agharsallah commited on

feat: enhance agent personas and improve model performance tuning
40a30b6

agharsallah commited on

feat: Enhance model routing and deployment flexibility
c1656a8

agharsallah commited on

fix: update Google Gemma model configuration to use instruction-tuned version
c1a916b

agharsallah commited on

feat: add Nemotron-Cascade-14B-Thinking model configuration to catalogue and README
dfae9ee

agharsallah commited on

feat: unify model catalogue and self-hosted routing
9dd6dab

agharsallah Codex commited on

fix: update endpoint URLs to reflect new app naming conventions
7cedfb2

agharsallah commited on

fix: update Python version to 3.13 for compatibility
385faa1

agharsallah commited on

feat: add OpenAPI docs and opt-in bearer auth via env-var secret
57b8237

agharsallah Codex commited on

feat: add Modal model-serving layer, one app per provider
8a801e8

agharsallah Codex commited on