Commit History

🐛 Correctly unset loaded state after unloading
9f66c7c

autumnssuns committed on

✨ Implement lazy loading for models and correct token counting
e4b3020

autumnssuns committed on

✨ Add Gemma 4 E2B model integration and update service to support multiple models
21bfda5

autumnssuns committed on

⚗️ Move spaces.GPU to app.py endpoints
22af552

autumnssuns committed on

🩺 Add debug statement for Llama text generation
b57a35c

autumnssuns committed on

👔 Update model to immediately initialise instead of lazy loading
bcf818c

autumnssuns committed on

✨ Implement model gateway with Llama model integration and /generate API endpoint
6efef64

autumnssuns committed on