Fix GPU detection by adding module-level GPU request and decorator on root endpoint ca562c6 WatNeru committed on Nov 21, 2025
Add token debugging and explicit login for gated model access 8620bbf WatNeru committed on Nov 21, 2025
Fix model download logic for PyTorch models and improve error handling dbb276c WatNeru committed on Nov 21, 2025
Switch from llama-cpp-python to transformers for PyTorch model support adb0f98 WatNeru committed on Nov 21, 2025
Fix ZeroGPU support, improve model loading and error handling df1c720 WatNeru committed on Nov 21, 2025