Default provider to zerogpu_transformers on HF Spaces; drop bogus README env block e9ef2b5 verified unity4ar commited on 17 days ago
Load model on cuda at module level (canonical ZeroGPU pattern) a475083 verified unity4ar commited on 17 days ago
Refactor: Docker+llama.cpp -> Gradio SDK + ZeroGPU transformers backend 7036a02 verified unity4ar commited on 17 days ago