Use dynamic GPU duration based on text length 17abbee Harry Coultas Blum committed on 7 days ago
Reduce GPU duration for generate function from 120 to 10 seconds 641f3e1 Harry Coultas Blum committed on 7 days ago
Simplify to non-streaming render for ZeroGPU compatibility 270e056 Harry Coultas Blum committed on 28 days ago
Wrap streaming inference in @spaces.GPU, yield chunks after 70d0e43 Harry Coultas Blum committed on 28 days ago
Revert to float32 model loading - codec needs float32, dtype casts in model.py handle ZeroGPU SDPA b425248 Harry Coultas Blum committed on 28 days ago
Add streaming audio generation with Web Audio player fee1df4 Harry Coultas Blum committed on 28 days ago
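
The most recent commit (17abbee) scales the requested GPU duration with the input text length. The commit's actual code is not shown here, so the following is only a minimal sketch of the idea. The function name, the constants, and the per-100-characters scaling rule are all assumptions made for illustration; the 10 and 120 second bounds are borrowed from the message of commit 641f3e1, not from the real implementation.

```python
import math

# Hypothetical constants: the values the Space actually uses are not shown.
BASE_SECONDS = 10           # assumed floor for short inputs
SECONDS_PER_100_CHARS = 2   # assumed extra budget per 100 characters
MAX_SECONDS = 120           # assumed ceiling for very long inputs


def estimate_gpu_duration(text: str) -> int:
    """Return a GPU time budget in seconds that grows with the text length."""
    # Round the character count up to whole blocks of 100 characters.
    blocks = math.ceil(len(text) / 100)
    # Add the per-block budget to the floor, then clamp to the ceiling.
    return min(BASE_SECONDS + blocks * SECONDS_PER_100_CHARS, MAX_SECONDS)


# On a ZeroGPU Space, a function like this could be passed as the duration,
# assuming the installed `spaces` package accepts a callable there:
#
#     @spaces.GPU(duration=estimate_gpu_duration)
#     def generate(text: str): ...
```

Keeping the estimate a pure function of the input makes it easy to check on its own, and the clamp prevents an unusually long input from requesting an unbounded GPU slot.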