neuphonic/neutts-air
Text-to-Speech β’ 0.7B β’ Updated
β’ 11.9k β’ 855
State-of-the-art target speech extractor
Extreme Super-Resolution via Scale Autoregression
Explore gigascale 3D generation with Direct3DβS2
Watch and experiment with realtime AIβs with visuals
Voice Activity Detection using MarbleNet model
Filter multilingual data for high-quality language models
Transcribe speech and highlight emphasized words
Generate engaging educational videos