TADA • Collection • TADA: A Generative Framework for Speech Modeling via Text-Acoustic Dual Alignment | https://huggingface.co/papers/2602.23068 • 7 items • Updated Mar 24 • 71
Trackers 🔥 • Running on T4 • Agents • Featured • 78 • Track objects in your video and get an annotated result
Omni Video Factory 🏆 • Running on Zero • Agents • Featured • 1.02k • Text to video, image to video, video extend
GLM OCR Demo 📄 • Running on Zero • MCP • Featured • 96 • Multimodal OCR model for complex document understanding