FunCineForge: A Unified Dataset Pipeline and Model for Zero-Shot Movie Dubbing in Diverse Cinematic Scenes

FunCineForge dataset pipeline is a fully open-source, locally deployed tool for producing multimodal speech datasets. It integrates batches of film or television data from the source into comprehensive data including text, speech, video, clues, timestamps, and other information for training our VTTS dubbing LLM. All pre-trained models have been uploaded to Hugging Face. You can access https://funcineforge.github.io/ to get our dataset samples and demo samples.

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support