FunCineForge: A Unified Dataset Toolkit and Model for Zero-Shot Movie Dubbing in Diverse Cinematic Scenes

The FunCineForge dataset toolkit is a fully open-source, locally deployed tool for producing multimodal speech datasets. It converts batches of source film or television data into comprehensive data, including text, speech, video, clues, timestamps, and other information, for training our VTTS dubbing LLM. All pre-trained models have been uploaded to Hugging Face. Dataset samples and demo samples are available at https://anonymous.4open.science/w/FunCineForge/.
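To make the kind of record described above concrete, here is a minimal sketch of how one dataset entry might be represented. The class name `DubbingSample`, its field names, the file paths, and the example clue values are all hypothetical, chosen only for illustration; they are not the toolkit's actual schema.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class DubbingSample:
    """Hypothetical record for one clip; not the toolkit's real schema."""
    video_path: str   # assumed: path to the video clip
    audio_path: str   # assumed: path to the matching speech audio
    text: str         # transcript of the spoken line
    start_sec: float  # timestamp where the line starts in the source
    end_sec: float    # timestamp where the line ends in the source
    clues: List[str] = field(default_factory=list)  # assumed: free-form clue labels

    def duration(self) -> float:
        """Length of the spoken segment in seconds."""
        return self.end_sec - self.start_sec


def is_valid(sample: DubbingSample) -> bool:
    """Basic sanity check: non-empty text and a positive duration."""
    return bool(sample.text.strip()) and sample.duration() > 0


# Illustrative values only; these do not come from the real dataset.
sample = DubbingSample(
    video_path="clips/0001.mp4",
    audio_path="clips/0001.wav",
    text="Where were you last night?",
    start_sec=12.5,
    end_sec=14.0,
    clues=["female", "angry"],
)
```

A check like `is_valid` could be used to filter out empty or zero-length segments before training, though the toolkit's real filtering steps may differ.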
