Overview
NOVA is a pair-free video editing model built on WAN 1.3B Fun InP. It uses sparse keyframe control (e.g., a single edited first frame) to guide dense video synthesis, trained without requiring paired before/after video data.
- Pair-free training via degradation simulation
- Sparse keyframe control: provide one or more edited keyframes
- Optional coarse mask for improved editing accuracy
Usage
See the GitHub repository for installation, inference, training, and the interactive Gradio demo.
Acknowledgements
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
