CocoBro
/

Foley-Omni

Model card Files Files and versions

CocoBro commited on 2 days ago

Commit

87b4b59

·

verified ·

1 Parent(s): 5d3a675

Update README.md

Files changed (1) hide show

README.md +2 -1

README.md CHANGED Viewed

@@ -6,7 +6,6 @@ pipeline_tag: text-to-audio
 # Foley-Omni
-**Foley-Omni: A Unified Multimodal Generation Model from Task-Level Audio Synthesis to Complete Video Soundtrack Generation**
 [GitHub Code](https://github.com/NJU-Speech/Foley-Omni) | [arXiv](https://arxiv.org/abs/2606.03672) | [Demo](https://ty0402.github.io/Foley-omni-Web/)
@@ -15,6 +14,8 @@ pipeline_tag: text-to-audio
 This repository packages the public inference checkpoint set for **Foley-Omni**.
 The release focuses on **Video-to-Soundtrack (V2ST)** generation, where the model jointly generates synchronized **speech**, **sound effects**, and **music** from a video and optional text prompt.
 ## Repository Contents

 # Foley-Omni
 [GitHub Code](https://github.com/NJU-Speech/Foley-Omni) | [arXiv](https://arxiv.org/abs/2606.03672) | [Demo](https://ty0402.github.io/Foley-omni-Web/)
 This repository packages the public inference checkpoint set for **Foley-Omni**.
 The release focuses on **Video-to-Soundtrack (V2ST)** generation, where the model jointly generates synchronized **speech**, **sound effects**, and **music** from a video and optional text prompt.
+## Model Size
+5.5B
 ## Repository Contents