Text-to-Audio
Diffusers
English
text-video-to-audio
text-controlled-video-to-audio
audio-controlled-video-to-audio
audio-generation
Update README.md
README.md
CHANGED
@@ -83,7 +83,7 @@ If you find this project useful, please consider giving a star ⭐️~
 - [2026-04] [Project page](https://yjx-research.github.io/ControlFoley_web_page/) is now live.
 - [2026-04] [Inference code](https://github.com/xiaomi-research/controlfoley) and [pretrained models](https://huggingface.co/YJX-Xiaomi/ControlFoley) are released.
 - [2026-04] Online demo is available on [Project Inference Interface](https://yjx-research.github.io/ControlFoley_web_page/#try-gen).
-- [
+- [2026-04] Skill [ControlFoley Audio Generator](https://clawhub.ai/yjx-research/controlfoley-audio-generator) released.
 
 <hr style="border: none; border-top: 3px solid #333; margin: 16px 0;">
 
@@ -93,7 +93,7 @@ If you find this project useful, please consider giving a star ⭐️~
 - [x] Launch project page.
 - [x] Release inference code and pretrained models.
 - [x] Launch online inference demo (available on project page).
-- [
+- [x] Release skill.
 
 <hr style="border: none; border-top: 3px solid #333; margin: 16px 0;">
 