Text-to-Audio
Diffusers
Safetensors
English
Chinese
MossSoundEffectPipeline
diffusion
flow-matching
sound-effects
audio-generation
Instructions to use OpenMOSS-Team/MOSS-SoundEffect-v2.0 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use OpenMOSS-Team/MOSS-SoundEffect-v2.0 with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("OpenMOSS-Team/MOSS-SoundEffect-v2.0", dtype=torch.bfloat16, device_map="cuda") prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" image = pipe(prompt).images[0] - Notebooks
- Google Colab
- Kaggle
File size: 504 Bytes
4338367 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 | {
"_class_name": "MossSoundEffectPipeline",
"_diffusers_version": "0.32.0",
"transformer": [
"WanAudioModel",
"transformer"
],
"vae": [
"DAC",
"vae"
],
"text_encoder": [
"Qwen3TextEncoder",
"text_encoder"
],
"tokenizer": [
"AutoTokenizer",
"tokenizer"
],
"scheduler": [
"FlowMatchScheduler",
"scheduler"
],
"dit_variant": "1.3B",
"sample_rate": 48000,
"max_inference_seconds": 30,
"vae_type": "dac",
"text_encoder_type": "qwen3"
} |