AEmotionStudio
/

foundation1-models

+---
+license: other
+license_name: stability-ai-community
+license_link: LICENSE
+base_model:
+  - stabilityai/stable-audio-open-1.0
+tags:
+  - music
+  - audio
+  - text-to-audio
+  - music-generation
+  - loop
+  - sample
+  - stable-audio
+  - safetensors
+  - comfyui
+pipeline_tag: text-to-audio
+---
+# Foundation-1 — Mirror
+**BPM/Key-Aware Music Sample Generator**
+[Original Model](https://huggingface.co/RoyalCities/Foundation-1) by [RoyalCities](https://huggingface.co/RoyalCities) · Fine-tuned on [stable-audio-open-1.0](https://huggingface.co/stabilityai/stable-audio-open-1.0)
+> This is an **ungated mirror** of the Foundation-1 model weights for use with [ComfyUI-FFMPEGA](https://github.com/AEmotionStudio/ComfyUI-FFMPEGA). All credits go to the original authors.
+## What's in This Repo
+| File | Description | Size |
+|------|-------------|------|
+| `Foundation_1.safetensors` | FP16 model checkpoint | ~3 GB |
+| `model_config.json` | Model architecture config | ~1 KB |
+## What Foundation-1 Does
+Foundation-1 generates production-ready musical loops with fine-grained control over:
+- **Musical structure**: BPM, bars, time signatures, key/mode
+- **Instrument identity**: 30+ instrument families (synth, strings, brass, guitar, etc.)
+- **Timbral control**: 100+ timbre tags (warm, bright, gritty, glassy, etc.)
+- **FX prompting**: Reverb, delay, chorus, distortion, and more
+- **Loop fidelity**: Seamless, tempo-synced loops designed for layering
+## Usage with ComfyUI-FFMPEGA
+These weights are auto-downloaded by the **FFMPEGA Agent** node:
+1. Set `llm_model` to `none`
+2. Set `no_llm_mode` to `generate_sample`
+3. Enter a prompt like: *"Synth, Pad, Warm, Wide, Lush, 120 BPM, 4 Bars, C major"*
+The model (~3 GB) downloads on first use to `ComfyUI/models/foundation1/`.
+## Hardware Requirements
+- **VRAM**: ~7 GB during generation
+- **Generation speed**: ~7–8 seconds per sample (RTX 3090)
+## Prompt Structure
+Foundation-1 uses a layered prompt system:
+```
+[Instrument Family], [Sub-Family], [Timbre Tags], [FX Tags], [BPM], [Bars], [Key]
+```
+Example: `"Synth, Lead, Bright, Sharp, Saw, Detune, Delay Ping Pong, 140 BPM, 8 Bars, A minor"`
+## License
+These model weights are released under the [Stability AI Community License](./LICENSE).
+- ✅ Free for non-commercial use
+- ✅ Free for commercial use by entities with annual revenue < $1M USD
+- ❌ Entities with annual revenue ≥ $1M need an Enterprise license from Stability AI
+## Acknowledgements
+- [Foundation-1](https://huggingface.co/RoyalCities/Foundation-1) by RoyalCities
+- [Stable Audio Open](https://huggingface.co/stabilityai/stable-audio-open-1.0) by Stability AI