File size: 2,655 Bytes
b68ff8b | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 | ---
license: other
license_name: stability-ai-community
license_link: LICENSE
base_model:
- stabilityai/stable-audio-open-1.0
tags:
- music
- audio
- text-to-audio
- music-generation
- loop
- sample
- stable-audio
- safetensors
- comfyui
pipeline_tag: text-to-audio
---
# Foundation-1 — Mirror
**BPM/Key-Aware Music Sample Generator**
[Original Model](https://huggingface.co/RoyalCities/Foundation-1) by [RoyalCities](https://huggingface.co/RoyalCities) · Fine-tuned on [stable-audio-open-1.0](https://huggingface.co/stabilityai/stable-audio-open-1.0)
> This is an **ungated mirror** of the Foundation-1 model weights for use with [ComfyUI-FFMPEGA](https://github.com/AEmotionStudio/ComfyUI-FFMPEGA). All credits go to the original authors.
## What's in This Repo
| File | Description | Size |
|------|-------------|------|
| `Foundation_1.safetensors` | FP16 model checkpoint | ~3 GB |
| `model_config.json` | Model architecture config | ~1 KB |
## What Foundation-1 Does
Foundation-1 generates production-ready musical loops with fine-grained control over:
- **Musical structure**: BPM, bars, time signatures, key/mode
- **Instrument identity**: 30+ instrument families (synth, strings, brass, guitar, etc.)
- **Timbral control**: 100+ timbre tags (warm, bright, gritty, glassy, etc.)
- **FX prompting**: Reverb, delay, chorus, distortion, and more
- **Loop fidelity**: Seamless, tempo-synced loops designed for layering
## Usage with ComfyUI-FFMPEGA
These weights are auto-downloaded by the **FFMPEGA Agent** node:
1. Set `llm_model` to `none`
2. Set `no_llm_mode` to `generate_sample`
3. Enter a prompt like: *"Synth, Pad, Warm, Wide, Lush, 120 BPM, 4 Bars, C major"*
The model (~3 GB) downloads on first use to `ComfyUI/models/foundation1/`.
## Hardware Requirements
- **VRAM**: ~7 GB during generation
- **Generation speed**: ~7–8 seconds per sample (RTX 3090)
## Prompt Structure
Foundation-1 uses a layered prompt system:
```
[Instrument Family], [Sub-Family], [Timbre Tags], [FX Tags], [BPM], [Bars], [Key]
```
Example: `"Synth, Lead, Bright, Sharp, Saw, Detune, Delay Ping Pong, 140 BPM, 8 Bars, A minor"`
## License
These model weights are released under the [Stability AI Community License](./LICENSE).
- ✅ Free for non-commercial use
- ✅ Free for commercial use by entities with annual revenue < $1M USD
- ❌ Entities with annual revenue ≥ $1M need an Enterprise license from Stability AI
## Acknowledgements
- [Foundation-1](https://huggingface.co/RoyalCities/Foundation-1) by RoyalCities
- [Stable Audio Open](https://huggingface.co/stabilityai/stable-audio-open-1.0) by Stability AI
|