File size: 2,655 Bytes
b68ff8b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
---
license: other
license_name: stability-ai-community
license_link: LICENSE
base_model:
  - stabilityai/stable-audio-open-1.0
tags:
  - music
  - audio
  - text-to-audio
  - music-generation
  - loop
  - sample
  - stable-audio
  - safetensors
  - comfyui
pipeline_tag: text-to-audio
---

# Foundation-1 — Mirror

**BPM/Key-Aware Music Sample Generator**

[Original Model](https://huggingface.co/RoyalCities/Foundation-1) by [RoyalCities](https://huggingface.co/RoyalCities) · Fine-tuned on [stable-audio-open-1.0](https://huggingface.co/stabilityai/stable-audio-open-1.0)

> This is an **ungated mirror** of the Foundation-1 model weights for use with [ComfyUI-FFMPEGA](https://github.com/AEmotionStudio/ComfyUI-FFMPEGA). All credits go to the original authors.

## What's in This Repo

| File | Description | Size |
|------|-------------|------|
| `Foundation_1.safetensors` | FP16 model checkpoint | ~3 GB |
| `model_config.json` | Model architecture config | ~1 KB |

## What Foundation-1 Does

Foundation-1 generates production-ready musical loops with fine-grained control over:

- **Musical structure**: BPM, bars, time signatures, key/mode
- **Instrument identity**: 30+ instrument families (synth, strings, brass, guitar, etc.)
- **Timbral control**: 100+ timbre tags (warm, bright, gritty, glassy, etc.)
- **FX prompting**: Reverb, delay, chorus, distortion, and more
- **Loop fidelity**: Seamless, tempo-synced loops designed for layering

## Usage with ComfyUI-FFMPEGA

These weights are auto-downloaded by the **FFMPEGA Agent** node:

1. Set `llm_model` to `none`
2. Set `no_llm_mode` to `generate_sample`
3. Enter a prompt like: *"Synth, Pad, Warm, Wide, Lush, 120 BPM, 4 Bars, C major"*

The model (~3 GB) downloads on first use to `ComfyUI/models/foundation1/`.

## Hardware Requirements

- **VRAM**: ~7 GB during generation
- **Generation speed**: ~7–8 seconds per sample (RTX 3090)

## Prompt Structure

Foundation-1 uses a layered prompt system:

```
[Instrument Family], [Sub-Family], [Timbre Tags], [FX Tags], [BPM], [Bars], [Key]
```

Example: `"Synth, Lead, Bright, Sharp, Saw, Detune, Delay Ping Pong, 140 BPM, 8 Bars, A minor"`

## License

These model weights are released under the [Stability AI Community License](./LICENSE).

- ✅ Free for non-commercial use
- ✅ Free for commercial use by entities with annual revenue < $1M USD
- ❌ Entities with annual revenue ≥ $1M need an Enterprise license from Stability AI

## Acknowledgements

- [Foundation-1](https://huggingface.co/RoyalCities/Foundation-1) by RoyalCities
- [Stable Audio Open](https://huggingface.co/stabilityai/stable-audio-open-1.0) by Stability AI