AEmotionStudio
/

foundation1-models

music-generation

Model card Files Files and versions

foundation1-models / README.md

AEmotionStudio's picture

Upload README.md with huggingface_hub

b68ff8b verified 10 days ago

|

history blame contribute delete

2.66 kB

	---
	license: other
	license_name: stability-ai-community
	license_link: LICENSE
	base_model:
	- stabilityai/stable-audio-open-1.0
	tags:
	- music
	- audio
	- text-to-audio
	- music-generation
	- loop
	- sample
	- stable-audio
	- safetensors
	- comfyui
	pipeline_tag: text-to-audio
	---

	# Foundation-1 — Mirror

	BPM/Key-Aware Music Sample Generator

	[Original Model](https://huggingface.co/RoyalCities/Foundation-1) by [RoyalCities](https://huggingface.co/RoyalCities) · Fine-tuned on [stable-audio-open-1.0](https://huggingface.co/stabilityai/stable-audio-open-1.0)

	> This is an ungated mirror of the Foundation-1 model weights for use with [ComfyUI-FFMPEGA](https://github.com/AEmotionStudio/ComfyUI-FFMPEGA). All credits go to the original authors.

	## What's in This Repo

	\| File \| Description \| Size \|
	\|------\|-------------\|------\|
	\| `Foundation_1.safetensors` \| FP16 model checkpoint \| ~3 GB \|
	\| `model_config.json` \| Model architecture config \| ~1 KB \|

	## What Foundation-1 Does

	Foundation-1 generates production-ready musical loops with fine-grained control over:

	- Musical structure: BPM, bars, time signatures, key/mode
	- Instrument identity: 30+ instrument families (synth, strings, brass, guitar, etc.)
	- Timbral control: 100+ timbre tags (warm, bright, gritty, glassy, etc.)
	- FX prompting: Reverb, delay, chorus, distortion, and more
	- Loop fidelity: Seamless, tempo-synced loops designed for layering

	## Usage with ComfyUI-FFMPEGA

	These weights are auto-downloaded by the FFMPEGA Agent node:

	1. Set `llm_model` to `none`
	2. Set `no_llm_mode` to `generate_sample`
	3. Enter a prompt like: "Synth, Pad, Warm, Wide, Lush, 120 BPM, 4 Bars, C major"

	The model (~3 GB) downloads on first use to `ComfyUI/models/foundation1/`.

	## Hardware Requirements

	- VRAM: ~7 GB during generation
	- Generation speed: ~7–8 seconds per sample (RTX 3090)

	## Prompt Structure

	Foundation-1 uses a layered prompt system:

	```
	[Instrument Family], [Sub-Family], [Timbre Tags], [FX Tags], [BPM], [Bars], [Key]
	```

	Example: `"Synth, Lead, Bright, Sharp, Saw, Detune, Delay Ping Pong, 140 BPM, 8 Bars, A minor"`

	## License

	These model weights are released under the [Stability AI Community License](./LICENSE).

	- ✅ Free for non-commercial use
	- ✅ Free for commercial use by entities with annual revenue < $1M USD
	- ❌ Entities with annual revenue ≥ $1M need an Enterprise license from Stability AI

	## Acknowledgements

	- [Foundation-1](https://huggingface.co/RoyalCities/Foundation-1) by RoyalCities
	- [Stable Audio Open](https://huggingface.co/stabilityai/stable-audio-open-1.0) by Stability AI