ghostai1
/

GHOSTSONAFB

English

python

Model card Files Files and versions

xet

Community

ghostai1 commited on Oct 10, 2025

Commit

5ec93f2

verified ·

1 Parent(s): 5ca3b53

Update README.md

Browse files

Files changed (1) hide show

README.md +81 -231

README.md CHANGED Viewed

@@ -1,4 +1,12 @@
 ---
 license: mit
 language:
 - en
@@ -7,117 +15,84 @@ tags:
 - ai
 ---
-* FULL API build [beta build] Optimizied to handle full 30/60/180 second renders
-https://huggingface.co/ghostai1/GHOSTSONAFB/blob/main/stable12gblg30sec.py
-<audio controls src="https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/RifW0nT3T-Y5Q3kawuHu2.mpga"></audio>
-![image](https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/-P3uQK1P_qP9F1GjzhgCm.png)
-![image](https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/8oGZ0g0ZmKuYDp2tbwkf4.png)
-🚀 Updated Repo Alert! MUSIC GEN LARGE FULL API [BETA]🚀
-PYTHON/JS/BASH/CURL
-no MCP AGENTIC YET CLIENT APP UTILIZSES AGENTIC MCP
-![image](https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/60dwmLyHVadwom8v3DEzY.png)
 https://huggingface.co/facebook/musicgen-large
-Massive SM80 build optimized to the max 🛠️ for CUDA 12.1 & cuDNN 9! 🎉 No dependencies, just the raw file update dropped in the repo! 📂
-🚫 No MCP AGENTIC RAG AI API for this one—built for 3000 series GPUs with 12GB+ VRAM only. Don’t try 40xx/50xx, it’s a no-go! 😿
-🎵 New SM80 build crafted for large music gen—grab it from the repo! 🔗
-🐍 Python 3.10 is the vibe, 3.9 works but might be buggy 🐛
-🔥 Get the update here: huggingface.co/ghostai1/GHOSTSONAFB
-⏭️ Next update: Higher link threading, supports up to 8 GTS, no Gen 4 yet. 50xx support? Maybe later!
-UPDATE FOUND HERE * https://huggingface.co/ghostai1/GHOSTSONAFB/blob/main/STABLE12gb3060.py
-*API EXAMPLE CURL
-```curl -X POST https://533d9ec43354159938.gradio.live/call/set_red_hot_chili_peppers_prompt -s -H "Content-Type: application/json" -d '{
-  "data": [
-							60,
-							"none",
-							"none",
-							"none",
-							"none",
-							"none"
-]}' \
-  | awk -F'"' '{ print $4}'  \
-  | read EVENT_ID; curl -N https://533d9ec43354159938.gradio.live/call/set_red_hot_chili_peppers_prompt/$EVENT_ID
-gen music endpoint
-api_name: /generate_music_wrapper
-curl -X POST https://533d9ec43354159938.gradio.live/call/generate_music_wrapper -s -H "Content-Type: application/json" -d '{
-  "data": [
-							"Hello!!",
-							1,
-							10,
-							0,
-							0.1,
-							"30",
-							60,
-							"none",
-							"none",
-							"none",
-							"none",
-							"none",
-							-30,
-							"default",
-							"1000",
-							"Hello!!"
-]}' \
-  | awk -F'"' '{ print $4}'  \
-  | read EVENT_ID; curl -N https://533d9ec43354159938.gradio.live/call/generate_music_wrapper/$EVENT_ID
-```
-<audio controls src="https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/eDd-f8QJiY4GeMJ8PG_22.mpga"></audio>
-https://huggingface.co/facebook/musicgen-medium
 <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/xdLE5yosDG_MtnzkyG4_L.mpga"></audio>
-![image](https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/bVhLFORVf1p1A8VrXWZeB.png)
-![image](https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/SbL9DMWRzEf47CqOJsq3i.png)
-# use huggingface-cli to sync
 # 🎵 GhostAI Music Generator 🎸 & VOCAL UPDATE* barks.py 1.5B Optimized to run on 8GB Will release a Large model 12-24 GB soon UPDATE* Stable float16/32 working on INT8
-https://huggingface.co/ghostai1/GHOSTSONAFB/blob/main/start_bash.sh
 # SH auto downloader dir etc get FB music perms from HF first
-FLOAT16/32 CUDA 11.8 & 12.1  4bit for lower end 8 bit full
 Welcome to the GhostAI Music Generator! This web-based tool utilizes Meta AI's `musicgen-medium` model to craft high-quality instrumental tracks across genres such as Rock, Techno, Jazz, Classical, and Hip-Hop. The application structures compositions with sections like intros, verses, and choruses, all accessible through an intuitive Gradio interface. Outputs are high-quality MP3 files at 320 kbps, complete with embedded metadata. To enhance audio quality, we've integrated processing features including equalization (EQ), a chorus effect, and peak limiting for a polished sound.
-![image](https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/5PCpX_7Yuhs8S9BEDck_5.png)
-![image](https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/R-UxaeGKbM_tK6B7lCGIE.png)
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/LZkcrdpN5PQXOF4pj33bu.png)
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/sIIjdL3it8MSw9w5XBz0q.png)
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/HcBK7X9373CVYO5zyo4YL.png)
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/MoQb9arla6rXGepgFugNp.png)
 ## Project Evolution and Optimization
@@ -130,178 +105,53 @@ Audio enhancements include:
 - **Gain Adjustment**: +2 dB boost before crossfading to address amplitude dips.
 - **Compression**: Removed to preserve dynamic range.
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/b78antJwwWAx-jFfXoYHk.png)
-## System Requirements
-To get started, ensure your system meets the following requirements:
 - **Operating System**: Ubuntu (Note: Windows/macOS are untested).
 - **GPU**: CUDA-capable GPU with at least 8 GB VRAM.
 - **Python**: Version 3.10.
 - **ffmpeg**: Installed for audio processing.
-## Installation and Setup
 1. **Clone the Repository**:
    ```bash
-   git clone https://huggingface.co/your-username/ghostai-music-generator
    cd ghostai-music-generator
-   ```
-2. **Set Up a Virtual Environment**:
-   ```bash
    python3 -m venv venv
    source venv/bin/activate
-   ```
-3. **Install PyTorch**:
-   For CUDA 12.1:
-   ```bash
    pip install torch==2.1.0 torchvision==0.16.0 torchaudio==2.1.0 --index-url https://download.pytorch.org/whl/cu121
-   ```
-   For other CUDA versions, refer to the [PyTorch installation guide](https://pytorch.org/get-started/locally/).
-4. **Install Other Dependencies**:
-   ```bash
    pip install -r requirements.txt
-   ```
-5. **Install ffmpeg**:
-   ```bash
    sudo apt-get install ffmpeg
-   ```
-6. **Authenticate with Hugging Face**:
-   ```bash
    huggingface-cli login
-   ```
-   Retrieve your token from [Hugging Face Tokens](https://huggingface.co/settings/tokens).
-7. **Request Access to the Model**:
-   Visit [facebook/musicgen-medium](https://huggingface.co/facebook/musicgen-medium) and request access.
-8. **Download and Place Model Weights**:
-   ```bash
    mkdir -p /home/ubuntu/ghostai_music_generator/models/musicgen-medium
-   ```
-   Place the model weights in the directory above. If you store the model elsewhere, update the `local_model_path` in `app.py` accordingly.
-## Running the Application
-Start the application by executing:
-```bash
-python app.py
-```
-This will launch a Gradio UI at `http://0.0.0.0:9999`. Open this URL in your browser to access the interface.
-## Using the Interface
-Within the Gradio interface:
-- **Select a Genre**: Choose from Rock, Techno, Jazz, Classical, or Hip-Hop.
-- **Custom Prompt**: Enter a custom prompt, such as:
-  ```
-  Hard rock with a dynamic intro, expressive verse, and powerful chorus, featuring electric guitars, steady heavy drums, and deep bass.
-  ```
-- **Adjust Parameters**:
-  - **Guidance Scale (CFG)**: Default is 3.0.
-  - **Top-K Sampling**: Default is 300.
-  - **Top-P Sampling**: Default is 0.95.
-  - **Temperature**: Default is 1.0.
-  - **Total Duration**: Set to 30 seconds (range: 10-60).
-  - **Crossfade Duration**: Set to 500 ms (range: 100-2000).
-- **Generate Music**: Click "Generate Music" to create the track. The output will be saved as `output_cleaned.mp3` and played within Gradio.
-Monitor the terminal output for VRAM and GPU memory usage to ensure smooth operation.
-## Troubleshooting and Customization
-- **Quiet Spots in Waveform**: Edit `app.py` to increase gain before crossfading:
-  ```python
-  next_segment = next_segment + 3
-  ```
-  Use tools like Audacity to inspect and adjust the waveform.
-- **Enhancing the Chorus**: Modify the second chunk prompt to:
-  ```
-  explosive chorus with soaring guitars and pounding drums
-  ```
-  Or increase the temperature to 1.2 and `top_k` to 350 in the UI.
-- **Audio Distortion**: Reduce the chorus effect gain in `apply_chorus`:
-  ```python
-  delayed = segment - 6
-  ```
-  Adjust EQ settings in `apply_eq` with a high-pass at 80 Hz and low-pass at 5000 Hz.
-- **MP3 Export Issues**: Ensure `ffmpeg` is installed:
-  ```bash
-  sudo apt-get install ffmpeg
-  ```
-  Check the existence of `chunk_{i}.mp3` and `output_cleaned.mp3` files.
-- **VRAM Constraints**: Reduce the total duration to 20 seconds, close other GPU-intensive applications using `nvidia-smi`, and monitor usage with:
-  ```python
-  print(torch.cuda.memory_summary())
-  ```
-## Customization Options
-- **Lock Dependencies**:
-  ```bash
-  pip freeze > requirements.txt
-  ```
-- **Add New Genres**: In `app.py`, define a new genre prompt:
-  ```python
-  def set_pop_prompt():
-      return "Pop with a catchy intro, upbeat verse, and anthemic chorus, featuring bright synths, punchy drums, and groovy bass"
-  ```
-  Add a button for the new genre:
-  ```python
-  pop_btn = gr.Button("Pop", elem_classes="genre-btn")
-  pop_btn.click(set_pop_prompt, inputs=None, outputs=[instrumental_prompt])
-  ```
-- **Edit MP3 Files**: Use Audacity or similar tools for more control over the final output.
-- **Use a Smaller Model**: If VRAM is limited, switch to `musicgen-small` by updating `app.py`:
-  ```python
-  musicgen_model = MusicGen.get_pretrained('facebook/musicgen-small', device=device)
-  ```
-### Prerequisites
-- Ubuntu system with Python 3.10 installed.
-- NVIDIA RTX 3060 Ti GPU with CUDA support (CUDA 11.8 recommended).
-- Internet connection to download the `musicgen-medium` model.
-### Step 1: Make the Setup Script Executable
-The `start_bash.sh` script sets up the virtual environment, installs dependencies, and downloads the `musicgen-medium` model. First, make the script executable:
-```bash
-chmod +x start_bash.sh
-## License and Acknowledgments
-This project is licensed under the MIT License. Please include a LICENSE file with the MIT License text.
-Special thanks to:
-- Meta AI for `musicgen-medium` and Audiocraft.
-- Hugging Face for hosting and CLI tools.
-- Gradio for the web interface.
-- pydub for audio processing and MP3 export.
-- xAI for their support.
-Enjoy creating music! If you have questions or suggestions, feel free to open an issue on the repository. Let's make some tunes! 🎉
-CUDA 12 MEMORY MANAGEMENT UPDATE
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/fzyGz3Ondrr_snqH8yHiG.png)

 ---
+title: GhostAI Music Generator
+emoji: 🎵
+colorFrom: blue
+colorTo: purple
+sdk: gradio
+sdk_version: 4.44.0
+app_file: app.py
+pinned: false
 license: mit
 language:
 - en
 - ai
 ---
+<div align="center">
+# 🎵 GhostAI Music Generator 🎸
+[![Python](https://img.shields.io/badge/Python-3.10-blue.svg)](https://www.python.org/downloads/)
+[![MIT License](https://img.shields.io/badge/License-MIT-green.svg)](LICENSE)
+[![Hugging Face](https://img.shields.io/badge/Hugging%20Face-ghostai1%2FGHOSTSONAFB-yellow.svg)](https://huggingface.co/ghostai1/GHOSTSONAFB)
+[![CUDA](https://img.shields.io/badge/CUDA-12.1%20%7C%2011.8-brightgreen.svg)](https://developer.nvidia.com/cuda-downloads)
+**FULL API build [beta build] Optimized to handle full 30/60/180 second renders**
+Generate high-quality instrumental tracks with Meta AI's MusicGen models!
+</div>
+<div align="center">
+  <table>
+    <tr>
+      <td align="center"><img src="https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/-P3uQK1P_qP9F1GjzhgCm.png" width="200" height="200" alt="Interface 1" /></td>
+      <td align="center"><img src="https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/8oGZ0g0ZmKuYDp2tbwkf4.png" width="200" height="200" alt="Interface 2" /></td>
+    </tr>
+    <tr>
+      <td align="center"><audio controls src="https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/RifW0nT3T-Y5Q3kawuHu2.mpga"></audio></td>
+      <td align="center"><audio controls src="https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/eDd-f8QJiY4GeMJ8PG_22.mpga"></audio></td>
+    </tr>
+  </table>
+</div>
+🚀 **Updated Repo Alert! MUSIC GEN LARGE FULL API [BETA]🚀**
+PYTHON/JS/BASH/CURL • No MCP AGENTIC YET, CLIENT APP UTILIZES AGENTIC MCP
+![Interface 3](https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/60dwmLyHVadwom8v3DEzY.png)
 https://huggingface.co/facebook/musicgen-large
+**Massive SM80 build optimized for CUDA 12.1 & cuDNN 9!** 🛠️ 🎉 No dependencies, raw file update dropped in repo! 📂
+🚫 **No MCP AGENTIC RAG AI API**—built for 3000 series GPUs with 12GB+ VRAM only. Don’t try 40xx/50xx, it’s a no-go! 😿
+🎵 **New SM80 build crafted for large music gen**—grab it from the repo! 🔗
+🐍 **Python 3.10 is the vibe**, 3.9 works but might be buggy 🐛
+🔥 **Get the update here**: https://huggingface.co/ghostai1/GHOSTSONAFB
+⏭️ **Next update**: Higher link threading, supports up to 8 GTS, no Gen 4 yet. 50xx support? Maybe later!
+**UPDATE FOUND HERE**: https://huggingface.co/ghostai1/GHOSTSONAFB/blob/main/STABLE12gb3060.py
+**Scripts**: https://huggingface.co/ghostai1/GHOSTSONAFB/blob/main/stable12gblg30sec.py
 <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/xdLE5yosDG_MtnzkyG4_L.mpga"></audio>
+https://huggingface.co/facebook/musicgen-medium
+![Waveform](https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/bVhLFORVf1p1A8VrXWZeB.png)
+![Settings](https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/SbL9DMWRzEf47CqOJsq3i.png)
+# Use huggingface-cli to sync
 # 🎵 GhostAI Music Generator 🎸 & VOCAL UPDATE* barks.py 1.5B Optimized to run on 8GB Will release a Large model 12-24 GB soon UPDATE* Stable float16/32 working on INT8
+https://huggingface.co/ghostai1/GHOSTSONAFB/blob/main/start_bash.sh
 # SH auto downloader dir etc get FB music perms from HF first
+**FLOAT16/32 CUDA 11.8 & 12.1** 4bit for lower end 8 bit full
 Welcome to the GhostAI Music Generator! This web-based tool utilizes Meta AI's `musicgen-medium` model to craft high-quality instrumental tracks across genres such as Rock, Techno, Jazz, Classical, and Hip-Hop. The application structures compositions with sections like intros, verses, and choruses, all accessible through an intuitive Gradio interface. Outputs are high-quality MP3 files at 320 kbps, complete with embedded metadata. To enhance audio quality, we've integrated processing features including equalization (EQ), a chorus effect, and peak limiting for a polished sound.
+<div align="center">
+  <img src="https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/5PCpX_7Yuhs8S9BEDck_5.png" width="45%" alt="UI Preview">
+  <img src="https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/R-UxaeGKbM_tK6B7lCGIE.png" width="45%" alt="Output">
+  <img src="https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/LZkcrdpN5PQXOF4pj33bu.png" width="45%" alt="Controls">
+  <img src="https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/sIIjdL3it8MSw9w5XBz0q.png" width="45%" alt="Processing">
+  <img src="https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/HcBK7X9373CVYO5zyo4YL.png" width="45%" alt="Results">
+  <img src="https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/MoQb9arla6rXGepgFugNp.png" width="45%" alt="Analytics">
+  <img src="https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/b78antJwwWAx-jFfXoYHk.png" width="45%" alt="Performance">
+  <img src="https://cdn-uploads.huggingface.co/production/uploads/6421b1c68adc8881b974a89d/fzyGz3Ondrr_snqH8yHiG.png" width="45%" alt="CUDA Update">
+</div>
 ## Project Evolution and Optimization
 - **Gain Adjustment**: +2 dB boost before crossfading to address amplitude dips.
 - **Compression**: Removed to preserve dynamic range.
+## 🖥️ System Requirements
 - **Operating System**: Ubuntu (Note: Windows/macOS are untested).
 - **GPU**: CUDA-capable GPU with at least 8 GB VRAM.
 - **Python**: Version 3.10.
 - **ffmpeg**: Installed for audio processing.
+## ⚙️ Installation and Setup
 1. **Clone the Repository**:
    ```bash
+   git clone https://huggingface.co/ghostai1/ghostai-music-generator
    cd ghostai-music-generator
+## ⚙️ Installation and Setup
+1. Clone the Repository:
+   git clone https://huggingface.co/ghostai1/ghostai-music-generator
+   cd ghostai-music-generator
+2. Set Up a Virtual Environment:
    python3 -m venv venv
    source venv/bin/activate
+3. Install PyTorch (CUDA 12.1):
    pip install torch==2.1.0 torchvision==0.16.0 torchaudio==2.1.0 --index-url https://download.pytorch.org/whl/cu121
+   For other CUDA versions, refer to https://pytorch.org/get-started/locally/.
+4. Install Other Dependencies:
    pip install -r requirements.txt
+5. Install ffmpeg:
    sudo apt-get install ffmpeg
+6. Authenticate with Hugging Face:
    huggingface-cli login
+   Retrieve token from https://huggingface.co/settings/tokens
+7. Request Access to the Model:
+   Visit https://huggingface.co/facebook/musicgen-medium and request access.
+8. Download and Place Model Weights:
    mkdir -p /home/ubuntu/ghostai_music_generator/models/musicgen-medium
+   Place the model weights in the directory above. Update local_model_path in app.py if stored elsewhere.
+9. Run Setup Script:
+   chmod +x start_bash.sh
+   ./start_bash.sh