Spaces: OnyxMunk/Stable-Audio-Open


OnyxlMunkey committed on Dec 20, 2025

Commit

99bda05

1 Parent(s): c30d48f

Update requirements: Add missing dependencies for audio processing and improve documentation


Files changed (1)

GEMINI.md +65 -0

GEMINI.md ADDED

	@@ -0,0 +1,65 @@

+# Stable Audio Open
+## Project Overview
+**Stable Audio Open** is a Python web application that generates audio from text prompts. It runs the Stable Audio Open model through the `diffusers` library to synthesize sound effects, music, and ambient noise. The user interface is built with **Gradio**, so audio can be generated and played back directly in the browser.
+**Key Technologies:**
+*   **Python:** Core programming language.
+*   **Gradio:** Web interface framework for machine learning demos.
+*   **PyTorch & Diffusers:** Libraries for loading and running the Stable Audio Open model.
+*   **Hugging Face Hub:** Source for the pre-trained models.
+## Building and Running
+### Prerequisites
+*   Python 3.8+
+*   A CUDA-capable GPU is recommended for faster generation. The application also runs on CPU, but generation is slower.
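Not part of the committed file: a minimal sketch of how the prerequisites above could be checked at startup. It assumes PyTorch's `torch.cuda.is_available()` for GPU detection. The helper names `python_ok` and `pick_device` are hypothetical, and the diff does not show how `app.py` actually selects its device.

```python
import sys


def python_ok(minimum=(3, 8)) -> bool:
    """Return True when the running interpreter meets the stated minimum."""
    return sys.version_info >= minimum


def pick_device() -> str:
    """Return "cuda" when PyTorch reports a usable GPU, otherwise "cpu"."""
    try:
        import torch  # imported lazily so the check works without PyTorch
    except ImportError:
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


print(f"Python version supported: {python_ok()}")
print(f"Selected device: {pick_device()}")
```

A pipeline loaded with `diffusers` could then be moved to the returned device with `.to(device)`.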
+### Installation
+1.  **Clone the repository:**
+    ```bash
+    git clone <repository_url>
+    cd Stable-Audio-Open
+    ```
+2.  **Install dependencies:**
+    It is recommended to use a virtual environment.
+    ```bash
+    # Create a virtual environment (optional but recommended)
+    python -m venv env
+    # Activate it: run only the line for your platform
+    # Windows:
+    .\env\Scripts\activate
+    # Linux/macOS:
+    source env/bin/activate
+    # Install packages
+    pip install -r requirements.txt
+    ```
+### Running the Application
+To start the Gradio web interface:
+```bash
+python app.py
+```
+Once the server starts, the application is typically available at `http://127.0.0.1:7860` in your web browser.
+## Development Conventions
+*   **Entry Point:** `app.py` is the main script. It handles model loading, audio generation logic, and UI construction.
+*   **Model Caching:** The application implements a simple global caching mechanism (`model_cache`) to avoid reloading the heavy model on every request.
+*   **Error Handling:** The `generate_audio` function includes fallback mechanisms. If the model fails to load or generate, it synthesizes a simple sine wave to ensure the UI remains responsive and provides feedback.
+*   **Configuration:** Key parameters like model ID (`stabilityai/stable-audio-open-small`) are currently hardcoded in `app.py`.
+*   **Dependencies:** Managed via `requirements.txt`.
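Not part of the committed file: a minimal sketch of the caching and fallback conventions listed above, assuming a dictionary keyed by model ID and a one-second 440 Hz fallback tone. The `loader` and `synthesize` callables are hypothetical stand-ins for the real `diffusers` calls, and `SAMPLE_RATE`, `get_model`, and `sine_fallback` are illustrative names. Only `model_cache`, `generate_audio`, and the model ID come from the description.

```python
import math
from typing import Callable, Dict, List, Tuple

MODEL_ID = "stabilityai/stable-audio-open-small"  # hardcoded, as described
SAMPLE_RATE = 44100  # assumed output rate; not stated in the source

# Global cache keyed by model ID so the model is loaded at most once.
model_cache: Dict[str, object] = {}


def get_model(model_id: str, loader: Callable[[str], object]) -> object:
    """Return the cached model, calling `loader` only on the first request."""
    if model_id not in model_cache:
        model_cache[model_id] = loader(model_id)
    return model_cache[model_id]


def sine_fallback(duration_s: float = 1.0, freq_hz: float = 440.0) -> List[float]:
    """Synthesize a plain sine wave as a list of samples in [-0.5, 0.5]."""
    n = int(SAMPLE_RATE * duration_s)
    return [0.5 * math.sin(2 * math.pi * freq_hz * i / SAMPLE_RATE) for i in range(n)]


def generate_audio(
    prompt: str,
    loader: Callable[[str], object],
    synthesize: Callable[[object, str], List[float]],
) -> Tuple[int, List[float], str]:
    """Generate audio for `prompt`, falling back to a sine wave on any failure."""
    try:
        model = get_model(MODEL_ID, loader)
        return SAMPLE_RATE, synthesize(model, prompt), "ok"
    except Exception as exc:  # load or generation failed: keep the UI usable
        return SAMPLE_RATE, sine_fallback(), f"fallback: {exc}"
```

Because a failed load raises before anything is stored, the cache never holds a broken model, and the next request retries the load.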
+## Directory Structure
+*   `app.py`: Main application source code.
+*   `requirements.txt`: List of Python packages required.
+*   `README.md`: General project documentation.
+*   `.gitattributes`: Git configuration for file handling.