# AICoverGen Enhanced 

**AI-Powered Voice Cover Generation with Advanced Audio Enhancement**

AICoverGen Enhanced is a powerful tool for creating AI voice covers with professional-grade audio enhancement features. This enhanced version includes advanced AI audio processing, EQ controls, dynamic range compression, and much more!

##  New Features

###  Advanced Audio Enhancement
- **AI Noise Reduction** - Remove background noise and artifacts
- **Professional EQ** - 5 EQ types: Balanced, Vocal Boost, Bass Boost, Treble Boost, Flat
- **Dynamic Range Compression** - Improve loudness and consistency
- **Harmonic Enhancement** - Add richness and warmth to vocals
- **Stereo Widening** - Enhance spatial imaging for stereo tracks
- **Reverb Control** - Add depth and professional polish
- **Gain Control** - Fine-tune volume (-20 to +20 dB)

###  Enhancement Types
- **Full** - Balanced enhancement with all features
- **Light** - Subtle improvements for natural sound
- **Aggressive** - Maximum enhancement for impact
- **Custom** - Use your specific settings

##  System Requirements

### Minimum Requirements
- **OS**: Windows 10/11, Linux, or macOS
- **Python**: 3.9+ (3.10+ recommended)
- **RAM**: 8GB minimum, 16GB recommended
- **Storage**: 10GB free space
- **GPU**: NVIDIA GPU with CUDA support (recommended)

### Recommended Setup
- **OS**: Windows 11 or Ubuntu 20.04+
- **Python**: 3.10 or 3.11
- **RAM**: 16GB or more
- **GPU**: NVIDIA RTX 3060 or better
- **CUDA**: 11.8 or 12.0+
- **cuDNN**: 8.6 or 9.0+

##  Installation Guide

### Step 1: Clone the Repository
`ash
git clone https://github.com/SociallyIneptWeeb/AICoverGen.git
cd AICoverGen
`

### Step 2: Create Virtual Environment
`ash
# Windows
python -m venv AICoverGen
AICoverGen\Scripts\activate

# Linux/macOS
python3 -m venv AICoverGen
source AICoverGen/bin/activate
`

### Step 3: Install Dependencies

#### Option A: Automatic Installation (Recommended)
`ash
pip install -r requirements.txt
`

#### Option B: Manual Installation
`ash
# Core dependencies
pip install gradio==3.50.2
pip install librosa==0.9.1
pip install numpy==1.23.5
pip install scipy==1.11.1
pip install soundfile==0.12.1
pip install pedalboard==0.7.7
pip install pydub==0.25.1
pip install fairseq==0.12.2
pip install faiss-cpu==1.7.3
pip install pyworld==0.3.4
pip install praat-parselmouth>=0.4.2
pip install ffmpeg-python>=0.2.0
pip install tqdm==4.65.0
pip install yt-dlp>=2025.9.23
pip install sox==1.4.1

# AI Audio Enhancement dependencies
pip install noisereduce==3.0.3
pip install scikit-learn==1.6.1

# PyTorch with CUDA support
pip install torch==2.0.1+cu118 --find-links https://download.pytorch.org/whl/torch_stable.html
pip install torchcrepe==0.0.20

# ONNX Runtime with CUDA support
pip install onnxruntime-gpu==1.18.0
`

### Step 4: Download Models
`ash
python src/download_models.py
`

### Step 5: Verify Installation
`ash
python src/audio_enhancer.py
`

##  Usage

### Quick Start
1. **Start the Web UI**:
   `ash
   python src/webui.py
   `

2. **Open your browser** and go to http://127.0.0.1:7860

3. **Upload a song** (YouTube URL or audio file)

4. **Select a voice model** from the dropdown

5. **Configure audio enhancement**:
   - Expand "AI Audio Enhancement" section
   - Choose enhancement type (Full/Light/Aggressive/Custom)
   - Adjust EQ type (Balanced/Vocal Boost/Bass Boost/Treble Boost/Flat)
   - Set noise reduction strength (0-100%)
   - Adjust gain (-20 to +20 dB)
   - Set compression ratio (1-10)
   - Add reverb amount (0-100%)

6. **Click Generate** and enjoy your enhanced AI cover!

##  Troubleshooting

### Common Issues

#### CUDA Not Detected
`ash
# Check CUDA installation
nvidia-smi

# Verify PyTorch CUDA support
python -c "import torch; print(torch.cuda.is_available())"

# Check ONNX Runtime CUDA
python -c "import onnxruntime as ort; print('CUDA' in ort.get_available_providers())"
`

#### Audio Enhancement Errors
`ash
# Test audio enhancer
python src/audio_enhancer.py

# Check dependencies
pip list | grep -E "(noisereduce|scikit-learn|pedalboard)"
`

#### Memory Issues
- Reduce batch size in settings
- Use CPU-only mode for ONNX Runtime
- Close other applications to free RAM

##  Project Structure

`
AICoverGen_Enhanced/
 src/
    webui.py              # Main web interface
    main.py               # Core pipeline with audio enhancement
    audio_enhancer.py     # AI audio enhancement module
    rvc.py                # RVC voice conversion
    mdx.py                # Audio separation
    ...
 rvc_models/               # Voice models
 mdxnet_models/            # Audio separation models
 song_output/              # Generated covers
 requirements.txt          # Dependencies
 README_Enhanced.md        # This file
`

##  Audio Enhancement Features

### AI Noise Reduction
- Uses machine learning to identify and remove background noise
- Preserves vocal clarity while eliminating artifacts
- Adjustable strength from 0-100%

### Professional EQ
- **Balanced**: Gentle mid boost for overall clarity
- **Vocal Boost**: Emphasizes 800-3000 Hz range for vocals
- **Bass Boost**: Enhances 60-250 Hz for low-end presence
- **Treble Boost**: Brightens 4-16 kHz for crispness
- **Flat**: Minimal processing with high-pass filter

### Dynamic Range Compression
- Improves loudness consistency
- Reduces dynamic range for better streaming
- Configurable ratio from 1-10

### Harmonic Enhancement
- Adds warmth and richness to vocals
- Uses soft saturation for natural harmonics
- Enhances perceived quality

### Stereo Widening
- Improves spatial imaging for stereo tracks
- Enhances left-right separation
- Creates more immersive listening experience

### Reverb Control
- Adds subtle depth and space
- Professional room simulation
- Configurable wet/dry mix

##  Contributing

We welcome contributions! Please see our [Contributing Guidelines](CONTRIBUTING.md) for details.

### Development Setup
`ash
git clone https://github.com/SociallyIneptWeeb/AICoverGen.git
cd AICoverGen
pip install -r requirements.txt
pip install -r requirements-dev.txt  # If available
`

##  License

This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.

##  Acknowledgments

- Original AICoverGen by [SociallyIneptWeeb](https://github.com/SociallyIneptWeeb)
- RVC (Retrieval-based Voice Conversion) framework
- MDXNet for audio separation
- All the amazing open-source audio processing libraries

##  Support

- **Issues**: [GitHub Issues](https://github.com/SociallyIneptWeeb/AICoverGen/issues)
- **Discussions**: [GitHub Discussions](https://github.com/SociallyIneptWeeb/AICoverGen/discussions)
- **Documentation**: [Wiki](https://github.com/SociallyIneptWeeb/AICoverGen/wiki)

---

**Happy Creating! **

*Create amazing AI voice covers with professional audio enhancement!*