| # AICoverGen Enhanced | |
| **AI-Powered Voice Cover Generation with Advanced Audio Enhancement** | |
| AICoverGen Enhanced is a powerful tool for creating AI voice covers with professional-grade audio enhancement features. This enhanced version includes advanced AI audio processing, EQ controls, dynamic range compression, and much more! | |
| ## New Features | |
| ### Advanced Audio Enhancement | |
| - **AI Noise Reduction** - Remove background noise and artifacts | |
| - **Professional EQ** - 5 EQ types: Balanced, Vocal Boost, Bass Boost, Treble Boost, Flat | |
| - **Dynamic Range Compression** - Improve loudness and consistency | |
| - **Harmonic Enhancement** - Add richness and warmth to vocals | |
| - **Stereo Widening** - Enhance spatial imaging for stereo tracks | |
| - **Reverb Control** - Add depth and professional polish | |
| - **Gain Control** - Fine-tune volume (-20 to +20 dB) | |
| ### Enhancement Types | |
| - **Full** - Balanced enhancement with all features | |
| - **Light** - Subtle improvements for natural sound | |
| - **Aggressive** - Maximum enhancement for impact | |
| - **Custom** - Use your specific settings | |
| ## System Requirements | |
| ### Minimum Requirements | |
| - **OS**: Windows 10/11, Linux, or macOS | |
| - **Python**: 3.9+ (3.10+ recommended) | |
| - **RAM**: 8GB minimum, 16GB recommended | |
| - **Storage**: 10GB free space | |
| - **GPU**: NVIDIA GPU with CUDA support (recommended) | |
| ### Recommended Setup | |
| - **OS**: Windows 11 or Ubuntu 20.04+ | |
| - **Python**: 3.10 or 3.11 | |
| - **RAM**: 16GB or more | |
| - **GPU**: NVIDIA RTX 3060 or better | |
| - **CUDA**: 11.8 or 12.0+ | |
| - **cuDNN**: 8.6 or 9.0+ | |
| ## Installation Guide | |
| ### Step 1: Clone the Repository | |
| `ash | |
| git clone https://github.com/SociallyIneptWeeb/AICoverGen.git | |
| cd AICoverGen | |
| ` | |
| ### Step 2: Create Virtual Environment | |
| `ash | |
| # Windows | |
| python -m venv AICoverGen | |
| AICoverGen\Scripts\activate | |
| # Linux/macOS | |
| python3 -m venv AICoverGen | |
| source AICoverGen/bin/activate | |
| ` | |
| ### Step 3: Install Dependencies | |
| #### Option A: Automatic Installation (Recommended) | |
| `ash | |
| pip install -r requirements.txt | |
| ` | |
| #### Option B: Manual Installation | |
| `ash | |
| # Core dependencies | |
| pip install gradio==3.50.2 | |
| pip install librosa==0.9.1 | |
| pip install numpy==1.23.5 | |
| pip install scipy==1.11.1 | |
| pip install soundfile==0.12.1 | |
| pip install pedalboard==0.7.7 | |
| pip install pydub==0.25.1 | |
| pip install fairseq==0.12.2 | |
| pip install faiss-cpu==1.7.3 | |
| pip install pyworld==0.3.4 | |
| pip install praat-parselmouth>=0.4.2 | |
| pip install ffmpeg-python>=0.2.0 | |
| pip install tqdm==4.65.0 | |
| pip install yt-dlp>=2025.9.23 | |
| pip install sox==1.4.1 | |
| # AI Audio Enhancement dependencies | |
| pip install noisereduce==3.0.3 | |
| pip install scikit-learn==1.6.1 | |
| # PyTorch with CUDA support | |
| pip install torch==2.0.1+cu118 --find-links https://download.pytorch.org/whl/torch_stable.html | |
| pip install torchcrepe==0.0.20 | |
| # ONNX Runtime with CUDA support | |
| pip install onnxruntime-gpu==1.18.0 | |
| ` | |
| ### Step 4: Download Models | |
| `ash | |
| python src/download_models.py | |
| ` | |
| ### Step 5: Verify Installation | |
| `ash | |
| python src/audio_enhancer.py | |
| ` | |
| ## Usage | |
| ### Quick Start | |
| 1. **Start the Web UI**: | |
| `ash | |
| python src/webui.py | |
| ` | |
| 2. **Open your browser** and go to http://127.0.0.1:7860 | |
| 3. **Upload a song** (YouTube URL or audio file) | |
| 4. **Select a voice model** from the dropdown | |
| 5. **Configure audio enhancement**: | |
| - Expand "AI Audio Enhancement" section | |
| - Choose enhancement type (Full/Light/Aggressive/Custom) | |
| - Adjust EQ type (Balanced/Vocal Boost/Bass Boost/Treble Boost/Flat) | |
| - Set noise reduction strength (0-100%) | |
| - Adjust gain (-20 to +20 dB) | |
| - Set compression ratio (1-10) | |
| - Add reverb amount (0-100%) | |
| 6. **Click Generate** and enjoy your enhanced AI cover! | |
| ## Troubleshooting | |
| ### Common Issues | |
| #### CUDA Not Detected | |
| `ash | |
| # Check CUDA installation | |
| nvidia-smi | |
| # Verify PyTorch CUDA support | |
| python -c "import torch; print(torch.cuda.is_available())" | |
| # Check ONNX Runtime CUDA | |
| python -c "import onnxruntime as ort; print('CUDA' in ort.get_available_providers())" | |
| ` | |
| #### Audio Enhancement Errors | |
| `ash | |
| # Test audio enhancer | |
| python src/audio_enhancer.py | |
| # Check dependencies | |
| pip list | grep -E "(noisereduce|scikit-learn|pedalboard)" | |
| ` | |
| #### Memory Issues | |
| - Reduce batch size in settings | |
| - Use CPU-only mode for ONNX Runtime | |
| - Close other applications to free RAM | |
| ## Project Structure | |
| ` | |
| AICoverGen_Enhanced/ | |
| src/ | |
| webui.py # Main web interface | |
| main.py # Core pipeline with audio enhancement | |
| audio_enhancer.py # AI audio enhancement module | |
| rvc.py # RVC voice conversion | |
| mdx.py # Audio separation | |
| ... | |
| rvc_models/ # Voice models | |
| mdxnet_models/ # Audio separation models | |
| song_output/ # Generated covers | |
| requirements.txt # Dependencies | |
| README_Enhanced.md # This file | |
| ` | |
| ## Audio Enhancement Features | |
| ### AI Noise Reduction | |
| - Uses machine learning to identify and remove background noise | |
| - Preserves vocal clarity while eliminating artifacts | |
| - Adjustable strength from 0-100% | |
| ### Professional EQ | |
| - **Balanced**: Gentle mid boost for overall clarity | |
| - **Vocal Boost**: Emphasizes 800-3000 Hz range for vocals | |
| - **Bass Boost**: Enhances 60-250 Hz for low-end presence | |
| - **Treble Boost**: Brightens 4-16 kHz for crispness | |
| - **Flat**: Minimal processing with high-pass filter | |
| ### Dynamic Range Compression | |
| - Improves loudness consistency | |
| - Reduces dynamic range for better streaming | |
| - Configurable ratio from 1-10 | |
| ### Harmonic Enhancement | |
| - Adds warmth and richness to vocals | |
| - Uses soft saturation for natural harmonics | |
| - Enhances perceived quality | |
| ### Stereo Widening | |
| - Improves spatial imaging for stereo tracks | |
| - Enhances left-right separation | |
| - Creates more immersive listening experience | |
| ### Reverb Control | |
| - Adds subtle depth and space | |
| - Professional room simulation | |
| - Configurable wet/dry mix | |
| ## Contributing | |
| We welcome contributions! Please see our [Contributing Guidelines](CONTRIBUTING.md) for details. | |
| ### Development Setup | |
| `ash | |
| git clone https://github.com/SociallyIneptWeeb/AICoverGen.git | |
| cd AICoverGen | |
| pip install -r requirements.txt | |
| pip install -r requirements-dev.txt # If available | |
| ` | |
| ## License | |
| This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details. | |
| ## Acknowledgments | |
| - Original AICoverGen by [SociallyIneptWeeb](https://github.com/SociallyIneptWeeb) | |
| - RVC (Retrieval-based Voice Conversion) framework | |
| - MDXNet for audio separation | |
| - All the amazing open-source audio processing libraries | |
| ## Support | |
| - **Issues**: [GitHub Issues](https://github.com/SociallyIneptWeeb/AICoverGen/issues) | |
| - **Discussions**: [GitHub Discussions](https://github.com/SociallyIneptWeeb/AICoverGen/discussions) | |
| - **Documentation**: [Wiki](https://github.com/SociallyIneptWeeb/AICoverGen/wiki) | |
| --- | |
| **Happy Creating! ** | |
| *Create amazing AI voice covers with professional audio enhancement!* | |