Spaces:

Adam3
/

Aicover

Build error

App Files Files Community

Aicover / README_Enhanced.md

Adam3

Upload 10 files

9d908c8 verified about 2 months ago

preview code

raw

history blame contribute delete

6.82 kB

	# AICoverGen Enhanced

	AI-Powered Voice Cover Generation with Advanced Audio Enhancement

	AICoverGen Enhanced is a powerful tool for creating AI voice covers with professional-grade audio enhancement features. This enhanced version includes advanced AI audio processing, EQ controls, dynamic range compression, and much more!

	## New Features

	### Advanced Audio Enhancement
	- AI Noise Reduction - Remove background noise and artifacts
	- Professional EQ - 5 EQ types: Balanced, Vocal Boost, Bass Boost, Treble Boost, Flat
	- Dynamic Range Compression - Improve loudness and consistency
	- Harmonic Enhancement - Add richness and warmth to vocals
	- Stereo Widening - Enhance spatial imaging for stereo tracks
	- Reverb Control - Add depth and professional polish
	- Gain Control - Fine-tune volume (-20 to +20 dB)

	### Enhancement Types
	- Full - Balanced enhancement with all features
	- Light - Subtle improvements for natural sound
	- Aggressive - Maximum enhancement for impact
	- Custom - Use your specific settings

	## System Requirements

	### Minimum Requirements
	- OS: Windows 10/11, Linux, or macOS
	- Python: 3.9+ (3.10+ recommended)
	- RAM: 8GB minimum, 16GB recommended
	- Storage: 10GB free space
	- GPU: NVIDIA GPU with CUDA support (recommended)

	### Recommended Setup
	- OS: Windows 11 or Ubuntu 20.04+
	- Python: 3.10 or 3.11
	- RAM: 16GB or more
	- GPU: NVIDIA RTX 3060 or better
	- CUDA: 11.8 or 12.0+
	- cuDNN: 8.6 or 9.0+

	## Installation Guide

	### Step 1: Clone the Repository
	`ash
	git clone https://github.com/SociallyIneptWeeb/AICoverGen.git
	cd AICoverGen
	`

	### Step 2: Create Virtual Environment
	`ash
	# Windows
	python -m venv AICoverGen
	AICoverGen\Scripts\activate

	# Linux/macOS
	python3 -m venv AICoverGen
	source AICoverGen/bin/activate
	`

	### Step 3: Install Dependencies

	#### Option A: Automatic Installation (Recommended)
	`ash
	pip install -r requirements.txt
	`

	#### Option B: Manual Installation
	`ash
	# Core dependencies
	pip install gradio==3.50.2
	pip install librosa==0.9.1
	pip install numpy==1.23.5
	pip install scipy==1.11.1
	pip install soundfile==0.12.1
	pip install pedalboard==0.7.7
	pip install pydub==0.25.1
	pip install fairseq==0.12.2
	pip install faiss-cpu==1.7.3
	pip install pyworld==0.3.4
	pip install praat-parselmouth>=0.4.2
	pip install ffmpeg-python>=0.2.0
	pip install tqdm==4.65.0
	pip install yt-dlp>=2025.9.23
	pip install sox==1.4.1

	# AI Audio Enhancement dependencies
	pip install noisereduce==3.0.3
	pip install scikit-learn==1.6.1

	# PyTorch with CUDA support
	pip install torch==2.0.1+cu118 --find-links https://download.pytorch.org/whl/torch_stable.html
	pip install torchcrepe==0.0.20

	# ONNX Runtime with CUDA support
	pip install onnxruntime-gpu==1.18.0
	`

	### Step 4: Download Models
	`ash
	python src/download_models.py
	`

	### Step 5: Verify Installation
	`ash
	python src/audio_enhancer.py
	`

	## Usage

	### Quick Start
	1. Start the Web UI:
	`ash
	python src/webui.py
	`

	2. Open your browser and go to http://127.0.0.1:7860

	3. Upload a song (YouTube URL or audio file)

	4. Select a voice model from the dropdown

	5. Configure audio enhancement:
	- Expand "AI Audio Enhancement" section
	- Choose enhancement type (Full/Light/Aggressive/Custom)
	- Adjust EQ type (Balanced/Vocal Boost/Bass Boost/Treble Boost/Flat)
	- Set noise reduction strength (0-100%)
	- Adjust gain (-20 to +20 dB)
	- Set compression ratio (1-10)
	- Add reverb amount (0-100%)

	6. Click Generate and enjoy your enhanced AI cover!

	## Troubleshooting

	### Common Issues

	#### CUDA Not Detected
	`ash
	# Check CUDA installation
	nvidia-smi

	# Verify PyTorch CUDA support
	python -c "import torch; print(torch.cuda.is_available())"

	# Check ONNX Runtime CUDA
	python -c "import onnxruntime as ort; print('CUDA' in ort.get_available_providers())"
	`

	#### Audio Enhancement Errors
	`ash
	# Test audio enhancer
	python src/audio_enhancer.py

	# Check dependencies
	pip list \| grep -E "(noisereduce\|scikit-learn\|pedalboard)"
	`

	#### Memory Issues
	- Reduce batch size in settings
	- Use CPU-only mode for ONNX Runtime
	- Close other applications to free RAM

	## Project Structure

	`
	AICoverGen_Enhanced/
	src/
	webui.py # Main web interface
	main.py # Core pipeline with audio enhancement
	audio_enhancer.py # AI audio enhancement module
	rvc.py # RVC voice conversion
	mdx.py # Audio separation
	...
	rvc_models/ # Voice models
	mdxnet_models/ # Audio separation models
	song_output/ # Generated covers
	requirements.txt # Dependencies
	README_Enhanced.md # This file
	`

	## Audio Enhancement Features

	### AI Noise Reduction
	- Uses machine learning to identify and remove background noise
	- Preserves vocal clarity while eliminating artifacts
	- Adjustable strength from 0-100%

	### Professional EQ
	- Balanced: Gentle mid boost for overall clarity
	- Vocal Boost: Emphasizes 800-3000 Hz range for vocals
	- Bass Boost: Enhances 60-250 Hz for low-end presence
	- Treble Boost: Brightens 4-16 kHz for crispness
	- Flat: Minimal processing with high-pass filter

	### Dynamic Range Compression
	- Improves loudness consistency
	- Reduces dynamic range for better streaming
	- Configurable ratio from 1-10

	### Harmonic Enhancement
	- Adds warmth and richness to vocals
	- Uses soft saturation for natural harmonics
	- Enhances perceived quality

	### Stereo Widening
	- Improves spatial imaging for stereo tracks
	- Enhances left-right separation
	- Creates more immersive listening experience

	### Reverb Control
	- Adds subtle depth and space
	- Professional room simulation
	- Configurable wet/dry mix

	## Contributing

	We welcome contributions! Please see our [Contributing Guidelines](CONTRIBUTING.md) for details.

	### Development Setup
	`ash
	git clone https://github.com/SociallyIneptWeeb/AICoverGen.git
	cd AICoverGen
	pip install -r requirements.txt
	pip install -r requirements-dev.txt # If available
	`

	## License

	This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.

	## Acknowledgments

	- Original AICoverGen by [SociallyIneptWeeb](https://github.com/SociallyIneptWeeb)
	- RVC (Retrieval-based Voice Conversion) framework
	- MDXNet for audio separation
	- All the amazing open-source audio processing libraries

	## Support

	- Issues: [GitHub Issues](https://github.com/SociallyIneptWeeb/AICoverGen/issues)
	- Discussions: [GitHub Discussions](https://github.com/SociallyIneptWeeb/AICoverGen/discussions)
	- Documentation: [Wiki](https://github.com/SociallyIneptWeeb/AICoverGen/wiki)

	---

	Happy Creating!

	Create amazing AI voice covers with professional audio enhancement!