metadata
title: LinaCodec Audio Tool
emoji: 🎵
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: mit
🎵 LinaCodec Audio Tool
A Gradio web interface for LinaCodec, a neural audio codec for high-quality speech compression and voice conversion.
Features
Encode & Decode
- Upload audio files and compress them into speech tokens
- Decode the tokens back to 48 kHz audio
- Demonstrates the codec's compression and reconstruction quality
Voice Conversion
- Transfer timbre/style from one speaker to another
- Keep content from source audio
- Apply voice characteristics from reference audio
How to Use
Encode & Decode Tab:
- Upload an audio file or record using your microphone
- Click "Encode & Decode"
- Listen to the reconstructed audio
Voice Conversion Tab:
- Upload source audio (the content you want)
- Upload reference audio (the voice style you want)
- Click "Convert Voice"
- Listen to the converted result
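The two tabs described above could be laid out roughly as follows. This is a minimal sketch, not the Space's actual app.py: `build_demo` and its two callback parameters are hypothetical names, and the callbacks that would call the codec are supplied by the caller rather than shown here.

```python
from typing import Any, Callable, Tuple

SAMPLE_RATE = 48_000  # output rate stated under Technical Details

# Gradio audio outputs accept a (sample_rate, samples) tuple.
AudioOut = Tuple[int, Any]


def build_demo(
    encode_decode: Callable[[str], AudioOut],
    convert_voice: Callable[[str, str], AudioOut],
):
    """Build a two-tab UI around two caller-supplied callbacks (hypothetical)."""
    import gradio as gr  # imported lazily so this module loads without Gradio

    with gr.Blocks(title="LinaCodec Audio Tool") as demo:
        with gr.Tab("Encode & Decode"):
            audio_in = gr.Audio(
                sources=["upload", "microphone"],
                type="filepath",
                label="Input audio",
            )
            run_btn = gr.Button("Encode & Decode")
            audio_out = gr.Audio(label="Reconstructed audio")
            # The callback receives the path of the uploaded or recorded file.
            run_btn.click(encode_decode, inputs=audio_in, outputs=audio_out)

        with gr.Tab("Voice Conversion"):
            source = gr.Audio(type="filepath", label="Source audio (content)")
            reference = gr.Audio(type="filepath", label="Reference audio (voice)")
            convert_btn = gr.Button("Convert Voice")
            converted = gr.Audio(label="Converted audio")
            convert_btn.click(
                convert_voice, inputs=[source, reference], outputs=converted
            )

    return demo
```

With real callbacks in hand, `build_demo(my_encode_decode, my_convert_voice).launch()` would start the interface locally; both callback names are placeholders.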
Technical Details
- Model: YatharthS/LinaCodec
- Output Sample Rate: 48 kHz
- Input Formats: WAV, MP3, and other common audio formats
- Frameworks: PyTorch, Gradio
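One way to confirm that a saved output file really is 48 kHz is to read its header with Python's standard `wave` module. The sketch below assumes the output was saved as a 16-bit PCM WAV file; the test tone it writes is synthetic and stands in for model output, and both helper names are illustrative.

```python
import math
import struct
import wave

SAMPLE_RATE = 48_000  # expected output rate


def write_test_tone(path, seconds=0.5, freq=440.0):
    """Write a mono 16-bit sine tone at SAMPLE_RATE (synthetic stand-in)."""
    n = int(SAMPLE_RATE * seconds)
    frames = b"".join(
        struct.pack(
            "<h",
            int(0.3 * 32767 * math.sin(2 * math.pi * freq * i / SAMPLE_RATE)),
        )
        for i in range(n)
    )
    with wave.open(path, "wb") as wf:
        wf.setnchannels(1)
        wf.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        wf.setframerate(SAMPLE_RATE)
        wf.writeframes(frames)


def describe_wav(path):
    """Return (sample_rate_hz, duration_seconds) read from a WAV header."""
    with wave.open(path, "rb") as wf:
        rate = wf.getframerate()
        return rate, wf.getnframes() / rate
```

Calling `describe_wav` on a downloaded result and comparing the first value with `SAMPLE_RATE` is enough for a quick sanity check.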
Model Information
LinaCodec is a neural audio codec designed for high-quality speech compression and voice conversion. It encodes audio into discrete speech tokens and a global embedding, enabling efficient storage and manipulation of speech signals.
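The split into speech tokens plus a global embedding can be illustrated with a toy stand-in. Everything below is an assumption made for illustration: `ToyCodec` is not the LinaCodec model or its API (it only quantizes samples and stores the peak level as a one-element "embedding"), and pairing source tokens with the reference embedding is one plausible reading of how voice conversion uses the two parts, not a confirmed description.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Encoded:
    tokens: List[int]              # discrete per-sample content tokens
    global_embedding: List[float]  # one utterance-level vector


class ToyCodec:
    """Illustrative stand-in only; NOT the LinaCodec model or its API."""

    LEVELS = 127  # quantization levels per polarity

    def encode(self, samples: List[float]) -> Encoded:
        # The peak level plays the role of the global embedding here.
        peak = max((abs(s) for s in samples), default=0.0) or 1.0
        tokens = [round(s / peak * self.LEVELS) for s in samples]
        return Encoded(tokens=tokens, global_embedding=[peak])

    def decode(self, tokens: List[int], global_embedding: List[float]) -> List[float]:
        peak = global_embedding[0]
        return [t / self.LEVELS * peak for t in tokens]


def convert_voice(codec: ToyCodec, source: List[float], reference: List[float]) -> List[float]:
    """Keep the source's tokens, swap in the reference's global embedding."""
    src = codec.encode(source)
    ref = codec.encode(reference)
    return codec.decode(src.tokens, ref.global_embedding)
```

In this toy, decoding with the original embedding approximately reconstructs the input, while decoding the same tokens with another utterance's embedding keeps the waveform shape but takes on the other utterance's level. The real codec's embedding would carry far richer speaker information.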
Local Development
# Clone the repository and enter it
git clone https://huggingface.co/spaces/humair025/LinaCodec/
cd LinaCodec
# Install dependencies
pip install -r requirements.txt
# Run the app
python app.py
Citation
If you use LinaCodec in your research, please cite the original work.
License
MIT License