LinaCodec / README.md
humair025's picture
Update README.md
ca7a45e verified

A newer version of the Gradio SDK is available: 6.3.0

Upgrade
metadata
title: LinaCodec Audio Tool
emoji: 🎡
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: mit

🎡 LinaCodec Audio Tool

A Gradio web interface for LinaCodec, a neural audio codec for high-quality speech compression and voice conversion.

Features

πŸ”„ Encode & Decode

  • Upload audio files and compress them into speech tokens
  • Decode tokens back to 48kHz audio
  • Demonstrates codec's compression and reconstruction quality

🎭 Voice Conversion

  • Transfer timbre/style from one speaker to another
  • Keep content from source audio
  • Apply voice characteristics from reference audio

How to Use

  1. Encode & Decode Tab:

    • Upload an audio file or record using your microphone
    • Click "Encode & Decode"
    • Listen to the reconstructed audio
  2. Voice Conversion Tab:

    • Upload source audio (the content you want)
    • Upload reference audio (the voice style you want)
    • Click "Convert Voice"
    • Listen to the converted result

Technical Details

  • Model: YatharthS/LinaCodec
  • Output Sample Rate: 48 kHz
  • Input Formats: WAV, MP3, and other common audio formats
  • Framework: PyTorch, Gradio

Model Information

LinaCodec is a neural audio codec designed for high-quality speech compression and voice conversion. It encodes audio into discrete speech tokens and a global embedding, enabling efficient storage and manipulation of speech signals.

Local Development

# Clone the repository
git clone  https://huggingface.co/spaces/humair025/LinaCodec/

# Install dependencies
pip install -r requirements.txt

# Run the app
python app.py

Citation

If you use LinaCodec in your research, please cite the original work.

License

MIT License