audiobook-ru-tts / README.md
danilahs's picture
Upload folder using huggingface_hub
4f6648e verified

A newer version of the Gradio SDK is available: 6.14.0

Upgrade
metadata
title: Russian Audiobook Studio
emoji: 🎧
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: mit
short_description: Create Russian audiobooks from EPUB files with advanced TTS

🎧 Russian Audiobook Studio

Transform your Russian EPUB books into professional-quality audiobooks using advanced Text-to-Speech technology with automatic accent placement and voice customization.

✨ Features

  • πŸ“š EPUB Processing: Upload EPUB files and automatically extract chapters
  • 🎯 Chapter Selection: Choose specific chapters to process with checkbox interface
  • 🎡 Advanced TTS: High-quality Russian TTS with automatic accent placement using ruaccent
  • πŸŽ›οΈ Voice Controls: Customize voice clarity, variation, and volume levels
  • 🎧 Audio Preview: Built-in audio players for chapter previews
  • πŸ“₯ Download Options: Download individual chapters or complete audiobook
  • πŸ“± Responsive Design: Works on desktop and mobile devices
  • β™Ώ Accessibility: Full keyboard navigation and screen reader support

πŸš€ How to Use

  1. Upload EPUB: Click "Choose File" and select your Russian EPUB book
  2. Validate: Click "Validate EPUB" to extract chapters and book information
  3. Select Chapters: Choose which chapters to process using the checkbox interface
  4. Adjust Settings: Customize voice parameters in Advanced Settings
  5. Process: Click "Process Selected Chapters" to generate audiobook
  6. Preview & Download: Listen to previews and download your audiobook

βš™οΈ Advanced Settings

  • Speech Speed: Adjust playback speed (0.6x to 1.4x)
  • Quality Steps: Higher values = better quality, longer processing time
  • Voice Clarity: Control how closely voice follows reference (1.0-4.0)
  • Voice Variation: Control natural voice variation (-2.0 to 1.0)
  • Volume Level: Target volume level for generated audio

🎯 Technical Features

  • Automatic Accent Placement: Uses ruaccent library for perfect Russian pronunciation
  • Professional Audio Quality: 24kHz, 16-bit, mono WAV output
  • Real-time Progress: Live updates during processing with cancellation support
  • Error Handling: Robust error handling with detailed status messages
  • File Management: Automatic file serving and download functionality

πŸ“‹ Requirements

  • Russian EPUB files with proper text encoding
  • Reference audio sample (6-12 seconds) for voice cloning
  • Reference text (transcript of the reference audio)

πŸ”§ Technical Stack

  • TTS Engine: ESpeech-TTS-1_RL-V2 (F5-TTS)
  • Accent Processing: ruaccent for automatic Russian stress placement
  • Audio Processing: librosa, soundfile, pydub
  • Web Interface: Gradio with responsive design
  • EPUB Processing: ebooklib for book parsing

πŸ“ Notes

  • Processing time depends on chapter length and quality settings
  • Higher quality settings require more processing time
  • All audio files are temporarily stored for preview and download
  • The system automatically handles Russian text normalization and accent placement

πŸŽ‰ Ready to Create Audiobooks?

Upload your Russian EPUB file and start creating professional-quality audiobooks in minutes!