--- license: mit title: 🎙️ PodXplain sdk: gradio emoji: 📚 colorFrom: red colorTo: blue pinned: true short_description: PodXplain is a Hugging Face-hosted application that converts --- # 🎙️ PodXplain **From script to story — voice it like never before.** PodXplain is a Hugging Face-hosted application that converts long-form text into engaging multi-speaker podcast-style audio. Simply input your script, and get a professional-sounding MP3 podcast with automatic speaker detection and assignment. ## ✨ Features - **📝 Long-form Support**: Handle up to 50,000 characters of text - **🎭 Multi-speaker Audio**: Automatic speaker detection and assignment - **🔄 Smart Segmentation**: Intelligent text splitting with progress tracking - **🎵 High-quality Output**: MP3 format for optimal file size and compatibility - **🚀 Real-time Progress**: Live updates during generation - **🎨 Modern UI**: Clean, intuitive Gradio interface ## 🛠️ Tech Stack - **Frontend**: Gradio for interactive web interface - **TTS Engine**: Nari DIA 1.6B for natural voice synthesis (currently mocked) - **Audio Processing**: pydub for audio manipulation and MP3 conversion - **Hosting**: Hugging Face Spaces with GPU support ## 📋 How to Use 1. **Input Text**: Paste or type your podcast script (up to 50,000 characters) 2. **Choose Mode**: Select speaker detection mode: * **Auto**: Smart detection based on content structure * **Paragraph**: Speaker changes at paragraph breaks * **Dialogue**: Detection based on dialogue markers 3. **Generate**: Click "Generate Podcast" and watch the progress 4. **Download**: Get your MP3 file and listen to your podcast! ## 🚀 Quick Start ### Local Development ```bash # Clone the repository git clone [https://github.com/yourusername/podxplain.git](https://github.com/yourusername/podxplain.git) # Replace with your actual repo URL cd podxplain # Install dependencies pip install -r requirements.txt # Run the application python app.py