---
license: mit
title: 🎙️ PodXplain
sdk: gradio
emoji: 📚
colorFrom: red
colorTo: blue
pinned: true
short_description: PodXplain is a Hugging Face-hosted application that converts
---
# 🎙️ PodXplain

**From script to story — voice it like never before.**

PodXplain is a Hugging Face-hosted application that converts long-form text into engaging multi-speaker podcast-style audio. Simply input your script, and get a professional-sounding MP3 podcast with automatic speaker detection and assignment.

## ✨ Features

- **📝 Long-form Support**: Handle up to 50,000 characters of text
- **🎭 Multi-speaker Audio**: Automatic speaker detection and assignment
- **🔄 Smart Segmentation**: Intelligent text splitting with progress tracking
- **🎵 High-quality Output**: MP3 format for optimal file size and compatibility
- **🚀 Real-time Progress**: Live updates during generation
- **🎨 Modern UI**: Clean, intuitive Gradio interface

## 🛠️ Tech Stack

- **Frontend**: Gradio for interactive web interface
- **TTS Engine**: Nari DIA 1.6B for natural voice synthesis (currently mocked)
- **Audio Processing**: pydub for audio manipulation and MP3 conversion
- **Hosting**: Hugging Face Spaces with GPU support

## 📋 How to Use

1.  **Input Text**: Paste or type your podcast script (up to 50,000 characters)
2.  **Choose Mode**: Select speaker detection mode:
    * **Auto**: Smart detection based on content structure
    * **Paragraph**: Speaker changes at paragraph breaks
    * **Dialogue**: Detection based on dialogue markers
3.  **Generate**: Click "Generate Podcast" and watch the progress
4.  **Download**: Get your MP3 file and listen to your podcast!

## 🚀 Quick Start

### Local Development

```bash
# Clone the repository
git clone [https://github.com/yourusername/podxplain.git](https://github.com/yourusername/podxplain.git) # Replace with your actual repo URL
cd podxplain

# Install dependencies
pip install -r requirements.txt

# Run the application
python app.py