File size: 1,698 Bytes
d35a6a0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
---

title: PersonaFlow
emoji: ๐ŸŽญ
colorFrom: indigo
colorTo: purple
sdk: gradio
sdk_version: 5.9.1
app_file: app.py
pinned: false
hf_oauth: true
license: apache-2.0
short_description: Speak with AI characters that have distinct personalities
tags:
  - voice
  - audio
  - tts
  - stt
  - character
  - roleplay
---


# ๐ŸŽญ PersonaFlow

**Interactive Audio Character Demo** - Speak with AI characters that have distinct personalities, voices, and animated portraits.

## Features

- **Voice Input**: Speak into your microphone (up to 10 seconds)
- **Multiple Characters**: Choose from 3 distinct personalities
  - ๐Ÿš€ **The Visionary** - Bold, dramatic, futuristic
  - ๐Ÿค” **The Skeptic** - Dry, questioning, sardonic
  - ๐ŸŒŸ **The Guide** - Warm, helpful, encouraging
- **Unique Voices**: Each character has a distinct voice
- **Animated Portraits**: Visual feedback with lip-sync animation
- **Conversation History**: Track your dialogue with each character

## How It Works

1. **Select a character** from the dropdown
2. **Click the microphone** and speak your message
3. **Listen** to the character's response with their unique voice
4. **Continue** the conversation or switch characters

## Technology

- **STT**: distil-whisper/distil-large-v3 (faster-whisper backend)
- **LLM**: Qwen/Qwen2.5-3B-Instruct
- **TTS**: Kokoro-82M with multiple voice options
- **Frontend**: Gradio with custom CSS animations

## Latency Target

| Component | Target |
|-----------|--------|
| STT | <500ms |
| LLM | <400ms |
| TTS | <300ms |
| **Total** | **<1.5s** |

---

Built for engaging 1-2 minute voice interactions with AI personalities.