PersonaFlow / README.md

Kailing-Leifang

Upload README.md with huggingface_hub

d35a6a0 verified about 1 month ago

preview code

raw

history blame contribute delete

1.7 kB

metadata

title: PersonaFlow
emoji: 🎭
colorFrom: indigo
colorTo: purple
sdk: gradio
sdk_version: 5.9.1
app_file: app.py
pinned: false
hf_oauth: true
license: apache-2.0
short_description: Speak with AI characters that have distinct personalities
tags:
  - voice
  - audio
  - tts
  - stt
  - character
  - roleplay

🎭 PersonaFlow

Interactive Audio Character Demo - Speak with AI characters that have distinct personalities, voices, and animated portraits.

Features

Voice Input: Speak into your microphone (up to 10 seconds)
Multiple Characters: Choose from 3 distinct personalities
- 🚀 The Visionary - Bold, dramatic, futuristic
- 🤔 The Skeptic - Dry, questioning, sardonic
- 🌟 The Guide - Warm, helpful, encouraging
Unique Voices: Each character has a distinct voice
Animated Portraits: Visual feedback with lip-sync animation
Conversation History: Track your dialogue with each character

How It Works

Select a character from the dropdown
Click the microphone and speak your message
Listen to the character's response with their unique voice
Continue the conversation or switch characters

Technology

STT: distil-whisper/distil-large-v3 (faster-whisper backend)
LLM: Qwen/Qwen2.5-3B-Instruct
TTS: Kokoro-82M with multiple voice options
Frontend: Gradio with custom CSS animations

Latency Target

Component	Target
STT	<500ms
LLM	<400ms
TTS	<300ms
Total	<1.5s

Built for engaging 1-2 minute voice interactions with AI personalities.