PersonaFlow / README.md
Kailing-Leifang's picture
Upload README.md with huggingface_hub
d35a6a0 verified
metadata
title: PersonaFlow
emoji: 🎭
colorFrom: indigo
colorTo: purple
sdk: gradio
sdk_version: 5.9.1
app_file: app.py
pinned: false
hf_oauth: true
license: apache-2.0
short_description: Speak with AI characters that have distinct personalities
tags:
  - voice
  - audio
  - tts
  - stt
  - character
  - roleplay

🎭 PersonaFlow

Interactive Audio Character Demo - Speak with AI characters that have distinct personalities, voices, and animated portraits.

Features

  • Voice Input: Speak into your microphone (up to 10 seconds)
  • Multiple Characters: Choose from 3 distinct personalities
    • πŸš€ The Visionary - Bold, dramatic, futuristic
    • πŸ€” The Skeptic - Dry, questioning, sardonic
    • 🌟 The Guide - Warm, helpful, encouraging
  • Unique Voices: Each character has a distinct voice
  • Animated Portraits: Visual feedback with lip-sync animation
  • Conversation History: Track your dialogue with each character

How It Works

  1. Select a character from the dropdown
  2. Click the microphone and speak your message
  3. Listen to the character's response with their unique voice
  4. Continue the conversation or switch characters

Technology

  • STT: distil-whisper/distil-large-v3 (faster-whisper backend)
  • LLM: Qwen/Qwen2.5-3B-Instruct
  • TTS: Kokoro-82M with multiple voice options
  • Frontend: Gradio with custom CSS animations

Latency Target

Component Target
STT <500ms
LLM <400ms
TTS <300ms
Total <1.5s

Built for engaging 1-2 minute voice interactions with AI personalities.