A newer version of the Streamlit SDK is available: 1.56.0
title: VoiceCal - Voice-Enabled AI Scheduling Assistant
emoji: π€π
colorFrom: blue
colorTo: purple
sdk: streamlit
app_file: app.py
pinned: false
license: mit
short_description: Voice-enabled AI assistant with WebRTC integration
π€π VoiceCal - Voice-Enabled AI Scheduling Assistant
An intelligent AI scheduling assistant with voice interaction capabilities powered by WebRTC, Whisper STT, and Bark TTS. Book Google Calendar appointments through natural conversation - now with voice input and audio responses!
π New Deployment: VoiceCal is the latest iteration with improved WebSocket-based voice services and enhanced reliability. Previous ChatCal.ai space has been superseded by this deployment.
π― Features
π£οΈ Voice Interaction
- π€ Real-time Speech-to-Text: WebRTC audio capture with Whisper transcription
- π Text-to-Speech Responses: AI responses with natural voice synthesis
- π Multiple Voice Options: Choose from different voice personalities
- β‘ Real-time Processing: Live transcription as you speak
π Smart Calendar Integration
- π€ AI-Powered Booking: Natural language appointment scheduling
- π Google Calendar Sync: Seamless integration with your calendar
- π Conflict Detection: Smart availability checking
- π₯ Google Meet Integration: Automatic video conference setup
- π§ Email Notifications: Booking confirmations and cancellations
π§ Intelligent Conversation
- π Conversation Memory: Persistent context across interactions
- π― Smart Extraction: Automatically extract names, emails, times
- π Timezone Awareness: Global scheduling support
- β° Flexible Time Parsing: "tomorrow at 2pm", "next Tuesday", etc.
ποΈ Architecture
- Frontend: Gradio with WebRTC audio capture
- AI: Groq Llama-3.1 with Anthropic Claude fallback
- STT: Whisper via external service integration
- TTS: Bark text-to-speech synthesis
- Calendar: Google Calendar API with OAuth2
- Storage: Google Cloud Secret Manager for persistent auth
π Usage
Voice Interaction
- Click the microphone button to start recording
- Speak naturally: "Hi, I'm John. Book a 30-minute meeting tomorrow at 2pm"
- Watch real-time transcription appear in the text box
- AI responds with voice confirmation of your booking
Text Interaction
- Type messages as normal - voice and text work together
- Edit voice transcriptions before sending
- Use quick action buttons for common requests
Example Conversations
Voice: "Book a Google Meet with Peter next Tuesday at 10 AM for 45 minutes" AI Audio Response: "Perfect! I've scheduled your 45-minute Google Meet with Peter for next Tuesday at 10:00 AM..."
π οΈ Development
This space integrates:
- VoiceCal Core: Calendar booking logic and Google integration
- STT Service: External Whisper service for speech recognition via WebSocket
- TTS Service: External Bark service for voice synthesis via WebSocket
- WebRTC: Browser-based audio capture and streaming
π§ Service Configuration
For optimal performance, the external voice services should be configured with:
- STT Service:
STT_SERVICE_MODE=websocket(enables WebSocket endpoints) - TTS Service:
TTS_SERVICE_MODE=websocket(enables WebSocket endpoints) - Service URLs:
- STT:
wss://pgits-stt-gpu-service.hf.space/ws/stt - TTS:
wss://pgits-tts-gpu-service.hf.space/ws/tts
- STT:
π Privacy & Security
- Secure OAuth2 authentication with Google
- Audio processed in real-time, not stored
- Persistent token storage via Google Secret Manager
- All calendar operations respect your existing permissions
π Contact
For business scheduling needs or technical support: pgits.job@gmail.com