---
title: VoiceCal - Voice-Enabled AI Scheduling Assistant
emoji: 🎤📅
colorFrom: blue
colorTo: purple
sdk: streamlit
app_file: app.py
pinned: false
license: mit
short_description: Voice-enabled AI assistant with WebRTC integration
---

# 🎤📅 VoiceCal - Voice-Enabled AI Scheduling Assistant

An intelligent AI scheduling assistant with **voice interaction capabilities** powered by WebRTC, Whisper STT, and Bark TTS. Book Google Calendar appointments through natural conversation, now with voice input and audio responses!

> **🚀 New Deployment**: VoiceCal is the latest iteration, with improved WebSocket-based voice services and enhanced reliability. The previous ChatCal.ai space has been superseded by this deployment.

## 🎯 Features

### 🗣️ Voice Interaction

- 🎤 **Real-time Speech-to-Text**: WebRTC audio capture with Whisper transcription
- 🔊 **Text-to-Speech Responses**: AI responses with natural voice synthesis
- 🎭 **Multiple Voice Options**: Choose from different voice personalities
- ⚡ **Real-time Processing**: Live transcription as you speak

### 📅 Smart Calendar Integration

- 🤖 **AI-Powered Booking**: Natural language appointment scheduling
- 📅 **Google Calendar Sync**: Seamless integration with your calendar
- 🔍 **Conflict Detection**: Smart availability checking
- 🎥 **Google Meet Integration**: Automatic video conference setup
- 📧 **Email Notifications**: Booking confirmations and cancellations

### 🧠 Intelligent Conversation

- 💭 **Conversation Memory**: Persistent context across interactions
- 🎯 **Smart Extraction**: Automatically extracts names, emails, and times
- 🌍 **Timezone Awareness**: Global scheduling support
- ⏰ **Flexible Time Parsing**: "tomorrow at 2pm", "next Tuesday", etc.
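
To give a feel for what flexible time parsing involves, here is a minimal sketch using only the Python standard library. It is not VoiceCal's actual parser: `parse_when` and `parse_time_of_day` are hypothetical names, a weekday name is assumed to mean its next occurrence strictly after today, and 9:00 AM is an assumed default when no time is spoken.

```python
import re
from datetime import datetime, timedelta

WEEKDAYS = ["monday", "tuesday", "wednesday", "thursday",
            "friday", "saturday", "sunday"]


def parse_time_of_day(text):
    """Return (hour, minute) from phrases like '2pm' or '10:30 am', else None."""
    match = re.search(r"\b(\d{1,2})(?::(\d{2}))?\s*(am|pm)\b", text, re.IGNORECASE)
    if not match:
        return None
    hour = int(match.group(1)) % 12          # '12am' -> 0, '12pm' -> 12 after the shift below
    minute = int(match.group(2) or 0)
    if match.group(3).lower() == "pm":
        hour += 12
    return hour, minute


def parse_when(text, now):
    """Resolve a relative phrase to a datetime, or None if no day is recognised."""
    lowered = text.lower()
    day = None
    if "tomorrow" in lowered:
        day = now.date() + timedelta(days=1)
    elif "today" in lowered:
        day = now.date()
    else:
        for index, name in enumerate(WEEKDAYS):
            if name in lowered:
                # Assumption: a weekday name means its next occurrence after today.
                delta = (index - now.weekday() - 1) % 7 + 1
                day = now.date() + timedelta(days=delta)
                break
    if day is None:
        return None
    # Assumption: default to 9:00 AM when the phrase carries no time of day.
    hour, minute = parse_time_of_day(lowered) or (9, 0)
    return datetime(day.year, day.month, day.day, hour, minute)


if __name__ == "__main__":
    reference = datetime(2024, 1, 1, 12, 0)  # a Monday
    print(parse_when("tomorrow at 2pm", reference))
    print(parse_when("next Tuesday at 10:30 am", reference))
```

Passing `now` explicitly keeps the function deterministic and easy to test. The sketch ignores timezones and the many phrasings a production parser would need to handle.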
## 🏗️ Architecture

- **Frontend**: Gradio with WebRTC audio capture
- **AI**: Groq Llama-3.1 with Anthropic Claude fallback
- **STT**: Whisper via external service integration
- **TTS**: Bark text-to-speech synthesis
- **Calendar**: Google Calendar API with OAuth2
- **Storage**: Google Cloud Secret Manager for persistent auth

## 🚀 Usage

### Voice Interaction

1. Click the microphone button to start recording
2. Speak naturally: "Hi, I'm John. Book a 30-minute meeting tomorrow at 2pm"
3. Watch real-time transcription appear in the text box
4. AI responds with voice confirmation of your booking

### Text Interaction

- Type messages as normal - voice and text work together
- Edit voice transcriptions before sending
- Use quick action buttons for common requests

### Example Conversations

**Voice**: "Book a Google Meet with Peter next Tuesday at 10 AM for 45 minutes"

**AI Audio Response**: "Perfect! I've scheduled your 45-minute Google Meet with Peter for next Tuesday at 10:00 AM..."

## 🛠️ Development

This space integrates:

- **VoiceCal Core**: Calendar booking logic and Google integration
- **STT Service**: External Whisper service for speech recognition via WebSocket
- **TTS Service**: External Bark service for voice synthesis via WebSocket
- **WebRTC**: Browser-based audio capture and streaming

### 🔧 Service Configuration

For optimal performance, the external voice services should be configured with:

- **STT Service**: `STT_SERVICE_MODE=websocket` (enables WebSocket endpoints)
- **TTS Service**: `TTS_SERVICE_MODE=websocket` (enables WebSocket endpoints)
- **Service URLs**:
  - STT: `wss://pgits-stt-gpu-service.hf.space/ws/stt`
  - TTS: `wss://pgits-tts-gpu-service.hf.space/ws/tts`

## 🔐 Privacy & Security

- Secure OAuth2 authentication with Google
- Audio processed in real-time, not stored
- Persistent token storage via Google Secret Manager
- All calendar operations respect your existing permissions

## 📞 Contact

For business scheduling needs or technical support:
pgits.job@gmail.com
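
For reference, the settings listed under Service Configuration could be gathered in one place along these lines. This is only a sketch: `VoiceServiceConfig` and `load_voice_config` are hypothetical names, `STT_SERVICE_URL` and `TTS_SERVICE_URL` are assumed override variables that this README does not define, and falling back to `websocket` mode when a mode variable is unset is also an assumption.

```python
import os
from dataclasses import dataclass

# Endpoints as listed in the Service Configuration section.
DEFAULT_STT_URL = "wss://pgits-stt-gpu-service.hf.space/ws/stt"
DEFAULT_TTS_URL = "wss://pgits-tts-gpu-service.hf.space/ws/tts"


@dataclass(frozen=True)
class VoiceServiceConfig:
    stt_mode: str
    tts_mode: str
    stt_url: str
    tts_url: str


def load_voice_config(env=None):
    """Build a config from a mapping of environment variables (defaults to os.environ)."""
    env = os.environ if env is None else env
    config = VoiceServiceConfig(
        stt_mode=env.get("STT_SERVICE_MODE", "websocket"),
        tts_mode=env.get("TTS_SERVICE_MODE", "websocket"),
        # Hypothetical override variables; the README only gives the URLs themselves.
        stt_url=env.get("STT_SERVICE_URL", DEFAULT_STT_URL),
        tts_url=env.get("TTS_SERVICE_URL", DEFAULT_TTS_URL),
    )
    # Fail early if an endpoint is not a WebSocket URL.
    for url in (config.stt_url, config.tts_url):
        if not url.startswith(("ws://", "wss://")):
            raise ValueError(f"Expected a ws:// or wss:// URL, got: {url}")
    return config


if __name__ == "__main__":
    print(load_voice_config())
```

Accepting a plain mapping instead of always reading `os.environ` makes the loader straightforward to exercise in tests without touching the real environment.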