Spaces:
Paused
Paused
| title: ChatCal Voice-Enabled AI Assistant | |
| emoji: π€π | |
| colorFrom: blue | |
| colorTo: purple | |
| sdk: gradio | |
| sdk_version: 4.44.1 | |
| app_file: app.py | |
| pinned: true | |
| license: mit | |
| suggested_hardware: cpu-basic | |
| # π€π ChatCal Voice-Enabled AI Assistant | |
| An intelligent AI scheduling assistant with **voice interaction capabilities** powered by WebRTC, Whisper STT, and Bark TTS. Book Google Calendar appointments through natural conversation - now with voice input and audio responses! | |
| ## π― Features | |
| ### π£οΈ Voice Interaction | |
| - π€ **Real-time Speech-to-Text**: WebRTC audio capture with Whisper transcription | |
| - π **Text-to-Speech Responses**: AI responses with natural voice synthesis | |
| - π **Multiple Voice Options**: Choose from different voice personalities | |
| - β‘ **Real-time Processing**: Live transcription as you speak | |
| ### π Smart Calendar Integration | |
| - π€ **AI-Powered Booking**: Natural language appointment scheduling | |
| - π **Google Calendar Sync**: Seamless integration with your calendar | |
| - π **Conflict Detection**: Smart availability checking | |
| - π₯ **Google Meet Integration**: Automatic video conference setup | |
| - π§ **Email Notifications**: Booking confirmations and cancellations | |
| ### π§ Intelligent Conversation | |
| - π **Conversation Memory**: Persistent context across interactions | |
| - π― **Smart Extraction**: Automatically extract names, emails, times | |
| - π **Timezone Awareness**: Global scheduling support | |
| - β° **Flexible Time Parsing**: "tomorrow at 2pm", "next Tuesday", etc. | |
| ## ποΈ Architecture | |
| - **Frontend**: Gradio with WebRTC audio capture | |
| - **AI**: Groq Llama-3.1 with Anthropic Claude fallback | |
| - **STT**: Whisper via external service integration | |
| - **TTS**: Bark text-to-speech synthesis | |
| - **Calendar**: Google Calendar API with OAuth2 | |
| - **Storage**: Google Cloud Secret Manager for persistent auth | |
| ## π Usage | |
| ### Voice Interaction | |
| 1. Click the microphone button to start recording | |
| 2. Speak naturally: "Hi, I'm John. Book a 30-minute meeting tomorrow at 2pm" | |
| 3. Watch real-time transcription appear in the text box | |
| 4. AI responds with voice confirmation of your booking | |
| ### Text Interaction | |
| - Type messages as normal - voice and text work together | |
| - Edit voice transcriptions before sending | |
| - Use quick action buttons for common requests | |
| ### Example Conversations | |
| **Voice**: "Book a Google Meet with Peter next Tuesday at 10 AM for 45 minutes" | |
| **AI Audio Response**: "Perfect! I've scheduled your 45-minute Google Meet with Peter for next Tuesday at 10:00 AM..." | |
| ## π οΈ Development | |
| This space integrates: | |
| - **ChatCal Core**: Calendar booking logic and Google integration | |
| - **STT Service**: External Whisper service for speech recognition | |
| - **TTS Service**: External Bark service for voice synthesis | |
| - **WebRTC**: Browser-based audio capture and streaming | |
| ## π Privacy & Security | |
| - Secure OAuth2 authentication with Google | |
| - Audio processed in real-time, not stored | |
| - Persistent token storage via Google Secret Manager | |
| - All calendar operations respect your existing permissions | |
| ## π Contact | |
| For business scheduling needs or technical support: pgits.job@gmail.com |