ChatCal.ai-1 / README.md
Peter Michael Gits
FORCE: HF sync issue - change README pinned + sync file v0.3.21
7f07efb

A newer version of the Gradio SDK is available: 6.11.0

Upgrade
metadata
title: ChatCal Voice-Enabled AI Assistant
emoji: πŸŽ€πŸ“…
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 4.44.1
app_file: app.py
pinned: true
license: mit
suggested_hardware: cpu-basic

πŸŽ€πŸ“… ChatCal Voice-Enabled AI Assistant

An intelligent AI scheduling assistant with voice interaction capabilities powered by WebRTC, Whisper STT, and Bark TTS. Book Google Calendar appointments through natural conversation - now with voice input and audio responses!

🎯 Features

πŸ—£οΈ Voice Interaction

  • 🎀 Real-time Speech-to-Text: WebRTC audio capture with Whisper transcription
  • πŸ”Š Text-to-Speech Responses: AI responses with natural voice synthesis
  • 🎭 Multiple Voice Options: Choose from different voice personalities
  • ⚑ Real-time Processing: Live transcription as you speak

πŸ“… Smart Calendar Integration

  • πŸ€– AI-Powered Booking: Natural language appointment scheduling
  • πŸ“… Google Calendar Sync: Seamless integration with your calendar
  • πŸ” Conflict Detection: Smart availability checking
  • πŸŽ₯ Google Meet Integration: Automatic video conference setup
  • πŸ“§ Email Notifications: Booking confirmations and cancellations

🧠 Intelligent Conversation

  • πŸ’­ Conversation Memory: Persistent context across interactions
  • 🎯 Smart Extraction: Automatically extract names, emails, times
  • 🌍 Timezone Awareness: Global scheduling support
  • ⏰ Flexible Time Parsing: "tomorrow at 2pm", "next Tuesday", etc.

πŸ—οΈ Architecture

  • Frontend: Gradio with WebRTC audio capture
  • AI: Groq Llama-3.1 with Anthropic Claude fallback
  • STT: Whisper via external service integration
  • TTS: Bark text-to-speech synthesis
  • Calendar: Google Calendar API with OAuth2
  • Storage: Google Cloud Secret Manager for persistent auth

πŸš€ Usage

Voice Interaction

  1. Click the microphone button to start recording
  2. Speak naturally: "Hi, I'm John. Book a 30-minute meeting tomorrow at 2pm"
  3. Watch real-time transcription appear in the text box
  4. AI responds with voice confirmation of your booking

Text Interaction

  • Type messages as normal - voice and text work together
  • Edit voice transcriptions before sending
  • Use quick action buttons for common requests

Example Conversations

Voice: "Book a Google Meet with Peter next Tuesday at 10 AM for 45 minutes" AI Audio Response: "Perfect! I've scheduled your 45-minute Google Meet with Peter for next Tuesday at 10:00 AM..."

πŸ› οΈ Development

This space integrates:

  • ChatCal Core: Calendar booking logic and Google integration
  • STT Service: External Whisper service for speech recognition
  • TTS Service: External Bark service for voice synthesis
  • WebRTC: Browser-based audio capture and streaming

πŸ” Privacy & Security

  • Secure OAuth2 authentication with Google
  • Audio processed in real-time, not stored
  • Persistent token storage via Google Secret Manager
  • All calendar operations respect your existing permissions

πŸ“ž Contact

For business scheduling needs or technical support: pgits.job@gmail.com