A newer version of the Gradio SDK is available: 6.13.0
title: Indic Sentiment Audio App
emoji: ๐
colorFrom: yellow
colorTo: green
sdk: gradio
sdk_version: 6.4.0
app_file: app.py
pinned: false
license: mit
short_description: Real-time audio sentiment analysis for code-mixed IndicLang
๐ฎ๐ณ Project-IV: Real-Time Indic Sentiment Analysis
Audio-Visual Sentiment Analysis for Code-Mixed Indian Languages (Gujlish & Hinglish)
๐ Project Overview
This application represents the final deliverable for Project-IV. It is a sophisticated AI system designed to solve a specific challenge in Natural Language Processing (NLP): Sentiment Analysis of Code-Mixed Indian Languages.
Standard AI models fail when users mix languages (e.g., "Aa movie bahu saras che but ending weak hatu"). This project solves that problem using a custom fine-tuned architecture.
๐ ๏ธ Technical Architecture (The Pipeline)
This application uses a Two-Stage Pipeline to process real-time audio:
Stage 1: The Ears (Automatic Speech Recognition)
- Model:
openai/whisper-small - Function: Captures live audio from the microphone and transcribes it into text. It is robust enough to handle Indian accents and mixed-language speech patterns automatically.
- Model:
Stage 2: The Brain (Sentiment Classification)
- Model:
marshal-yash/gujlish-sentiment-analysis - Architecture: Google MuRIL (Multilingual Representations for Indian Languages).
- Training: Fine-tuned on a proprietary synthetic dataset of 150,000 samples (
gujlish_150k_massive.csv). - Performance: Achieved 100% Accuracy on the validation set during training.
- Model:
๐ Dataset Details
The "Brain" of this system was trained on a massive, diverse dataset generated specifically for this project:
- Size: 150,000 unique data points.
- Languages: Gujarati-English (Gujlish), Hindi-English (Hinglish), and Pure English.
- Domains: Technology, Movies, Food, Sports, and Daily Life conversations.
- Technique: Generated using advanced combinatorial data augmentation to ensure robust handling of grammar and vocabulary variations.
๐ฏ How to Use
- Allow Microphone Access when prompted by the browser.
- Click the Microphone Icon to start recording.
- Speak a sentence in Gujarati, Hindi, or English (or a mix of all three!).
- Example: "Server connect nathi thatu, bahu slow che."
- Example: "Wow, what a performance! Maja padi gai."
- Click Stop Recording and then Submit.
- View the transcribed text and the AI's sentiment prediction (Positive/Negative/Neutral).
Developed by: Yash Bharvada Model Hosted on: Hugging Face Hub