Update README.md
README.md
CHANGED
|
@@ -1,7 +1,7 @@
|
|
| 1 |
---
|
| 2 |
title: Indic Sentiment Audio App
|
| 3 |
emoji: ๐
|
| 4 |
-
colorFrom:
|
| 5 |
colorTo: green
|
| 6 |
sdk: gradio
|
| 7 |
sdk_version: 6.4.0
|
|
@@ -11,4 +11,43 @@ license: mit
|
|
| 11 |
short_description: Real-time audio sentiment analysis for code-mixed IndicLang
|
| 12 |
---
|
| 13 |
|
| 14 |
-
|
| 1 |
---
|
| 2 |
title: Indic Sentiment Audio App
|
| 3 |
emoji: ๐
|
| 4 |
+
colorFrom: yellow
|
| 5 |
colorTo: green
|
| 6 |
sdk: gradio
|
| 7 |
sdk_version: 6.4.0
|
|
|
|
| 11 |
short_description: Real-time audio sentiment analysis for code-mixed IndicLang
|
| 12 |
---
|
| 13 |
|
| 14 |
+
# 🇮🇳 Project-IV: Real-Time Indic Sentiment Analysis
|
| 15 |
+
### *Audio-Visual Sentiment Analysis for Code-Mixed Indian Languages (Gujlish & Hinglish)*
|
| 16 |
+
|
| 17 |
+
## Project Overview
|
| 18 |
+
This application is the final deliverable for **Project-IV**. It is an AI system built for a specific challenge in Natural Language Processing (NLP): **Sentiment Analysis of Code-Mixed Indian Languages**.
|
| 19 |
+
|
| 20 |
+
Standard sentiment models often fail when users mix languages within a sentence (e.g., *"Aa movie bahu saras che but ending weak hatu"*). This project addresses that with a custom fine-tuned model.
|
| 21 |
+
|
| 22 |
+
## 🛠️ Technical Architecture (The Pipeline)
|
| 23 |
+
This application uses a **Two-Stage Pipeline** to process real-time audio:
|
| 24 |
+
|
| 25 |
+
1. **Stage 1: The Ears (Automatic Speech Recognition)**
|
| 26 |
+
* **Model:** `openai/whisper-small`
|
| 27 |
+
* **Function:** Captures live audio from the microphone and transcribes it into text. It is robust enough to handle Indian accents and mixed-language speech patterns automatically.
|
| 28 |
+
|
| 29 |
+
2. **Stage 2: The Brain (Sentiment Classification)**
|
| 30 |
+
* **Model:** `marshal-yash/gujlish-sentiment-analysis`
|
| 31 |
+
* **Architecture:** Google MuRIL (Multilingual Representations for Indian Languages).
|
| 32 |
+
* **Training:** Fine-tuned on a proprietary synthetic dataset of **150,000 samples** (`gujlish_150k_massive.csv`).
|
| 33 |
+
* **Performance:** Achieved **100% Accuracy** on the validation set during training.
|
| 34 |
+
|
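The two stages above can be sketched roughly as follows. This is a minimal illustration and not the app's actual code: it assumes both models load through the Hugging Face `transformers` `pipeline` helper, and the function names `load_stages` and `analyze` are hypothetical. The label strings returned by the classifier depend on the model's own configuration and are not confirmed here.

```python
from typing import Any, Callable, Dict, List

# Each stage is treated as a plain callable so it can be swapped or stubbed.
Transcriber = Callable[[str], Dict[str, Any]]        # audio path -> {"text": ...}
Classifier = Callable[[str], List[Dict[str, Any]]]   # text -> [{"label": ..., "score": ...}]


def load_stages():
    """Load both stages (assumption: both work with transformers.pipeline)."""
    from transformers import pipeline  # imported lazily: heavy dependency, downloads models

    asr = pipeline("automatic-speech-recognition", model="openai/whisper-small")
    clf = pipeline("text-classification", model="marshal-yash/gujlish-sentiment-analysis")
    return asr, clf


def analyze(audio_path: str, transcribe: Transcriber, classify: Classifier) -> Dict[str, Any]:
    """Stage 1: speech to text. Stage 2: text to sentiment label."""
    text = transcribe(audio_path)["text"].strip()
    if not text:
        # Nothing was recognised, so there is nothing to classify.
        return {"text": "", "label": None, "score": None}
    top = classify(text)[0]  # highest-scoring prediction
    return {"text": text, "label": top["label"], "score": float(top["score"])}
```

With the real models this would be called as `asr, clf = load_stages()` followed by `analyze("clip.wav", asr, clf)`, where `clip.wav` is a placeholder file name. Passing the stages in as arguments keeps the chaining logic testable without downloading either model.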
| 35 |
+
## Dataset Details
|
| 36 |
+
The "Brain" of this system (the sentiment classifier) was trained on a large synthetic dataset generated specifically for this project:
|
| 37 |
+
* **Size:** 150,000 unique data points.
|
| 38 |
+
* **Languages:** Gujarati-English (Gujlish), Hindi-English (Hinglish), and Pure English.
|
| 39 |
+
* **Domains:** Technology, Movies, Food, Sports, and Daily Life conversations.
|
| 40 |
+
* **Technique:** Generated using combinatorial data augmentation to cover variations in grammar and vocabulary.
|
| 41 |
+
|
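One common way to do combinatorial augmentation is to fill sentence templates with every combination of slot values. The sketch below illustrates that idea only. It is not the generator used to build `gujlish_150k_massive.csv`, and the templates, word lists, label names, and the function name `generate_samples` are all hypothetical, borrowed loosely from the example sentences in this README.

```python
import itertools
from typing import Dict, List, Tuple

# Hypothetical slot values for illustration; a real generator would use far larger lists.
TEMPLATES = ["Aa {subject} bahu {adjective} che", "{subject} bahu {adjective} che"]
SUBJECTS = ["movie", "server", "phone"]
ADJECTIVES: List[Tuple[str, str]] = [
    ("saras", "Positive"),
    ("slow", "Negative"),
    ("weak", "Negative"),
]


def generate_samples(
    templates: List[str],
    subjects: List[str],
    adjectives: List[Tuple[str, str]],
) -> List[Dict[str, str]]:
    """Build one labelled sentence per (template, subject, adjective) combination."""
    rows = []
    for template, subject, (adjective, label) in itertools.product(
        templates, subjects, adjectives
    ):
        text = template.format(subject=subject, adjective=adjective)
        rows.append({"text": text, "label": label})
    return rows


samples = generate_samples(TEMPLATES, SUBJECTS, ADJECTIVES)
```

The number of rows is the product of the list sizes (here 2 × 3 × 3 = 18), which is how a modest vocabulary can expand into a very large dataset.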
| 42 |
+
## 🎯 How to Use
|
| 43 |
+
1. **Allow Microphone Access** when prompted by the browser.
|
| 44 |
+
2. Click the **Microphone Icon** to start recording.
|
| 45 |
+
3. Speak a sentence in **Gujarati, Hindi, or English** (or a mix of all three!).
|
| 46 |
+
* *Example: "Server connect nathi thatu, bahu slow che."*
|
| 47 |
+
* *Example: "Wow, what a performance! Maja padi gai."*
|
| 48 |
+
4. Click **Stop Recording** and then **Submit**.
|
| 49 |
+
5. View the **transcribed text** and the **AI's sentiment prediction** (Positive/Negative/Neutral).
|
| 50 |
+
|
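The record-and-submit flow above maps naturally onto a Gradio `Interface` with a microphone input. The following is a hedged sketch and not the Space's actual `app.py`: `make_handler` and `build_demo` are hypothetical names, and the two stage callables are assumed to behave like `transformers` pipelines (speech recognition returning `{"text": ...}`, classification returning a list of `{"label", "score"}` entries).

```python
from typing import Any, Callable, Dict, List, Optional, Tuple


def make_handler(
    transcribe: Callable[[str], Dict[str, Any]],
    classify: Callable[[str], List[Dict[str, Any]]],
) -> Callable[[Optional[str]], Tuple[str, Dict[str, float]]]:
    """Wrap the two stages in a function with the shape Gradio expects."""

    def handle(audio_path: Optional[str]) -> Tuple[str, Dict[str, float]]:
        if not audio_path:
            # Submit was clicked without a recording.
            return "", {}
        text = transcribe(audio_path)["text"].strip()
        if not text:
            return "", {}
        # gr.Label accepts a mapping of label -> confidence.
        scores = {p["label"]: float(p["score"]) for p in classify(text)}
        return text, scores

    return handle


def build_demo(handler):
    """Build the UI; gradio is imported lazily so the handler stays testable."""
    import gradio as gr

    return gr.Interface(
        fn=handler,
        inputs=gr.Audio(sources=["microphone"], type="filepath"),
        outputs=[gr.Textbox(label="Transcribed text"), gr.Label(label="Sentiment")],
    )
```

Calling `build_demo(handler).launch()` would start the app. Returning an empty result for a missing recording avoids an error when the user submits before speaking.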
| 51 |
+
---
|
| 52 |
+
**Developed by:** Yash Bharvada
|
| 53 |
+
**Model Hosted on:** Hugging Face Hub
|