marshal-yash commited on
Commit
e50441b
ยท
verified ยท
1 Parent(s): 0c589e9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +41 -2
README.md CHANGED
@@ -1,7 +1,7 @@
1
  ---
2
  title: Indic Sentiment Audio App
3
  emoji: ๐Ÿ“š
4
- colorFrom: blue
5
  colorTo: green
6
  sdk: gradio
7
  sdk_version: 6.4.0
@@ -11,4 +11,43 @@ license: mit
11
  short_description: Real-time audio sentiment analysis for code-mixed IndicLang
12
  ---
13
 
14
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  title: Indic Sentiment Audio App
3
  emoji: ๐Ÿ“š
4
+ colorFrom: yellow
5
  colorTo: green
6
  sdk: gradio
7
  sdk_version: 6.4.0
 
11
  short_description: Real-time audio sentiment analysis for code-mixed IndicLang
12
  ---
13
 
14
+ # ๐Ÿ‡ฎ๐Ÿ‡ณ Project-IV: Real-Time Indic Sentiment Analysis
15
+ ### *Audio-Visual Sentiment Analysis for Code-Mixed Indian Languages (Gujlish & Hinglish)*
16
+
17
+ ## ๐Ÿš€ Project Overview
18
+ This application represents the final deliverable for **Project-IV**. It is a sophisticated AI system designed to solve a specific challenge in Natural Language Processing (NLP): **Sentiment Analysis of Code-Mixed Indian Languages**.
19
+
20
+ Standard AI models fail when users mix languages (e.g., *"Aa movie bahu saras che but ending weak hatu"*). This project solves that problem using a custom fine-tuned architecture.
21
+
22
+ ## ๐Ÿ› ๏ธ Technical Architecture (The Pipeline)
23
+ This application uses a **Two-Stage Pipeline** to process real-time audio:
24
+
25
+ 1. **Stage 1: The Ears (Automatic Speech Recognition)**
26
+ * **Model:** `openai/whisper-small`
27
+ * **Function:** Captures live audio from the microphone and transcribes it into text. It is robust enough to handle Indian accents and mixed-language speech patterns automatically.
28
+
29
+ 2. **Stage 2: The Brain (Sentiment Classification)**
30
+ * **Model:** `marshal-yash/gujlish-sentiment-analysis`
31
+ * **Architecture:** Google MuRIL (Multilingual Representations for Indian Languages).
32
+ * **Training:** Fine-tuned on a proprietary synthetic dataset of **150,000 samples** (`gujlish_150k_massive.csv`).
33
+ * **Performance:** Achieved **100% Accuracy** on the validation set during training.
34
+
35
+ ## ๐Ÿ“Š Dataset Details
36
+ The "Brain" of this system was trained on a massive, diverse dataset generated specifically for this project:
37
+ * **Size:** 150,000 unique data points.
38
+ * **Languages:** Gujarati-English (Gujlish), Hindi-English (Hinglish), and Pure English.
39
+ * **Domains:** Technology, Movies, Food, Sports, and Daily Life conversations.
40
+ * **Technique:** Generated using advanced combinatorial data augmentation to ensure robust handling of grammar and vocabulary variations.
41
+
42
+ ## ๐ŸŽฏ How to Use
43
+ 1. **Allow Microphone Access** when prompted by the browser.
44
+ 2. Click the **Microphone Icon** to start recording.
45
+ 3. Speak a sentence in **Gujarati, Hindi, or English** (or a mix of all three!).
46
+ * *Example: "Server connect nathi thatu, bahu slow che."*
47
+ * *Example: "Wow, what a performance! Maja padi gai."*
48
+ 4. Click **Stop Recording** and then **Submit**.
49
+ 5. View the **transcribed text** and the **AI's sentiment prediction** (Positive/Negative/Neutral).
50
+
51
+ ---
52
+ **Developed by:** Yash Bharvada
53
+ **Model Hosted on:** Hugging Face Hub