abbasNoway commited on
Commit
57c40a2
·
verified ·
1 Parent(s): 776a231

uodate README.md

Browse files
Files changed (1) hide show
  1. README.md +45 -7
README.md CHANGED
@@ -1,15 +1,53 @@
1
  ---
2
  title: Urdu Emoji Predictor
3
- emoji: 💬
4
- colorFrom: yellow
5
  colorTo: purple
6
  sdk: gradio
7
- sdk_version: 5.42.0
8
  app_file: app.py
9
  pinned: false
10
- hf_oauth: true
11
- hf_oauth_scopes:
12
- - inference-api
13
  ---
14
 
15
- An example chatbot using [Gradio](https://gradio.app), [`huggingface_hub`](https://huggingface.co/docs/huggingface_hub/v0.22.2/en/index), and the [Hugging Face Inference API](https://huggingface.co/docs/api-inference/index).
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  title: Urdu Emoji Predictor
3
+ emoji: 🎯
4
+ colorFrom: blue
5
  colorTo: purple
6
  sdk: gradio
7
+ sdk_version: 4.0.0
8
  app_file: app.py
9
  pinned: false
 
 
 
10
  ---
11
 
12
+ # 🎯 Urdu Emoji Predictor
13
+
14
+ An AI-powered tool that predicts relevant emojis for Urdu text using machine learning and semantic similarity.
15
+
16
+ ## 🚀 Try It Out!
17
+
18
+ Simply enter Urdu text and get the most relevant emojis instantly.
19
+
20
+ ## 🎯 Examples
21
+
22
+ - `میں بہت خوش ہوں` → 🎉 🎊 👌
23
+ - `دل ٹوٹ گیا ہے` → 🌚 😞 💔
24
+ - `نیند آ رہی ہے` → 😴 😞 🌚
25
+ - `دوستوں کے ساتھ پارٹی` → 🎉 😋 🎊
26
+
27
+ ## 🔧 How It Works
28
+
29
+ 1. **Text Encoding**: Converts Urdu text to semantic embeddings using multilingual sentence transformers
30
+ 2. **Similarity Search**: Compares text embeddings with pre-computed emoji embeddings
31
+ 3. **Ranking**: Returns top emojis based on cosine similarity scores
32
+
33
+ ## 🏗️ Technical Details
34
+
35
+ - **Model**: `sentence-transformers/paraphrase-multilingual-mpnet-base-v2`
36
+ - **Emojis**: 80 most common emojis from Urdu social media
37
+ - **Method**: Cosine similarity between text and emoji embeddings
38
+ - **Framework**: Gradio + FastAPI
39
+
40
+ ## 📊 Model Performance
41
+
42
+ - **Top-1 Accuracy**: ~16%
43
+ - **Top-3 Accuracy**: ~30%
44
+ - **Trained on**: 800K+ Urdu text-emoji pairs
45
+
46
+ ## 🎮 Usage
47
+
48
+ ```python
49
+ from urdu_specific_embedding import UrduOptimizedPredictor
50
+
51
+ predictor = UrduOptimizedPredictor("models/urdu_optimized_model")
52
+ predictions = predictor.predict_smart("میں بہت خوش ہوں", top_k=3)
53
+ # Returns: [('🎉', 0.555), ('🎊', 0.537), ('👌', 0.439)]