Kelvinmbewe commited on
Commit
a405f26
Β·
verified Β·
1 Parent(s): 703ab26

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -76
README.md CHANGED
@@ -193,82 +193,22 @@ print(topic_results)
193
  ]
194
  ```
195
  ```
196
- =========================== MULTI‑TASK FLOW ===========================
197
-
198
- πŸ“₯ Input Layer β†’ 🧠 Core Engine β†’ πŸ“ˆ Output Layer
199
- ------------------------------------------------------------------------------------------------
200
- Languages 🌍 β†’ Tokenizer πŸ”€ β†’ Language Predictions
201
- β€’ English β†’ mBERT Encoder 🧠 β†’ β€’ Bemba
202
- β€’ Bemba β†’ CLS Vector 🎯 β†’ β€’ Nyanja
203
- β€’ Nyanja β†’ β†’ β€’ English
204
- β€’ Mixed β†’ β†’ β€’ Code‑Switch
205
- ------------------------------------------------------------------------------------------------
206
- Sentiment Signals ❀️ β†’ Tokenizer πŸ”€ β†’ Sentiment Predictions
207
- β†’ Shared Encoder 🧠 β†’ β€’ Negative
208
- β†’ CLS Vector 🎯 β†’ β€’ Neutral
209
- β†’ β€’ Positive
210
- ------------------------------------------------------------------------------------------------
211
- Ride‑Related Text πŸš— β†’ Tokenizer πŸ”€ β†’ Topic Predictions
212
- β†’ Shared Encoder 🧠 β†’ β€’ Driver Behaviour
213
- β†’ CLS Vector 🎯 β†’ β€’ Payment Issue
214
- β†’ β€’ Support
215
- β†’ β€’ App Performance
216
- β†’ β€’ Availability
217
- ------------------------------------------------------------------------------------------------
218
-
219
-
220
- ```
221
-
222
 
223
  ```
224
- πŸ“ Input Text (Any Language)
225
- β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
226
- β”‚ Input Text (Any Language) β”‚
227
- β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
228
- β”‚
229
- β–Ό
230
-
231
- πŸ”€ Tokenizer (mBERT-based)
232
- β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
233
- β”‚ Tokenizer (mBERT-based) β”‚
234
- β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
235
- β”‚
236
- β–Ό
237
-
238
- 🧠 Shared mBERT Encoder
239
- β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
240
- β”‚ Shared mBERT Encoder Layer β”‚
241
- β”‚ (bert-base-multilingual-cased) β”‚
242
- β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
243
- β”‚
244
- β–Ό
245
-
246
- 🎯 [CLS] Pooled Representation
247
- β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
248
- β”‚ [CLS] Pooled Representation β”‚
249
- β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
250
- β”‚
251
- β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
252
- β”‚ β”‚ β”‚
253
- β–Ό β–Ό β–Ό
254
-
255
- 🌍 Language Head ❀️ Sentiment Head πŸ—‚οΈ Topic Head
256
- β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β” β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β” β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
257
- β”‚ Language Head β”‚ β”‚ Sentiment Head β”‚ β”‚ Topic Head β”‚
258
- β”‚ (Kelvinmbewe/ β”‚ β”‚ (Kelvinmbewe/ β”‚ β”‚ (Kelvinmbewe/ β”‚
259
- β”‚ mbert_Lusaka_ β”‚ β”‚ mbert_LusakaLang_ β”‚ β”‚ mbert_LusakaLang_ β”‚
260
- β”‚ Language_Analysis) β”‚ β”‚ Sentiment_Analysis) β”‚ β”‚ Topic) β”‚
261
- β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜ β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜ β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
262
- β”‚ β”‚ β”‚
263
- β–Ό β–Ό β–Ό
264
-
265
- 🏷️ Language Label 🏷️ Sentiment Label 🏷️ Topic Label
266
- β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β” β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β” β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
267
- β”‚ Language Label β”‚ β”‚ Sentiment Label β”‚ β”‚ Topic Label β”‚
268
- β”‚ (e.g., Bemba, Nyanja, β”‚ β”‚ (Negative/Neutral/ β”‚ β”‚ (Driver, Payment, β”‚
269
- β”‚ English, Code‑Switch)β”‚ β”‚ Positive) β”‚ β”‚ Support, etc.) β”‚
270
- β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜ β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜ β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
271
-
272
- ```
273
-
274
 
 
193
  ]
194
  ```
195
  ```
196
+ =========================== MULTI‑TASK PIPELINE ===========================
197
+
198
+ πŸ“₯ Input β†’ 🧠 Core Engine β†’ πŸ“ˆ Output
199
+ ------------------------------------------------------------------------------------
200
+ Text (Any Language) β†’ Tokenizer πŸ”€ β†’ Language 🌍
201
+ β†’ Shared mBERT Encoder 🧠 β†’ Bemba / Nyanja /
202
+ β†’ CLS Vector 🎯 β†’ English / Mixed
203
+ ------------------------------------------------------------------------------------
204
+ User Feedback πŸ’¬ β†’ Tokenizer πŸ”€ β†’ Sentiment ❀️
205
+ β†’ Shared Encoder 🧠 β†’ Negative / Neutral /
206
+ β†’ CLS Vector 🎯 β†’ Positive
207
+ ------------------------------------------------------------------------------------
208
+ Ride Context πŸš— β†’ Tokenizer πŸ”€ β†’ Topic πŸ—‚οΈ
209
+ β†’ Shared Encoder 🧠 β†’ Driver / Payment /
210
+ β†’ CLS Vector 🎯 β†’ Support / App / Availability
211
+ ------------------------------------------------------------------------------------
 
 
 
 
 
 
 
 
 
 
212
 
213
  ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
214