File size: 1,506 Bytes
89b5007
7923604
 
 
89b5007
 
7923604
89b5007
beb71e9
89b5007
 
7923604
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
---
title: 'LexCAT: Taglish Sentiment Analysis'
colorFrom: blue
colorTo: purple
sdk: gradio
app_file: app.py
pinned: true
license: cc-by-4.0
sdk_version: 5.47.0
---

# LexCAT: Lexicon-Enhanced Sentiment Analysis for Tagalog–English Code-Switched Text

**Author**: Glenn Marcus D. Cinco  
**Institution**: Mapúa University  
**Thesis**: *LexCAT: A Lexicon-Based Approach for Code-Switching Analysis with Transformers Using XLM-RoBERTa and LexiLiksik*  
**Model**: Fine-tuned XLM-RoBERTa + LexiLiksik lexicon  
**Dataset**: FiReCS (10,487 Taglish reviews)  
**Accuracy**: 84.31%  
**Specialty**: Detects intra-sentential sentiment shifts (e.g., “Maganda pero expensive” → Negative)

---

## 🧠 Try It Out

Enter any Taglish sentence — LexCAT will predict its sentiment and show confidence scores.

Examples:
- *“sobrang lambot ng burger pero expensive tlga”* → ❌ Negative
- *“Salamat sa nyo nagana nmn po sya kaya super thank you ako”* → ✅ Positive
- *“Ang ganda ng service, one star!”* → ❌ Negative

---

## 📚 Model Card

See full documentation, methodology, and citation at:  
👉 https://huggingface.co/your-hf-username/LexCAT-LexiLiksik-Final

---

## 🎓 Academic Use

Cite this model in your research:

```bibtex
@mastersthesis{cinco2025lexcat,
  author = {Cinco, Glenn Marcus D.},
  title = {LexCAT: A Lexicon-Based Approach for Code-Switching Analysis with Transformers Using XLM-RoBERTa and LexiLiksik},
  school = {Mapúa University},
  year = {2025}
}
```