DSL-13-SRMAP
/

XLM-R_WR

sentiment-classification

Model card Files Files and versions

XLM-R_WR / README.md

Raj411's picture

Create README.md

ca23949 verified 5 months ago

|

history blame contribute delete

3.24 kB

	---
	license: cc-by-4.0
	tags:
	- sentiment-classification
	- telugu
	- xlm-r
	- multilingual
	- baseline
	language: te
	datasets:
	- DSL-13-SRMAP/TeSent_Benchmark-Dataset
	model_name: XLM-R_WR
	---

	# XLM-R_WR: XLM-RoBERTa Telugu Sentiment Classification Model (With Rationale)

	## Model Overview

	XLM-R_WR is a Telugu sentiment classification model based on XLM-RoBERTa (XLM-R), a general-purpose multilingual transformer developed by Facebook AI.
	The "WR" in the model name stands for "With Rationale", indicating that this model is trained using both sentiment labels and human-annotated rationales from the TeSent_Benchmark-Dataset.

	---

	## Model Details

	- Architecture: XLM-RoBERTa (transformer-based, multilingual)
	- Pretraining Data: 2.5TB of filtered Common Crawl data across 100+ languages, including Telugu
	- Pretraining Objective: Masked Language Modeling (MLM), no Next Sentence Prediction (NSP)
	- Fine-tuning Data: [TeSent_Benchmark-Dataset](https://huggingface.co/datasets/dsl-13-srmap/tesent_benchmark-dataset), using both sentence-level sentiment labels and rationale annotations
	- Task: Sentence-level sentiment classification (3-way)
	- Rationale Usage: Used during training and/or inference ("WR" = With Rationale)

	---

	## Intended Use

	- Primary Use: Benchmarking Telugu sentiment classification on the TeSent_Benchmark-Dataset, especially as a baseline for models trained with and without rationales
	- Research Setting: Suitable for cross-lingual and multilingual NLP research, as well as explainable AI in low-resource settings

	---

	## Why XLM-R?

	XLM-R is designed for cross-lingual understanding and contextual modeling, providing strong transfer learning capabilities and improved downstream performance compared to mBERT. When fine-tuned with local Telugu data, XLM-R delivers solid results for sentiment analysis.
	However, Telugu-specific models like MuRIL or L3Cube-Telugu-BERT may offer better cultural and linguistic alignment for purely Telugu tasks.

	---

	## Performance and Limitations

	Strengths:
	- Strong transfer learning and contextual modeling for multilingual NLP
	- Good performance for Telugu sentiment analysis when fine-tuned with local data
	- Provides explicit rationales for predictions, aiding explainability
	- Useful as a cross-lingual and multilingual baseline

	Limitations:
	- May be outperformed by Telugu-specific models for culturally nuanced tasks
	- Requires sufficient labeled Telugu data and rationale annotations for best performance

	---

	## Training Data

	- Dataset: [TeSent_Benchmark-Dataset](https://huggingface.co/datasets/dsl-13-srmap/tesent_benchmark-dataset)
	- Data Used: The Content (Telugu sentence), Label (sentiment label), and Rationale (human-annotated rationale) columns are used for XLM-R_WR training

	---

	## Language Coverage

	- Language: Telugu (`te`)
	- Model Scope: This implementation and evaluation focus strictly on Telugu sentiment classification

	---

	## Citation and More Details

	For detailed experimental setup, evaluation metrics, and comparisons with rationale-based models, please refer to our paper.



	---

	## License

	Released under [CC BY 4.0](LICENSE).