Text Classification
Transformers
PyTorch
Safetensors
English
Telugu
roberta
toxic-comment-classification
How to use prabhaskenche/toxic-comment-classification-using-RoBERTa with Transformers:
    # Use a pipeline as a high-level helper
    from transformers import pipeline

    pipe = pipeline("text-classification", model="prabhaskenche/toxic-comment-classification-using-RoBERTa")

    # Load model directly
    from transformers import AutoTokenizer, AutoModelForSequenceClassification

    tokenizer = AutoTokenizer.from_pretrained("prabhaskenche/toxic-comment-classification-using-RoBERTa")
    model = AutoModelForSequenceClassification.from_pretrained("prabhaskenche/toxic-comment-classification-using-RoBERTa")
Telugu performance?
#1
by n8duo - opened
I don't see any discussion of the model's performance on Telugu here, so I am wondering whether listing Telugu among the supported languages was a mistake. Can anyone comment on this? Does the model actually support Telugu?
Follow-up: based on my initial attempts, the model does not seem to support Telugu. It appears unable to distinguish between benign and highly offensive content, and the tokenizer is unable to tokenize Telugu.
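For anyone who wants to reproduce this kind of check, here is a rough sketch, not the exact procedure used above. It measures how many tokens the tokenizer emits per character (RoBERTa-style tokenizers are typically byte-level BPE, so unsupported scripts tend to shatter into many byte pieces rather than fail outright; whether this checkpoint behaves that way is an assumption), and it checks whether a benign and an offensive input receive essentially the same prediction. The helper names `fragmentation`, `same_prediction`, and `run_on_hub_model`, and the `tol` threshold, are hypothetical and only for illustration.

```python
from typing import Callable, Dict, List, Sequence, Tuple

MODEL_ID = "prabhaskenche/toxic-comment-classification-using-RoBERTa"


def fragmentation(tokenize: Callable[[str], Sequence], text: str) -> float:
    """Average number of tokens produced per non-whitespace character."""
    chars = sum(1 for ch in text if not ch.isspace())
    if chars == 0:
        return 0.0
    return len(tokenize(text)) / chars


def same_prediction(
    classify: Callable[[List[str]], List[dict]],
    benign: str,
    offensive: str,
    tol: float = 0.05,  # hypothetical threshold, tune as needed
) -> bool:
    """True if both texts get the same top label with near-identical scores."""
    a, b = classify([benign, offensive])
    return a["label"] == b["label"] and abs(a["score"] - b["score"]) <= tol


def run_on_hub_model(
    samples: Dict[str, str], benign: str, offensive: str
) -> Tuple[Dict[str, float], bool]:
    """Run both checks against the Hub checkpoint (downloads the model)."""
    # Imported lazily so the helpers above work without transformers installed.
    from transformers import AutoTokenizer, pipeline

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    pipe = pipeline("text-classification", model=MODEL_ID, tokenizer=tokenizer)
    frag = {
        name: round(fragmentation(tokenizer.tokenize, text), 2)
        for name, text in samples.items()
    }
    return frag, same_prediction(pipe, benign, offensive)
```

To try it, call `run_on_hub_model` with a dict such as `{"english": "...", "telugu": "..."}` plus one benign and one offensive Telugu sentence of your own. A much higher tokens-per-character ratio for Telugu than for English, together with `same_prediction` returning `True`, would be consistent with the behavior described above, though it is not proof of how the model was trained.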