L3Cube-MahaNLP: Marathi Natural Language Processing Datasets, Models, and Library
Paper • 2205.14728 • Published
# Load model directly
from transformers import AutoTokenizer, AutoModelForSequenceClassification
tokenizer = AutoTokenizer.from_pretrained("ScriptEdgeAI/MarathiSentiment-Bloom-560m")
model = AutoModelForSequenceClassification.from_pretrained("ScriptEdgeAI/MarathiSentiment-Bloom-560m")Trained by:
Assistance:
Supervision:
| Model | Data | Accuracy |
|---|---|---|
| bigscience/bloom-560m | Validation | 34.7 |
| bigscience/bloom-560m | Test | 34.8 |
| ScriptEdgeAI/MarathiSentiment-Bloom-560m | Validation | 76.0 |
| ScriptEdgeAI/MarathiSentiment-Bloom-560m | Test | 77.0 |
Citation to L3CubePune by the dataset usage.
@article {joshi2022l3cube,
title= {L3Cube-MahaNLP: Marathi Natural Language Processing Datasets, Models, and Library},
author= {Joshi, Raviraj},
journal= {arXiv preprint arXiv:2205.14728},
year= {2022}
}
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-classification", model="ScriptEdgeAI/MarathiSentiment-Bloom-560m")