Instructions to use l3cube-pune/marathi-bert-scratch with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use l3cube-pune/marathi-bert-scratch with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("fill-mask", model="l3cube-pune/marathi-bert-scratch")# Load model directly from transformers import AutoTokenizer, AutoModelForMaskedLM tokenizer = AutoTokenizer.from_pretrained("l3cube-pune/marathi-bert-scratch") model = AutoModelForMaskedLM.from_pretrained("l3cube-pune/marathi-bert-scratch") - Notebooks
- Google Colab
- Kaggle
# Load model directly
from transformers import AutoTokenizer, AutoModelForMaskedLM
tokenizer = AutoTokenizer.from_pretrained("l3cube-pune/marathi-bert-scratch")
model = AutoModelForMaskedLM.from_pretrained("l3cube-pune/marathi-bert-scratch")MahaBERT-Scratch
MahaBERT is a Marathi BERT model. It is a base-BERT model trained from scratch on L3Cube-MahaCorpus and other publicly available Marathi monolingual datasets. [dataset link] (https://github.com/l3cube-pune/MarathiNLP)
More details on the dataset, models, and baseline results can be found in our [paper] (https://arxiv.org/abs/2202.01159)
Best version of this model is available here .
@InProceedings{joshi:2022:WILDRE6,
author = {Joshi, Raviraj},
title = {L3Cube-MahaCorpus and MahaBERT: Marathi Monolingual Corpus, Marathi BERT Language Models, and Resources},
booktitle = {Proceedings of The WILDRE-6 Workshop within the 13th Language Resources and Evaluation Conference},
month = {June},
year = {2022},
address = {Marseille, France},
publisher = {European Language Resources Association},
pages = {97--101}
}
Other Models trained from scratch are listed below:
Marathi-Scratch
Marathi-Tweets-Scratch
Hindi-Scratch
Dev-Scratch
Kannada-Scratch
Telugu-Scratch
Malayalam-Scratch
Gujarati-Scratch
Better versions of Monolingual Indic BERT models are listed below:
Marathi BERT
Marathi RoBERTa
Marathi AlBERT
Hindi BERT
Hindi RoBERTa
Hindi AlBERT
Dev BERT
Dev RoBERTa
Dev AlBERT
Kannada BERT
Telugu BERT
Malayalam BERT
Tamil BERT
Gujarati BERT
Oriya BERT
Bengali BERT
Punjabi BERT
- Downloads last month
- 12
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("fill-mask", model="l3cube-pune/marathi-bert-scratch")