| | --- |
| | license: cc-by-4.0 |
| | language: hi |
| | --- |
| | |
| | ## HindBERT-Scratch |
| | HindBERT is a Hindi BERT model. It is a base-BERT model trained from scratch on publicly available Hindi monolingual datasets. |
| | [project link] (https://github.com/l3cube-pune/MarathiNLP) |
| |
|
| | More details on the dataset, models, and baseline results can be found in our [paper] (<a href='https://arxiv.org/abs/2211.11418'> link </a>) |
| |
|
| | The best version of model is shared <a href='https://huggingface.co/l3cube-pune/hindi-bert-v2'> here </a> |
| |
|
| | Citing: |
| | ``` |
| | @article{joshi2022l3cubehind, |
| | author = {Joshi, Raviraj}, |
| | year = {2022}, |
| | month = {09}, |
| | pages = {}, |
| | title = {L3Cube-HindBERT and DevBERT: Pre-Trained BERT Transformer models for Devanagari based Hindi and Marathi Languages}, |
| | doi = {10.13140/RG.2.2.14606.84809} |
| | } |
| | ``` |
| |
|
| | Other Models trained from scratch are listed below: <br> |
| | <a href='https://huggingface.co/l3cube-pune/marathi-bert-scratch'> Marathi-Scratch </a> <br> |
| | <a href='https://huggingface.co/l3cube-pune/marathi-tweets-bert-scratch'> Marathi-Tweets-Scratch </a> <br> |
| | <a href='https://huggingface.co/l3cube-pune/hindi-bert-scratch'> Hindi-Scratch </a> <br> |
| | <a href='https://huggingface.co/l3cube-pune/hindi-marathi-dev-bert-scratch'> Dev-Scratch </a> <br> |
| | <a href='https://huggingface.co/l3cube-pune/kannada-bert-scratch'> Kannada-Scratch </a> <br> |
| | <a href='https://huggingface.co/l3cube-pune/telugu-bert-scratch'> Telugu-Scratch </a> <br> |
| | <a href='https://huggingface.co/l3cube-pune/malayalam-bert-scratch'> Malayalam-Scratch </a> <br> |
| | <a href='https://huggingface.co/l3cube-pune/gujarati-bert-scratch'> Gujarati-Scratch </a> <br> |
| |
|
| | Better versions of Monolingual Indic BERT models are listed below: <br> |
| | <a href='https://huggingface.co/l3cube-pune/marathi-bert-v2'> Marathi BERT </a> <br> |
| | <a href='https://huggingface.co/l3cube-pune/marathi-roberta'> Marathi RoBERTa </a> <br> |
| | <a href='https://huggingface.co/l3cube-pune/marathi-albert'> Marathi AlBERT </a> <br> |
| |
|
| | <a href='https://huggingface.co/l3cube-pune/hindi-bert-v2'> Hindi BERT </a> <br> |
| | <a href='https://huggingface.co/l3cube-pune/hindi-roberta'> Hindi RoBERTa </a> <br> |
| | <a href='https://huggingface.co/l3cube-pune/hindi-albert'> Hindi AlBERT </a> <br> |
| |
|
| | <a href='https://huggingface.co/l3cube-pune/hindi-marathi-dev-bert'> Dev BERT </a> <br> |
| | <a href='https://huggingface.co/l3cube-pune/hindi-marathi-dev-roberta'> Dev RoBERTa </a> <br> |
| | <a href='https://huggingface.co/l3cube-pune/hindi-marathi-dev-albert'> Dev AlBERT </a> <br> |
| |
|
| | <a href='https://huggingface.co/l3cube-pune/kannada-bert'> Kannada BERT </a> <br> |
| | <a href='https://huggingface.co/l3cube-pune/telugu-bert'> Telugu BERT </a> <br> |
| | <a href='https://huggingface.co/l3cube-pune/malayalam-bert'> Malayalam BERT </a> <br> |
| | <a href='https://huggingface.co/l3cube-pune/tamil-bert'> Tamil BERT </a> <br> |
| | <a href='https://huggingface.co/l3cube-pune/gujarati-bert'> Gujarati BERT </a> <br> |
| | <a href='https://huggingface.co/l3cube-pune/odia-bert'> Oriya BERT </a> <br> |
| | <a href='https://huggingface.co/l3cube-pune/bengali-bert'> Bengali BERT </a> <br> |
| | <a href='https://huggingface.co/l3cube-pune/punjabi-bert'> Punjabi BERT </a> <br> |