MaskedLM / README.md
SRDdev's picture
Update README.md
7ab934c
|
raw
history blame
915 Bytes
---
license: afl-3.0
datasets:
- WillHeld/hinglish_top
language:
- en
- hi
metrics:
- accuracy
library_name: transformers
pipeline_tag: fill-mask
---
### SRDberta
This is a BERT model trained for Masked Language Modeling for Higlish Data.
Hinglish is a term used to describe the hybrid language spoken in India, which combines elements of Hindi and English. It is commonly used in informal conversations and in media such as Bollywood films
### Inference
```
from transformers import AutoTokenizer, AutoModelForMaskedLM, pipeline
tokenizer = AutoTokenizer.from_pretrained("SRDdev/SRDBerta")
model = AutoModelForMaskedLM.from_pretrained("SRDdev/SRDBerta")
fill = pipeline('fill-mask', model='SRDberta', tokenizer='SRDberta')
```
```
fill_mask = fill.tokenizer.mask_token
fill(f'Aap {fill_mask} ho?')
```
### Citation
Author: @[SRDdev](https://huggingface.co/SRDdev)
```
framework : Pytorch
Year: Jan 2023
```