sharif-dal
/

dal-bert

Model card Files Files and versions

arm-on commited on Jul 2, 2022

Commit

434d2bb

·

1 Parent(s): b5478bf

Update README.md

Files changed (1) hide show

README.md +32 -0

README.md CHANGED Viewed

@@ -5,3 +5,35 @@ tags:
 - bert-persian
 license: apache-2.0
 ---

 - bert-persian
 license: apache-2.0
 ---
+DAL-BERT: Another pre-trained language model for Persian
+---
+DAL-BERT is a transformer-based model trained on more than 80 gigabytes of Persian text including both formal and informal (conversational) contexts. The architecture of this model follows the original BERT [[Devlin et al.](https://arxiv.org/abs/1810.04805)].
+How to use the Model
+---
+```python
+from transformers import BertForMaskedLM, BertTokenizer, pipeline
+model = BertForMaskedLM.from_pretrained('sharif-dal/dal-bert')
+tokenizer = BertTokenizer.from_pretrained('sharif-dal/dal-bert')
+fill_sentence = pipeline('fill-mask', model=model, tokenizer=tokenizer)
+fill_sentence('اینجا جمله مورد نظر خود را بنویسید و کلمه موردنظر را [MASK] کنید')
+```
+The Training Data
+---
+The abovementioned model was trained on a bunch of newspapers, news agencies' websites, technology-related sources, people's comments, magazines, literary criticism, and some blogs.
+Evaluation
+---
+| Training Loss | Epoch | Step  | MLM Accuracy |
+|:-------------:|:-----:|:-----:| |
+| 0.0036        | 1.0   | 322906 | |
+Contributors
+---
+- [Arman Malekzadeh](http://ce.sharif.edu/~malekzaadeh/), PhD Student in AI @ Sharif University of Technology [[Linkedin](https://www.linkedin.com/in/arman-malekzadeh/)] [[Github](https://github.com/arm-on)]
+- [Amirhossein Ramazani], Master's Student in AI @ Sharif University of Technology [[Linkedin](https://www.linkedin.com/in/amirhossein-ramazani/)] [[Github](https://github.com/amirhossein1376)]