How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("fill-mask", model="faisalq/bert-base-arapoembert")
# Load model directly
from transformers import AutoTokenizer, AutoModelForMaskedLM

tokenizer = AutoTokenizer.from_pretrained("faisalq/bert-base-arapoembert")
model = AutoModelForMaskedLM.from_pretrained("faisalq/bert-base-arapoembert")
Quick Links

AraPoemBERT is the first pre-trained large language model focused exclusively on Arabic poetry. The dataset used in pretraining the model contains more than 2 million verses. The code files along with the results are available on repo.

BibTex

If you use SaudiBERT model in your scientific publication, or if you find the resources in this repository useful, please cite our paper as follows (citation details to be updated):

@article{qarah2024arapoembert,
  title={AraPoemBERT: A Pretrained Language Model for Arabic Poetry Analysis},
  author={Qarah, Faisal},
  journal={arXiv preprint arXiv:2403.12392},
  year={2024}
}

Downloads last month
109,152
Inference Providers NEW
Examples
Mask token: [MASK]

Space using faisalq/bert-base-arapoembert 1

Paper for faisalq/bert-base-arapoembert