akhooli
/

ModernBERT-ar-base-tiny

Generated from Trainer

Model card Files Files and versions

akhooli commited on Jan 12, 2025

Commit

06397c9

·

verified ·

1 Parent(s): 039810b

Update README.md

Files changed (1) hide show

README.md +8 -10

README.md CHANGED Viewed

@@ -10,23 +10,24 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# ModernBERT-ar-base-small4
-This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 ## Model description
-More information needed
 ## Intended uses & limitations
-More information needed
 ## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters
@@ -45,9 +46,6 @@ The following hyperparameters were used during training:
 - training_steps: 50000
 - mixed_precision_training: Native AMP
-### Training results
 ### Framework versions

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# ModernBERT-ar-base-tiny
+This model was trained on [Fineweb2 Ar sample](https://huggingface.co/datasets/akhooli/fineweb2_ar_24_sample) dataset.
+The tokenizer was also trained using the same dataset.
+See [sample code](https://colab.research.google.com/drive/1CUsUsJQV4ZzJar2987yAzCTn8ve4hR5b?usp=sharing)
+(usage and training) and [initial post](https://www.linkedin.com/posts/akhooli_a-micro-arabic-modern-bert-a-couple-weeks-activity-7282005813357875202-SAGk)
 ## Model description
+ModernBERT Arabic (MLM) experiment.
 ## Intended uses & limitations
+Educational and explorational uses only. Limited data, not fully trained.
 ## Training and evaluation data
+Evaluation on 5% of the data, uses 2 GPUs.
 ### Training hyperparameters
 - training_steps: 50000
 - mixed_precision_training: Native AMP
 ### Framework versions