---
base_model:
- google/gemma-2-2b-it
---

# Motivation

The goal of this project is to adapt large language models for the Arabic language. Because Arabic instruction fine-tuning data is scarce, the focus is on creating a high-quality instruction fine-tuning (IFT) dataset. The project aims to fine-tune models on this dataset and evaluate their performance across various benchmarks.

# Training

This model is the 2B version. It was trained for 2 days on 1 A100 GPU using LoRA with a rank of 128, a learning rate of 1e-4, and a cosine learning rate schedule.
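
As a rough illustration, this setup maps onto a standard PEFT configuration. The sketch below is not the exact training script: the rank, learning rate, and scheduler come from this card, while the LoRA alpha, dropout, target modules, batch size, and epoch count are assumptions.

```python
# Minimal sketch of the described setup, assuming Hugging Face transformers + peft.
# Values marked "assumption" are illustrative and not taken from this card.
import torch
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, AutoTokenizer, TrainingArguments

base = "google/gemma-2-2b-it"
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base, torch_dtype=torch.bfloat16)

lora = LoraConfig(
    r=128,                                                    # rank stated in this card
    lora_alpha=256,                                           # assumption: often set to 2 * r
    lora_dropout=0.05,                                        # assumption
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # assumption
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora)
model.print_trainable_parameters()  # only the adapter weights are trainable

args = TrainingArguments(
    output_dir="barka-2b-it",       # hypothetical output path
    learning_rate=1e-4,             # stated in this card
    lr_scheduler_type="cosine",     # stated in this card
    per_device_train_batch_size=4,  # assumption: sized for a single A100
    gradient_accumulation_steps=8,  # assumption
    num_train_epochs=3,             # assumption
    bf16=True,
    logging_steps=50,
)
```

From here, the model and arguments would plug into a `Trainer` (or `trl`'s `SFTTrainer`) together with the IFT dataset.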

# Evaluation

| Metric  | Slim205/Barka-2b-it |
|---------|---------------------|
| Average | 46.98               |
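
For completeness, a hedged inference sketch. It relies on the standard `transformers` chat-template API for Gemma-style instruction models; the prompt and generation settings are only examples.

```python
# Minimal inference sketch, assuming the standard transformers chat-template API.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Slim205/Barka-2b-it"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# Example Arabic prompt: "What is the capital of Tunisia?"
messages = [{"role": "user", "content": "ما هي عاصمة تونس؟"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```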