almaghrabima
/

ALLaM-Thinking

Model card Files Files and versions

almaghrabima commited on Mar 21, 2025

Commit

7a3ca27

·

verified ·

1 Parent(s): 4771944

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -78,7 +78,7 @@ This model has been optimized using [Unsloth](https://github.com/unslothai/unslo
 ALLaM-Thinking was trained using a combination of techniques:
 - Base architecture fine-tuned on diverse Arabic datasets
-- GRPO (Generalized Reinforced Preference Optimization) for better alignment
 - Specialized training on mathematical reasoning and step-by-step problem-solving
 ## Performance

 ALLaM-Thinking was trained using a combination of techniques:
 - Base architecture fine-tuned on diverse Arabic datasets
+- GRPO (Group Relative Policy Optimization) for better alignment
 - Specialized training on mathematical reasoning and step-by-step problem-solving
 ## Performance