Add library_name metadata and link to GitHub (#1)
- Add library_name metadata and link to GitHub (05c1634c0e0cdca526fd75f963a75331c192ff33)
Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>
README.md CHANGED

````diff
@@ -1,8 +1,11 @@
 ---
+base_model: Qwen/Qwen3-4B
 language:
 - tr
 - en
 license: apache-2.0
+library_name: transformers
+pipeline_tag: text-generation
 tags:
 - text-generation
 - turkish
@@ -14,14 +17,17 @@ tags:
 - continual-pretraining
 - TRUBA
 - MN5
-base_model: Qwen/Qwen3-4B
-pipeline_tag: text-generation
 ---
 
 # Mecellem-Qwen3-4B-TR
 
 [![License: Apache 2.0](https://img.shields.io/badge/License-Apache_2.0-blue.svg)](https://opensource.org/licenses/Apache-2.0)
 
+This repository contains the **Mecellem-Qwen3-4B-TR** model, as presented in the paper [Mecellem Models: Turkish Models Trained from Scratch and Continually Pre-trained for the Legal Domain](https://huggingface.co/papers/2601.16018).
+
+- **GitHub Repository:** [newmindai/mecellem-models](https://github.com/newmindai/mecellem-models)
+- **Paper:** [arXiv:2601.16018](https://arxiv.org/abs/2601.16018)
+
 ## Model Description
 
 Mecellem-Qwen3-4B-TR is a Turkish legal language model adapted through Continual Pre-training (CPT) on Turkish legal and official texts. The model is based on Qwen3-4B decoder architecture (4B parameters) and trained using a single-phase, large-scale CPT process. Unlike the 1.7B model's four-phase curriculum learning, this model employs a single-phase training strategy on a comprehensive dataset, demonstrating that larger model capacity can benefit from direct large-scale domain adaptation.
@@ -177,7 +183,7 @@ If you use this model, please cite our paper:
 ```bibtex
 @article{mecellem2026,
 title={Mecellem Models: Turkish Models Trained from Scratch and Continually Pre-trained for the Legal Domain},
-author={Uğur, Özgür and Göksu, Mahmut and Çimen, Mahmut and Yılmaz, Musa and Şavirdi, Esra and Demir, Alp Talha and Güllüce, Rumeysa and
+author={Uğur, Özgür and Göksu, Mahmut and Çimen, Mahmut and Yılmaz, Musa and Şavirdi, Esra and Demir, Alp Talha and Güllüce, Rumeysa and İclal Çetin, Ömer Can Sağbaş},
 journal={arXiv preprint arXiv:2601.16018},
 year={2026},
 month={January},
@@ -188,6 +194,7 @@ If you use this model, please cite our paper:
 primaryClass={cs.CL}
 }
 ```
+
 ### Base Model References
 
 ```bibtex
@@ -197,4 +204,4 @@ If you use this model, please cite our paper:
 journal={arXiv preprint arXiv:2409.00000},
 year={2024}
 }
-```
+```
````
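
Note that the diff only reorders `base_model` and `pipeline_tag`, which were already present lower in the frontmatter; the genuinely new key is `library_name: transformers`, which tells the Hub which library loads the checkpoint and which "Use this model" snippet to render, while `pipeline_tag: text-generation` places the model under the text-generation task. A minimal usage sketch of what that metadata advertises, assuming the model is published under the Hub repo id `newmindai/Mecellem-Qwen3-4B-TR` (inferred from the model name and the newmindai GitHub organization; the diff does not state the repo id):

```python
# Minimal sketch: load the model through the transformers text-generation
# pipeline, the entry point advertised by `library_name` and `pipeline_tag`.
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="newmindai/Mecellem-Qwen3-4B-TR",  # assumed repo id, not stated in the diff
)

# A Turkish legal prompt, matching the model's continual-pretraining domain.
result = generator(
    "Türk Borçlar Kanunu'na göre sözleşmenin kurulması için",
    max_new_tokens=64,
    do_sample=False,
)
print(result[0]["generated_text"])
```

Once the commit is merged, the metadata can be checked programmatically with `huggingface_hub` (same assumed repo id as above):

```python
# Sketch: confirm the new card metadata is live on the Hub.
from huggingface_hub import model_info

info = model_info("newmindai/Mecellem-Qwen3-4B-TR")  # assumed repo id
print(info.library_name)   # expected: "transformers"
print(info.pipeline_tag)   # expected: "text-generation"
print(info.tags)           # should include "turkish", "continual-pretraining"
```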
|