Training in progress, epoch 1

Files changed (4)

README.md CHANGED

@@ -4,7 +4,7 @@ library_name: transformers
 model_name: mql-finetune
 tags:
 - generated_from_trainer
-- grpo
 - trl
 licence: license
 ---
@@ -31,7 +31,7 @@ print(output["generated_text"])
-This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models](https://huggingface.co/papers/2402.03300).
 ### Framework versions
@@ -43,16 +43,7 @@ This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing
 ## Citations
-Cite GRPO as:
-```bibtex
-@article{shao2024deepseekmath,
-    title        = {{DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models}},
-    author       = {Zhihong Shao and Peiyi Wang and Qihao Zhu and Runxin Xu and Junxiao Song and Mingchuan Zhang and Y. K. Li and Y. Wu and Daya Guo},
-    year         = 2024,
-    eprint       = {arXiv:2402.03300},
-}
-```
 Cite TRL as:

 model_name: mql-finetune
 tags:
 - generated_from_trainer
+- sft
 - trl
 licence: license
 ---
+This model was trained with SFT.
 ### Framework versions
 ## Citations
 Cite TRL as:

adapter_config.json CHANGED

@@ -30,12 +30,12 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "gate_proj",
     "o_proj",
-    "up_proj",
     "down_proj",
-    "k_proj",
     "v_proj",
     "q_proj"
   ],
   "target_parameters": null,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "o_proj",
     "down_proj",
     "v_proj",
+    "gate_proj",
+    "up_proj",
+    "k_proj",
     "q_proj"
   ],
   "target_parameters": null,

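The `target_modules` hunk in `adapter_config.json` lists the same seven projection names on both sides, in a different order. A minimal sketch to confirm that the change is a reordering only, with the two lists copied by hand from the diff (the helper name `same_module_set` is hypothetical):

```python
# Lists copied from the old and new sides of the adapter_config.json diff.
old_modules = ["gate_proj", "o_proj", "up_proj", "down_proj", "k_proj", "v_proj", "q_proj"]
new_modules = ["o_proj", "down_proj", "v_proj", "gate_proj", "up_proj", "k_proj", "q_proj"]


def same_module_set(a, b):
    """Return True when both lists name the same modules, ignoring order."""
    return len(a) == len(b) and set(a) == set(b)


only_reordered = same_module_set(old_modules, new_modules)
print(only_reordered)
```

If the check holds, the adapter targets the same layers before and after this commit. The order difference is probably a serialization detail rather than a configuration change, though the diff alone does not confirm that.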
adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:60d95b10b6e140a9626a7058d5038528f2ff80148dc4569b881db56052046509
-size 40

 version https://git-lfs.github.com/spec/v1
+oid sha256:c9517e917b2cf5d12b013f431f8eda9893d620fd61d908335df57774106e6101
+size 66127776
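
The `adapter_model.safetensors` entries are Git LFS pointer files, not the weights themselves: each has a `version` line, an `oid` line with the hash algorithm and digest, and a `size` line in bytes. A minimal sketch of reading those fields, assuming the three-line layout shown in this diff (the function name `parse_lfs_pointer` is hypothetical, not part of any library):

```python
def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>", separated by the first space.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid value has the form "<algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the new side of the diff (leading "+" removed).
new_pointer = """\
version https://git-lfs.github.com/spec/v1
oid sha256:c9517e917b2cf5d12b013f431f8eda9893d620fd61d908335df57774106e6101
size 66127776
"""

info = parse_lfs_pointer(new_pointer)
print(info["hash_algo"], info["size"])
```

Read this way, the commit replaces a 40-byte object with one of 66127776 bytes (about 66 MB). That is consistent with the earlier file holding little or no weight data, but the diff itself does not say why.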

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1b467ffd0aa71bdef98f0a1f0424e340fb630c605dfec4d209d46d919feb0e3b
-size 7121

 version https://git-lfs.github.com/spec/v1
+oid sha256:9aa382ae7bb50f31fa3d152a9426d770fc36b2d3eaf859bfef058ffaaff5ffd2
+size 5713