Training in progress, step 30

Files changed (3)

README.md CHANGED

@@ -1,20 +1,18 @@
 ---
 base_model: Qwen/Qwen2.5-7B-Instruct
-datasets: HuggingFaceH4/deita-10k-v0-sft
 library_name: transformers
 model_name: Qwen2.5-7B-Instruct
 tags:
 - generated_from_trainer
 - sft
 - trl
-- trackio
-- trackio:https://Barry661-Qwen2.5-7B-Instruct.hf.space?project=huggingface&runs=Barry661-1773039020&sidebar=collapsed
 licence: license
 ---
 # Model Card for Qwen2.5-7B-Instruct
-This model is a fine-tuned version of [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) on the [HuggingFaceH4/deita-10k-v0-sft](https://huggingface.co/datasets/HuggingFaceH4/deita-10k-v0-sft) dataset.
 It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start
@@ -31,7 +29,7 @@ print(output["generated_text"])
 ## Training procedure
-[<img src="https://raw.githubusercontent.com/gradio-app/trackio/refs/heads/main/trackio/assets/badge.png" alt="Visualize in Trackio" title="Visualize in Trackio" width="150" height="24"/>](https://Barry661-Qwen2.5-7B-Instruct.hf.space?project=huggingface&runs=Barry661-1773039020&sidebar=collapsed)
 This model was trained with SFT.

 ---
 base_model: Qwen/Qwen2.5-7B-Instruct
 library_name: transformers
 model_name: Qwen2.5-7B-Instruct
 tags:
 - generated_from_trainer
 - sft
+- trackio:https://Barry661-Qwen2.5-7B-Instruct.hf.space?project=huggingface&runs=Barry661-1773112846&sidebar=collapsed
 - trl
 licence: license
 ---
 # Model Card for Qwen2.5-7B-Instruct
+This model is a fine-tuned version of [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct).
 It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start
 ## Training procedure
+[<img src="https://raw.githubusercontent.com/gradio-app/trackio/refs/heads/main/trackio/assets/badge.png" alt="Visualize in Trackio" title="Visualize in Trackio" width="150" height="24"/>](https://Barry661-Qwen2.5-7B-Instruct.hf.space?project=huggingface&runs=Barry661-1773112846&sidebar=collapsed)
 This model was trained with SFT.

adapter_config.json CHANGED

@@ -32,13 +32,13 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "k_proj",
     "up_proj",
-    "down_proj",
-    "q_proj",
     "o_proj",
-    "v_proj",
-    "gate_proj"
   ],
   "target_parameters": null,
   "task_type": null,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "v_proj",
+    "gate_proj",
     "k_proj",
     "up_proj",
     "o_proj",
+    "down_proj",
+    "q_proj"
   ],
   "target_parameters": null,
   "task_type": null,
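
The `target_modules` hunk above lists the same seven projection names on both sides; only their order differs. A minimal sketch of how one might confirm that with an order-insensitive comparison follows. The two lists are copied from the hunk; the helper name `diff_modules` is hypothetical, and the guess that the reordering comes from an unordered collection being serialized is an assumption, not something the diff states.

```python
# Old and new "target_modules" lists, copied from the two sides of the hunk.
old_target_modules = [
    "k_proj", "up_proj", "down_proj", "q_proj", "o_proj", "v_proj", "gate_proj",
]
new_target_modules = [
    "v_proj", "gate_proj", "k_proj", "up_proj", "o_proj", "down_proj", "q_proj",
]


def diff_modules(old, new):
    """Return (added, removed) module names, ignoring list order.

    Hypothetical helper for illustration; not part of any library.
    """
    old_set, new_set = set(old), set(new)
    added = sorted(new_set - old_set)
    removed = sorted(old_set - new_set)
    return added, removed


added, removed = diff_modules(old_target_modules, new_target_modules)
order_only_change = not added and not removed
print("added:", added)
print("removed:", removed)
print("order-only change:", order_only_change)
```

If both lists come back empty, the hunk is a reordering only and the set of adapted modules is unchanged between the two commits.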

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e55ae8371fc0f6d304843e3033d619cc7822e1672a9ca8b05620e15355320934
 size 161533584

 version https://git-lfs.github.com/spec/v1
+oid sha256:8c4717821557f8e0e634a5c4d568632ba895c2abd56bbd71b5de75c4e2169603
 size 161533584
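
The two Git LFS pointers above differ only in `oid`; `size` is identical, which is consistent with the adapter weights being updated while the file layout stays the same. A small sketch of parsing and comparing the two pointers is shown below. The pointer texts are copied from the hunk; `parse_lfs_pointer` is a hypothetical helper written for this illustration, assuming the simple `key value` line format visible in the diff.

```python
# Pointer contents copied from the old and new sides of the hunk.
OLD_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:e55ae8371fc0f6d304843e3033d619cc7822e1672a9ca8b05620e15355320934
size 161533584
"""

NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:8c4717821557f8e0e634a5c4d568632ba895c2abd56bbd71b5de75c4e2169603
size 161533584
"""


def parse_lfs_pointer(text):
    """Parse 'key value' lines of an LFS pointer into a dict (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)

# Report which fields changed between the two commits.
changed = sorted(k for k in old if old[k] != new.get(k))
print("changed fields:", changed)
print("size unchanged:", old["size"] == new["size"])
```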