Training in progress, step 10

Files changed (4) hide show

README.md CHANGED Viewed

@@ -1,18 +1,17 @@
 ---
 base_model: Qwen/Qwen2.5-VL-3B-Instruct
-datasets: lmms-lab/multimodal-open-r1-8k-verified
 library_name: transformers
 model_name: Qwen2.5-VL-3B-Instruct-Thinking
 tags:
 - generated_from_trainer
-- grpo
 - trl
 licence: license
 ---
 # Model Card for Qwen2.5-VL-3B-Instruct-Thinking
-This model is a fine-tuned version of [Qwen/Qwen2.5-VL-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-3B-Instruct) on the [lmms-lab/multimodal-open-r1-8k-verified](https://huggingface.co/datasets/lmms-lab/multimodal-open-r1-8k-verified) dataset.
 It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start
@@ -37,7 +36,7 @@ This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing
 - TRL: 0.22.0.dev0
 - Transformers: 4.55.0
-- Pytorch: 2.8.0+cu126
 - Datasets: 4.0.0
 - Tokenizers: 0.21.4

 ---
 base_model: Qwen/Qwen2.5-VL-3B-Instruct
 library_name: transformers
 model_name: Qwen2.5-VL-3B-Instruct-Thinking
 tags:
 - generated_from_trainer
 - trl
+- grpo
 licence: license
 ---
 # Model Card for Qwen2.5-VL-3B-Instruct-Thinking
+This model is a fine-tuned version of [Qwen/Qwen2.5-VL-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-3B-Instruct).
 It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start
 - TRL: 0.22.0.dev0
 - Transformers: 4.55.0
+- Pytorch: 2.6.0+cu124
 - Datasets: 4.0.0
 - Tokenizers: 0.21.4

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:15be507a01940bd549ca58762394c90ef0c64a8f56cea5c290f19bd9dc6ed61b
 size 7393888

 version https://git-lfs.github.com/spec/v1
+oid sha256:a0cec4cb2b8bbb020c1e706b65b5eecf0a83a80193c8a07d5882dfa4ba26aa80
 size 7393888

runs/Aug08_05-50-33_e9baed9eaec7/events.out.tfevents.1754632252.e9baed9eaec7.184.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:4132bab9572b1f6ebdb8a25ca29eaec9e9b78fdb6a0f4c9056946af06c79f12b
+size 10601

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:73a8c18c0fe004efd26f09e3e0731b3358c4bc6700d7661f43d85ee581a21741
-size 6993

 version https://git-lfs.github.com/spec/v1
+oid sha256:131042c4a2334e315d9e0e0b90a758a53888c968f07cf068c453a0cdb6c68a5c
+size 6584