Model save

Files changed (4) hide show

README.md CHANGED Viewed

@@ -1,16 +1,14 @@
 ---
-datasets: HuggingFaceH4/Bespoke-Stratos-17k
 library_name: transformers
 model_name: Qwen2.5-1.5B-Open-R1-Distill
 tags:
 - generated_from_trainer
-- open-r1
 licence: license
 ---
 # Model Card for Qwen2.5-1.5B-Open-R1-Distill
-This model is a fine-tuned version of [None](https://huggingface.co/None) on the [HuggingFaceH4/Bespoke-Stratos-17k](https://huggingface.co/datasets/HuggingFaceH4/Bespoke-Stratos-17k) dataset.
 It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start
@@ -26,7 +24,7 @@ print(output["generated_text"])
 ## Training procedure
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/2741919970-hustvl/huggingface/runs/3jcwqhpk)
 This model was trained with SFT.
@@ -35,8 +33,8 @@ This model was trained with SFT.
 - TRL: 0.16.0.dev0
 - Transformers: 4.50.0.dev0
-- Pytorch: 2.6.0
-- Datasets: 3.3.2
 - Tokenizers: 0.21.0
 ## Citations

 ---
 library_name: transformers
 model_name: Qwen2.5-1.5B-Open-R1-Distill
 tags:
 - generated_from_trainer
 licence: license
 ---
 # Model Card for Qwen2.5-1.5B-Open-R1-Distill
+This model is a fine-tuned version of [None](https://huggingface.co/None).
 It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start
 ## Training procedure
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/2741919970-hustvl/huggingface/runs/902j0zxa)
 This model was trained with SFT.
 - TRL: 0.16.0.dev0
 - Transformers: 4.50.0.dev0
+- Pytorch: 2.5.1
+- Datasets: 3.3.1
 - Tokenizers: 0.21.0
 ## Citations

all_results.json CHANGED Viewed

@@ -1,13 +1,8 @@
 {
-    "eval_loss": 0.9327651262283325,
-    "eval_runtime": 8.8796,
-    "eval_samples": 100,
-    "eval_samples_per_second": 7.433,
-    "eval_steps_per_second": 0.563,
     "total_flos": 0.0,
-    "train_loss": 0.9946218774200528,
-    "train_runtime": 21041.6427,
     "train_samples": 16610,
-    "train_samples_per_second": 1.566,
-    "train_steps_per_second": 0.049
 }

 {
     "total_flos": 0.0,
+    "train_loss": 0.8600422314672648,
+    "train_runtime": 45916.2025,
     "train_samples": 16610,
+    "train_samples_per_second": 1.085,
+    "train_steps_per_second": 0.068
 }

train_results.json CHANGED Viewed

@@ -1,8 +1,8 @@
 {
     "total_flos": 0.0,
-    "train_loss": 0.9946218774200528,
-    "train_runtime": 21041.6427,
     "train_samples": 16610,
-    "train_samples_per_second": 1.566,
-    "train_steps_per_second": 0.049
 }

 {
     "total_flos": 0.0,
+    "train_loss": 0.8600422314672648,
+    "train_runtime": 45916.2025,
     "train_samples": 16610,
+    "train_samples_per_second": 1.085,
+    "train_steps_per_second": 0.068
 }

trainer_state.json CHANGED Viewed

The diff for this file is too large to render. See raw diff