End of training

Browse files

Files changed (7) hide show

README.md +26 -2
config.json +1 -1
generation_config.json +1 -1
model.safetensors +1 -1
runs/May23_04-45-24_34a35c0ba5e7/events.out.tfevents.1747975528.34a35c0ba5e7.35.0 +3 -0
runs/May23_04-52-40_34a35c0ba5e7/events.out.tfevents.1747975965.34a35c0ba5e7.35.1 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -15,6 +15,8 @@ should probably proofread and complete it, then remove this comment. -->
 # SmolVLM2-500M-Video-Instruct-video-feedback
 This model is a fine-tuned version of [HuggingFaceTB/SmolVLM2-500M-Video-Instruct](https://huggingface.co/HuggingFaceTB/SmolVLM2-500M-Video-Instruct) on an unknown dataset.
 ## Model description
@@ -35,7 +37,7 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 0.0001
 - train_batch_size: 4
-- eval_batch_size: 16
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
@@ -44,11 +46,33 @@ The following hyperparameters were used during training:
 ### Training results
 ### Framework versions
-- Transformers 4.52.0.dev0
 - Pytorch 2.6.0+cu124
 - Datasets 3.6.0
 - Tokenizers 0.21.1

 # SmolVLM2-500M-Video-Instruct-video-feedback
 This model is a fine-tuned version of [HuggingFaceTB/SmolVLM2-500M-Video-Instruct](https://huggingface.co/HuggingFaceTB/SmolVLM2-500M-Video-Instruct) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.0104
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 0.0001
 - train_batch_size: 4
+- eval_batch_size: 4
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 0.0058        | 0.05  | 50   | 0.0106          |
+| 0.0056        | 0.1   | 100  | 0.0105          |
+| 0.0052        | 0.15  | 150  | 0.0123          |
+| 0.0077        | 0.2   | 200  | 0.0108          |
+| 0.0053        | 0.25  | 250  | 0.0107          |
+| 0.0062        | 0.3   | 300  | 0.0109          |
+| 0.0058        | 0.35  | 350  | 0.0104          |
+| 0.006         | 0.4   | 400  | 0.0119          |
+| 0.0053        | 0.45  | 450  | 0.0104          |
+| 0.0066        | 0.5   | 500  | 0.0111          |
+| 0.0057        | 0.55  | 550  | 0.0104          |
+| 0.0059        | 0.6   | 600  | 0.0108          |
+| 0.0053        | 0.65  | 650  | 0.0104          |
+| 0.0052        | 0.7   | 700  | 0.0103          |
+| 0.0054        | 0.75  | 750  | 0.0106          |
+| 0.0064        | 0.8   | 800  | 0.0104          |
+| 0.0056        | 0.85  | 850  | 0.0104          |
+| 0.0069        | 0.9   | 900  | 0.0104          |
+| 0.0052        | 0.95  | 950  | 0.0104          |
+| 0.0053        | 1.0   | 1000 | 0.0104          |
 ### Framework versions
+- Transformers 4.53.0.dev0
 - Pytorch 2.6.0+cu124
 - Datasets 3.6.0
 - Tokenizers 0.21.1

config.json CHANGED Viewed

@@ -126,7 +126,7 @@
       "q4f16": "float16"
     }
   },
-  "transformers_version": "4.52.0.dev0",
   "use_cache": false,
   "use_reentrant_checkpointing": false,
   "vision_config": {

       "q4f16": "float16"
     }
   },
+  "transformers_version": "4.53.0.dev0",
   "use_cache": false,
   "use_reentrant_checkpointing": false,
   "vision_config": {

generation_config.json CHANGED Viewed

@@ -3,5 +3,5 @@
   "bos_token_id": 0,
   "eos_token_id": 49279,
   "pad_token_id": 2,
-  "transformers_version": "4.52.0.dev0"
 }

   "bos_token_id": 0,
   "eos_token_id": 49279,
   "pad_token_id": 2,
+  "transformers_version": "4.53.0.dev0"
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d0a992515a245eb88c8b3a5100906904af9721cc8673ec481c9adb3d344356a4
 size 1015025832

 version https://git-lfs.github.com/spec/v1
+oid sha256:23485d8b383dfefccb1c45db8d62469350ca304e5d844d7d3c29128917244a5a
 size 1015025832

runs/May23_04-45-24_34a35c0ba5e7/events.out.tfevents.1747975528.34a35c0ba5e7.35.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:edac263ea2c3d3887e33fb0fa5c37fcbb2a7136d821299f9b7bedffc20250d5d
+size 10039

runs/May23_04-52-40_34a35c0ba5e7/events.out.tfevents.1747975965.34a35c0ba5e7.35.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:0ced44fd8bd12ad65ea5e8aabf985fabadf84aa8b09c38ad22a4f42421c15e60
+size 22765

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b749cf4a34d05dc663009b7d31e6c7037f30ea5651d486a8d9e3fa8a28ae773b
 size 5368

 version https://git-lfs.github.com/spec/v1
+oid sha256:4961d7b02f17f8d959da1aec84521d58def86edcf25dea8c8365853afcc0241a
 size 5368