End of training

Browse files

Files changed (5) hide show

README.md +6 -21
pytorch_model.bin +1 -1
runs/Feb12_11-43-31_mcity-rtx-4090/events.out.tfevents.1739378611.mcity-rtx-4090.1808996.0 +3 -0
runs/Feb12_11-54-31_mcity-rtx-4090/events.out.tfevents.1739379272.mcity-rtx-4090.1816145.0 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/conditional-detr-resnet-50](https://huggingface.co/microsoft/conditional-detr-resnet-50) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.4142
 ## Model description
@@ -38,34 +38,19 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size: 32
 - eval_batch_size: 8
 - seed: 0
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: cosine
-- num_epochs: 30
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch | Step  | Validation Loss |
-|:-------------:|:-----:|:-----:|:---------------:|
-| 1.1217        | 1.0   | 2644  | 1.4738          |
-| 0.9689        | 2.0   | 5288  | 1.4067          |
-| 0.9194        | 3.0   | 7932  | 1.3879          |
-| 0.8369        | 4.0   | 10576 | 1.3840          |
-| 0.8061        | 5.0   | 13220 | 1.4551          |
-| 0.7761        | 6.0   | 15864 | 1.4041          |
-| 0.7278        | 7.0   | 18508 | 1.3229          |
-| 0.7241        | 8.0   | 21152 | 1.4653          |
-| 0.7117        | 9.0   | 23796 | 1.3242          |
-| 0.6811        | 10.0  | 26440 | 1.3248          |
-| 0.6471        | 11.0  | 29084 | 1.3078          |
-| 0.6293        | 12.0  | 31728 | 1.3126          |
-| 0.638         | 13.0  | 34372 | 1.3298          |
-| 0.6134        | 14.0  | 37016 | 1.3913          |
-| 0.5773        | 15.0  | 39660 | 1.3278          |
-| 0.5653        | 16.0  | 42304 | 1.4142          |
 ### Framework versions

 This model is a fine-tuned version of [microsoft/conditional-detr-resnet-50](https://huggingface.co/microsoft/conditional-detr-resnet-50) on the generator dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.2694
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
+- train_batch_size: 1
 - eval_batch_size: 8
 - seed: 0
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: cosine
+- num_epochs: 1
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 0.8592        | 1.0   | 5288 | 1.2694          |
 ### Framework versions

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:97822ecc2244d2f82fcd04da7e252d7cdd5917173f33a805534a3ae4a7f3388c
 size 174213178

 version https://git-lfs.github.com/spec/v1
+oid sha256:17eee6e449d888b0ee139026a3b28c3fbadb1f49a19eadb15f38fc7483a93d4c
 size 174213178

runs/Feb12_11-43-31_mcity-rtx-4090/events.out.tfevents.1739378611.mcity-rtx-4090.1808996.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:fb765ffa2790d67dcbe573696abf535d2e92054a84b1a95bde5d78323c57248f
+size 6157

runs/Feb12_11-54-31_mcity-rtx-4090/events.out.tfevents.1739379272.mcity-rtx-4090.1816145.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:912d88bdf8fb1a792eb366203f6bac720c5dfc7f4c926ad83b0fa68ac3346f2d
+size 8893

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d9244733d7440acb17477ba07b72211f417ccc61dd8f33f2f5b76484a6b1692d
 size 5624

 version https://git-lfs.github.com/spec/v1
+oid sha256:0f1df0bf996ee39e592d902ec53ad2dcad0016391ad98ede53ddc4937fac66f3
 size 5624