standardmodelbio/smb-vision-large

masked-image-modeling

Generated from Trainer

chenz53 committed on Dec 3, 2024

Commit 816ac34 (verified) · 1 parent: 9ad9547

Update README.md

Files changed (1)

README.md +14 -20

@@ -7,9 +7,9 @@ tags:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# smb-vision-base-1029
-This model is trained from scratch using [VideoMAE](https://huggingface.co/docs/transformers/en/model_doc/videomae) on over 4.7k CT volumes.
 ## Model description
@@ -29,31 +29,25 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 3e-04
-- train_batch_size: 32
 - eval_batch_size: 1
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: cosine
-- num_epochs: 30.0
 ### Training results
 {
-  "_runtime": 54805.860011105,
-  "_step": 4351,
-  "eval/runtime": 17.8428,
-  "eval/samples_per_second": 2.578,
-  "eval/steps_per_second": 2.578,
-  "total_flos": 3.8084565648770335e+21,
-  "train/epoch": 30,
-  "train/global_step": 4350,
-  "train/grad_norm": 0.0735374316573143,
-  "train/learning_rate": 0,
-  "train/loss": 0.5736,
-  "train_loss": 0.5022664608695041,
-  "train_runtime": 54785.1298,
-  "train_samples_per_second": 2.527,
-  "train_steps_per_second": 0.079
 }
@@ -69,7 +63,7 @@ The following hyperparameters were used during training:
 # load data using `dataload.py`
 model = VideoMAEForPreTraining.from_pretrained(
-    standardmodelbio/smb-vision-base,
     trust_remote_code=True,
 )

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# smb-vision-large-1202
+This model is trained from scratch using [VideoMAE](https://huggingface.co/docs/transformers/en/model_doc/videomae) on over 55k CT volumes.
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 3e-04
+- train_batch_size: 16
 - eval_batch_size: 1
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: cosine
+- num_epochs: 10.0
 ### Training results
 {
+  "_runtime": 2641.091489502,
+  "_step": 399,
+  "_timestamp": 1733187755.3146417,
+  "_wandb.runtime": 2660,
+  "train/epoch": 8.425414364640885,
+  "train/global_step": 18300,
+  "train/grad_norm": 0.04110511764883995,
+  "train/learning_rate": 0.0001624558726951691,
+  "train/loss": 0.4292
 }
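The updated training results log a fractional epoch alongside the global step, so the number of optimizer steps per epoch can be recovered by simple division. The following is a minimal sketch that uses only values shown in this diff. The helper name `steps_per_epoch` is made up for illustration, and the planned-total figure assumes the run was scheduled to stop after the listed `num_epochs` with no `max_steps` override, which the README does not state.

```python
def steps_per_epoch(global_step: int, epoch: float) -> float:
    """Optimizer steps per epoch implied by a logged (step, epoch) pair."""
    return global_step / epoch


# Values copied from the removed (base) and added (large) training results.
base = steps_per_epoch(4350, 30)
large = steps_per_epoch(18300, 8.425414364640885)

# Assumption: the large run was scheduled for num_epochs = 10.0.
planned_total = round(large) * 10
fraction_done = 18300 / planned_total

print(f"base:  {base:.0f} steps/epoch")
print(f"large: {large:.0f} steps/epoch")
print(f"large run logged at {fraction_done:.1%} of {planned_total} planned steps")
```

Under these assumptions the base run works out to 145 steps per epoch and the large run to 2172, so the logged snapshot of the large run was taken before the final epoch completed.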
 # load data using `dataload.py`
 model = VideoMAEForPreTraining.from_pretrained(
+    standardmodelbio/smb-vision-large,
     trust_remote_code=True,
 )
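Note that `from_pretrained` expects the repository ID as a string, so the snippet in the README needs quotes around `standardmodelbio/smb-vision-large` to run. Below is a minimal sketch of the corrected call, assuming `transformers` is installed and the Hub is reachable. The wrapper function `load_smb_vision` is hypothetical and exists only to keep the download from running at import time.

```python
def load_smb_vision(model_id: str = "standardmodelbio/smb-vision-large"):
    """Load the pretraining checkpoint from the Hugging Face Hub."""
    # Imported lazily so that defining this helper does not require transformers.
    from transformers import VideoMAEForPreTraining

    return VideoMAEForPreTraining.from_pretrained(
        model_id,                # must be a quoted string, not a bare name
        trust_remote_code=True,  # kept from the README snippet
    )
```

Calling `model = load_smb_vision()` downloads the weights on first use. Input volumes still have to be prepared separately with `dataload.py`, as the README comment indicates.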