pneupane
/

table-transformer-finetuned

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/table-structure-recognition-v1.1-all](https://huggingface.co/microsoft/table-structure-recognition-v1.1-all) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.9790
 ## Model description
@@ -35,21 +35,40 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 1e-05
-- train_batch_size: 4
-- eval_batch_size: 4
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch  | Step | Validation Loss |
-|:-------------:|:------:|:----:|:---------------:|
-| 6.5464        | 3.7037 | 100  | 6.0902          |
-| 6.0842        | 7.4074 | 200  | 4.9790          |
 ### Framework versions

 This model is a fine-tuned version of [microsoft/table-structure-recognition-v1.1-all](https://huggingface.co/microsoft/table-structure-recognition-v1.1-all) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.2890
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1e-06
+- train_batch_size: 1
+- eval_batch_size: 1
 - seed: 42
+- gradient_accumulation_steps: 8
+- total_train_batch_size: 8
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 20
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch   | Step | Validation Loss |
+|:-------------:|:-------:|:----:|:---------------:|
+| 9.2523        | 1.0     | 14   | 9.3207          |
+| 7.9949        | 2.0     | 28   | 9.0533          |
+| 8.2292        | 3.0     | 42   | 8.6547          |
+| 7.1495        | 4.0     | 56   | 8.0844          |
+| 6.9055        | 5.0     | 70   | 7.3339          |
+| 7.0771        | 6.0     | 84   | 7.2344          |
+| 5.6554        | 7.0     | 98   | 6.6026          |
+| 5.5633        | 8.0     | 112  | 6.3610          |
+| 4.9506        | 9.0     | 126  | 5.9567          |
+| 4.0856        | 10.0    | 140  | 5.9191          |
+| 4.4286        | 11.0    | 154  | 5.6374          |
+| 2.9043        | 12.0    | 168  | 4.9525          |
+| 3.4755        | 13.0    | 182  | 4.6846          |
+| 3.4152        | 14.0    | 196  | 4.3171          |
+| 2.8456        | 15.0    | 210  | 3.5661          |
+| 2.4149        | 16.0    | 224  | 3.6150          |
+| 1.9328        | 17.0    | 238  | 3.2264          |
+| 1.8503        | 18.0    | 252  | 2.8540          |
+| 1.596         | 18.5981 | 260  | 2.2890          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4d3269f9b14183a80e589bcf5c436be185bdd350bbefd5b9caa5625c24af3ee6
 size 115437156

 version https://git-lfs.github.com/spec/v1
+oid sha256:8ad6d3b9dcefbfc815bfb345c26214214d83e71ed99fb0a83ab3ca58566c0d63
 size 115437156

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7607d6e3e627387644f0df32e16112c21bc754938a2e67abd3872d22846f21ac
-size 5368

 version https://git-lfs.github.com/spec/v1
+oid sha256:c598bf80301118bf0af4c60ba02df7fdade4f91983f49366f19a458209746503
+size 5304