fmnxl commited on
Commit
677796b
·
verified ·
1 Parent(s): 894d4b0

End of training

Browse files
Files changed (2) hide show
  1. README.md +12 -11
  2. model.safetensors +1 -1
README.md CHANGED
@@ -21,11 +21,11 @@ should probably proofread and complete it, then remove this comment. -->
21
 
22
  This model is a fine-tuned version of [jhu-clsp/mmBERT-base](https://huggingface.co/jhu-clsp/mmBERT-base) on an unknown dataset.
23
  It achieves the following results on the evaluation set:
24
- - Loss: 0.7960
25
- - Precision: 0.6647
26
- - Recall: 0.7145
27
- - F1: 0.6887
28
- - Accuracy: 0.8221
29
 
30
  ## Model description
31
 
@@ -44,7 +44,7 @@ More information needed
44
  ### Training hyperparameters
45
 
46
  The following hyperparameters were used during training:
47
- - learning_rate: 1e-05
48
  - train_batch_size: 8
49
  - eval_batch_size: 16
50
  - seed: 42
@@ -52,16 +52,17 @@ The following hyperparameters were used during training:
52
  - total_train_batch_size: 64
53
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
54
  - lr_scheduler_type: linear
55
- - lr_scheduler_warmup_ratio: 0.15
56
- - num_epochs: 2
57
- - label_smoothing_factor: 0.1
58
 
59
  ### Training results
60
 
61
  | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
62
  |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
63
- | 0.8178 | 1.0 | 342 | 0.8293 | 0.5955 | 0.7124 | 0.6488 | 0.8050 |
64
- | 0.7927 | 2.0 | 684 | 0.7960 | 0.6647 | 0.7145 | 0.6887 | 0.8221 |
 
65
 
66
 
67
  ### Framework versions
 
21
 
22
  This model is a fine-tuned version of [jhu-clsp/mmBERT-base](https://huggingface.co/jhu-clsp/mmBERT-base) on an unknown dataset.
23
  It achieves the following results on the evaluation set:
24
+ - Loss: 0.1572
25
+ - Precision: 0.3040
26
+ - Recall: 0.2644
27
+ - F1: 0.2828
28
+ - Accuracy: 0.9822
29
 
30
  ## Model description
31
 
 
44
  ### Training hyperparameters
45
 
46
  The following hyperparameters were used during training:
47
+ - learning_rate: 2e-05
48
  - train_batch_size: 8
49
  - eval_batch_size: 16
50
  - seed: 42
 
52
  - total_train_batch_size: 64
53
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
54
  - lr_scheduler_type: linear
55
+ - lr_scheduler_warmup_ratio: 0.1
56
+ - num_epochs: 3
57
+ - label_smoothing_factor: 0.05
58
 
59
  ### Training results
60
 
61
  | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
62
  |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
63
+ | 0.1594 | 1.0 | 342 | 0.1572 | 0.3040 | 0.2644 | 0.2828 | 0.9822 |
64
+ | 0.1587 | 2.0 | 684 | 0.1549 | 0.3445 | 0.2204 | 0.2688 | 0.9832 |
65
+ | 0.1557 | 3.0 | 1026 | 0.1546 | 0.3412 | 0.1952 | 0.2483 | 0.9831 |
66
 
67
 
68
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9e3bfb4a16f7d99d3690821683b001ed183085476e250851274ab0b31f301c4c
3
  size 1230141424
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:10871116cf239b89cc6869579188bfe4dd3a0a457ab2181f429e528471a6a6ff
3
  size 1230141424