ele-sage/mdeberta-v3-base-name-classifier-v2

Browse files

Files changed (5) hide show

README.md +30 -16
model.safetensors +1 -1
runs/Dec07_22-37-29_elesage-pc/events.out.tfevents.1765165139.elesage-pc.285136.0 +3 -0
runs/Dec07_22-52-34_elesage-pc/events.out.tfevents.1765166044.elesage-pc.293700.0 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -21,10 +21,10 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/mdeberta-v3-base](https://huggingface.co/microsoft/mdeberta-v3-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1317
 - Accuracy: 0.9946
 - Precision: 0.9989
-- Recall: 0.9914
 - F1: 0.9951
 ## Model description
@@ -52,25 +52,39 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.05
 - num_epochs: 1
-- label_smoothing_factor: 0.05
 ### Training results
 | Training Loss | Epoch  | Step  | Validation Loss | Accuracy | Precision | Recall | F1     |
 |:-------------:|:------:|:-----:|:---------------:|:--------:|:---------:|:------:|:------:|
-| 0.1375        | 0.0718 | 4000  | 0.1431          | 0.9909   | 0.9991    | 0.9847 | 0.9918 |
-| 0.1391        | 0.1436 | 8000  | 0.1356          | 0.9930   | 0.9973    | 0.9902 | 0.9937 |
-| 0.1344        | 0.2154 | 12000 | 0.1361          | 0.9934   | 0.9983    | 0.9899 | 0.9941 |
-| 0.1387        | 0.2872 | 16000 | 0.1333          | 0.9937   | 0.9984    | 0.9903 | 0.9943 |
-| 0.1353        | 0.3590 | 20000 | 0.1340          | 0.9940   | 0.9985    | 0.9907 | 0.9946 |
-| 0.1337        | 0.4308 | 24000 | 0.1332          | 0.9939   | 0.9982    | 0.9909 | 0.9946 |
-| 0.1332        | 0.5026 | 28000 | 0.1332          | 0.9940   | 0.9977    | 0.9916 | 0.9946 |
-| 0.1359        | 0.5744 | 32000 | 0.1319          | 0.9943   | 0.9992    | 0.9907 | 0.9949 |
-| 0.1314        | 0.6462 | 36000 | 0.1326          | 0.9943   | 0.9984    | 0.9914 | 0.9949 |
-| 0.132         | 0.7180 | 40000 | 0.1318          | 0.9945   | 0.9990    | 0.9911 | 0.9950 |
-| 0.1309        | 0.7898 | 44000 | 0.1319          | 0.9945   | 0.9989    | 0.9913 | 0.9951 |
-| 0.1319        | 0.8616 | 48000 | 0.1318          | 0.9945   | 0.9988    | 0.9914 | 0.9951 |
-| 0.1288        | 0.9334 | 52000 | 0.1317          | 0.9946   | 0.9989    | 0.9914 | 0.9951 |
 ### Framework versions

 This model is a fine-tuned version of [microsoft/mdeberta-v3-base](https://huggingface.co/microsoft/mdeberta-v3-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0732
 - Accuracy: 0.9946
 - Precision: 0.9989
+- Recall: 0.9913
 - F1: 0.9951
 ## Model description
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.05
 - num_epochs: 1
+- label_smoothing_factor: 0.02
 ### Training results
 | Training Loss | Epoch  | Step  | Validation Loss | Accuracy | Precision | Recall | F1     |
 |:-------------:|:------:|:-----:|:---------------:|:--------:|:---------:|:------:|:------:|
+| 0.0914        | 0.0359 | 2000  | 0.0889          | 0.9907   | 0.9952    | 0.9882 | 0.9917 |
+| 0.0796        | 0.0718 | 4000  | 0.0864          | 0.9907   | 0.9991    | 0.9843 | 0.9916 |
+| 0.0808        | 0.1077 | 6000  | 0.0809          | 0.9919   | 0.9944    | 0.9910 | 0.9927 |
+| 0.0828        | 0.1436 | 8000  | 0.0774          | 0.9930   | 0.9976    | 0.9899 | 0.9937 |
+| 0.0787        | 0.1795 | 10000 | 0.0771          | 0.9931   | 0.9989    | 0.9886 | 0.9938 |
+| 0.0761        | 0.2154 | 12000 | 0.0774          | 0.9935   | 0.9984    | 0.9899 | 0.9942 |
+| 0.0779        | 0.2513 | 14000 | 0.0771          | 0.9935   | 0.9991    | 0.9892 | 0.9941 |
+| 0.0833        | 0.2872 | 16000 | 0.0751          | 0.9937   | 0.9985    | 0.9903 | 0.9944 |
+| 0.0812        | 0.3231 | 18000 | 0.0764          | 0.9935   | 0.9967    | 0.9915 | 0.9941 |
+| 0.0763        | 0.3590 | 20000 | 0.0753          | 0.9940   | 0.9990    | 0.9902 | 0.9946 |
+| 0.0753        | 0.3949 | 22000 | 0.0759          | 0.9936   | 0.9968    | 0.9917 | 0.9942 |
+| 0.0749        | 0.4308 | 24000 | 0.0750          | 0.9940   | 0.9980    | 0.9912 | 0.9946 |
+| 0.0755        | 0.4667 | 26000 | 0.0746          | 0.9939   | 0.9974    | 0.9917 | 0.9945 |
+| 0.0755        | 0.5026 | 28000 | 0.0756          | 0.9937   | 0.9967    | 0.9919 | 0.9943 |
+| 0.0753        | 0.5385 | 30000 | 0.0745          | 0.9942   | 0.9979    | 0.9916 | 0.9948 |
+| 0.0791        | 0.5744 | 32000 | 0.0735          | 0.9943   | 0.9991    | 0.9908 | 0.9949 |
+| 0.0789        | 0.6103 | 34000 | 0.0743          | 0.9939   | 0.9972    | 0.9918 | 0.9945 |
+| 0.073         | 0.6462 | 36000 | 0.0741          | 0.9943   | 0.9985    | 0.9913 | 0.9949 |
+| 0.0714        | 0.6821 | 38000 | 0.0738          | 0.9944   | 0.9989    | 0.9911 | 0.9950 |
+| 0.0738        | 0.7180 | 40000 | 0.0733          | 0.9945   | 0.9989    | 0.9912 | 0.9950 |
+| 0.0796        | 0.7539 | 42000 | 0.0732          | 0.9945   | 0.9987    | 0.9915 | 0.9951 |
+| 0.0726        | 0.7898 | 44000 | 0.0734          | 0.9945   | 0.9988    | 0.9914 | 0.9951 |
+| 0.0778        | 0.8257 | 46000 | 0.0733          | 0.9945   | 0.9988    | 0.9913 | 0.9951 |
+| 0.0734        | 0.8616 | 48000 | 0.0733          | 0.9945   | 0.9989    | 0.9914 | 0.9951 |
+| 0.0735        | 0.8975 | 50000 | 0.0732          | 0.9945   | 0.9988    | 0.9914 | 0.9951 |
+| 0.0696        | 0.9334 | 52000 | 0.0732          | 0.9945   | 0.9989    | 0.9913 | 0.9951 |
+| 0.0754        | 0.9693 | 54000 | 0.0732          | 0.9946   | 0.9989    | 0.9913 | 0.9951 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:60aa9aa53a0e36b245c5593dce8d7310a3f52c28f31c6edd7c2c5eabf260bcb5
 size 1115268200

 version https://git-lfs.github.com/spec/v1
+oid sha256:2fb9a4b1e3ebf54cc80a6ce7b7f4fc67bfda81ad31a60c63670443ac2c8fe96c
 size 1115268200

runs/Dec07_22-37-29_elesage-pc/events.out.tfevents.1765165139.elesage-pc.285136.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:849df73a03fa35bc23273b3ee0d017383428418dabb2d859d2d9c3be8a6e6401
+size 59842

runs/Dec07_22-52-34_elesage-pc/events.out.tfevents.1765166044.elesage-pc.293700.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b464652e95ce04360b479c4fdcbbc89c834542c6e71f05fa7b03e8310cb1b8d6
+size 256793

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:61a02fb851b1134ce920c19e3f3a46c6f21e569e6068dc37c5228abe0d993582
 size 5841

 version https://git-lfs.github.com/spec/v1
+oid sha256:1937b4563433df8db307d4f3c09eee2971ef7c9c7980dff81f9d70c61066d17b
 size 5841