yale-cultural-heritage
/

name-parser-model

@@ -1,7 +1,7 @@
 ---
 library_name: transformers
 license: apache-2.0
-base_model: t5-base
 tags:
 - generated_from_trainer
 metrics:
@@ -16,10 +16,10 @@ should probably proofread and complete it, then remove this comment. -->
 # name-parser-model
-This model is a fine-tuned version of [t5-base](https://huggingface.co/t5-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0113
-- Accuracy: 0.9924
 ## Model description
@@ -47,63 +47,23 @@ The following hyperparameters were used during training:
 - optimizer: Use adafactor and the args are:
 No additional optimizer arguments
 - lr_scheduler_type: linear
-- training_steps: 1000
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch   | Step | Validation Loss | Accuracy |
-|:-------------:|:-------:|:----:|:---------------:|:--------:|
-| No log        | 1.2540  | 20   | 5.6788          | 0.4187   |
-| No log        | 2.5079  | 40   | 0.6299          | 0.5176   |
-| No log        | 3.7619  | 60   | 0.2964          | 0.6436   |
-| No log        | 5.0     | 80   | 0.1270          | 0.8712   |
-| No log        | 6.2540  | 100  | 0.0709          | 0.9502   |
-| No log        | 7.5079  | 120  | 0.0490          | 0.9617   |
-| No log        | 8.7619  | 140  | 0.0349          | 0.9704   |
-| No log        | 10.0    | 160  | 0.0259          | 0.9855   |
-| No log        | 11.2540 | 180  | 0.0222          | 0.9862   |
-| No log        | 12.5079 | 200  | 0.0206          | 0.9873   |
-| No log        | 13.7619 | 220  | 0.0194          | 0.9872   |
-| No log        | 15.0    | 240  | 0.0177          | 0.9885   |
-| No log        | 16.2540 | 260  | 0.0168          | 0.9888   |
-| No log        | 17.5079 | 280  | 0.0161          | 0.9898   |
-| No log        | 18.7619 | 300  | 0.0157          | 0.9899   |
-| No log        | 20.0    | 320  | 0.0149          | 0.9903   |
-| No log        | 21.2540 | 340  | 0.0148          | 0.9908   |
-| No log        | 22.5079 | 360  | 0.0143          | 0.9903   |
-| No log        | 23.7619 | 380  | 0.0138          | 0.9902   |
-| No log        | 25.0    | 400  | 0.0137          | 0.9904   |
-| No log        | 26.2540 | 420  | 0.0134          | 0.9907   |
-| No log        | 27.5079 | 440  | 0.0131          | 0.9911   |
-| No log        | 28.7619 | 460  | 0.0130          | 0.9913   |
-| No log        | 30.0    | 480  | 0.0128          | 0.9915   |
-| 0.6611        | 31.2540 | 500  | 0.0128          | 0.9913   |
-| 0.6611        | 32.5079 | 520  | 0.0124          | 0.9915   |
-| 0.6611        | 33.7619 | 540  | 0.0125          | 0.9913   |
-| 0.6611        | 35.0    | 560  | 0.0123          | 0.9915   |
-| 0.6611        | 36.2540 | 580  | 0.0122          | 0.9913   |
-| 0.6611        | 37.5079 | 600  | 0.0121          | 0.9915   |
-| 0.6611        | 38.7619 | 620  | 0.0121          | 0.9916   |
-| 0.6611        | 40.0    | 640  | 0.0120          | 0.9918   |
-| 0.6611        | 41.2540 | 660  | 0.0118          | 0.9918   |
-| 0.6611        | 42.5079 | 680  | 0.0118          | 0.9918   |
-| 0.6611        | 43.7619 | 700  | 0.0117          | 0.9920   |
-| 0.6611        | 45.0    | 720  | 0.0115          | 0.9922   |
-| 0.6611        | 46.2540 | 740  | 0.0114          | 0.9924   |
-| 0.6611        | 47.5079 | 760  | 0.0114          | 0.9921   |
-| 0.6611        | 48.7619 | 780  | 0.0115          | 0.9922   |
-| 0.6611        | 50.0    | 800  | 0.0115          | 0.9921   |
-| 0.6611        | 51.2540 | 820  | 0.0115          | 0.9921   |
-| 0.6611        | 52.5079 | 840  | 0.0114          | 0.9922   |
-| 0.6611        | 53.7619 | 860  | 0.0114          | 0.9923   |
-| 0.6611        | 55.0    | 880  | 0.0114          | 0.9925   |
-| 0.6611        | 56.2540 | 900  | 0.0113          | 0.9923   |
-| 0.6611        | 57.5079 | 920  | 0.0113          | 0.9923   |
-| 0.6611        | 58.7619 | 940  | 0.0113          | 0.9924   |
-| 0.6611        | 60.0    | 960  | 0.0113          | 0.9924   |
-| 0.6611        | 61.2540 | 980  | 0.0113          | 0.9924   |
-| 0.0127        | 62.5079 | 1000 | 0.0113          | 0.9924   |
 ### Framework versions

 ---
 library_name: transformers
 license: apache-2.0
+base_model: yale-cultural-heritage/name-parser-model
 tags:
 - generated_from_trainer
 metrics:
 # name-parser-model
+This model is a fine-tuned version of [yale-cultural-heritage/name-parser-model](https://huggingface.co/yale-cultural-heritage/name-parser-model) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0332
+- Accuracy: 0.9921
 ## Model description
 - optimizer: Use adafactor and the args are:
 No additional optimizer arguments
 - lr_scheduler_type: linear
+- training_steps: 10000
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch   | Step  | Validation Loss | Accuracy |
+|:-------------:|:-------:|:-----:|:---------------:|:--------:|
+| 0.041         | 3.1952  | 1000  | 0.0352          | 0.9912   |
+| 0.0369        | 6.3904  | 2000  | 0.0345          | 0.9915   |
+| 0.0358        | 9.5856  | 3000  | 0.0336          | 0.9917   |
+| 0.0349        | 12.7808 | 4000  | 0.0333          | 0.9919   |
+| 0.0337        | 15.9760 | 5000  | 0.0331          | 0.9920   |
+| 0.0332        | 19.1696 | 6000  | 0.0334          | 0.9919   |
+| 0.0328        | 22.3648 | 7000  | 0.0332          | 0.9921   |
+| 0.0323        | 25.56   | 8000  | 0.0333          | 0.9921   |
+| 0.0318        | 28.7552 | 9000  | 0.0333          | 0.9921   |
+| 0.032         | 31.9504 | 10000 | 0.0332          | 0.9921   |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6b6d1e98e5d68c56f5cfe40d34b8161c0fd3a135415d24df9c2abce9d05791fd
 size 893005608

 version https://git-lfs.github.com/spec/v1
+oid sha256:2c76fbc12bcc6f5b0364f8128097328350695117abed032fb801d14c5161fbc3
 size 893005608