shellypeng commited on
Commit
749784f
·
verified ·
1 Parent(s): 62fb3be

Training complete

Browse files
Files changed (2) hide show
  1. README.md +74 -0
  2. model.safetensors +1 -1
README.md ADDED
@@ -0,0 +1,74 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ license: apache-2.0
4
+ base_model: distilbert/distilbert-base-cased
5
+ tags:
6
+ - generated_from_trainer
7
+ metrics:
8
+ - precision
9
+ - recall
10
+ - f1
11
+ - accuracy
12
+ model-index:
13
+ - name: distillbert-base-cased-finetuned-ner3
14
+ results: []
15
+ ---
16
+
17
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
18
+ should probably proofread and complete it, then remove this comment. -->
19
+
20
+ # distillbert-base-cased-finetuned-ner3
21
+
22
+ This model is a fine-tuned version of [distilbert/distilbert-base-cased](https://huggingface.co/distilbert/distilbert-base-cased) on an unknown dataset.
23
+ It achieves the following results on the evaluation set:
24
+ - Loss: 0.1370
25
+ - Precision: 0.7851
26
+ - Recall: 0.8195
27
+ - F1: 0.8020
28
+ - Accuracy: 0.9579
29
+
30
+ ## Model description
31
+
32
+ More information needed
33
+
34
+ ## Intended uses & limitations
35
+
36
+ More information needed
37
+
38
+ ## Training and evaluation data
39
+
40
+ More information needed
41
+
42
+ ## Training procedure
43
+
44
+ ### Training hyperparameters
45
+
46
+ The following hyperparameters were used during training:
47
+ - learning_rate: 2e-05
48
+ - train_batch_size: 8
49
+ - eval_batch_size: 8
50
+ - seed: 42
51
+ - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
52
+ - lr_scheduler_type: cosine
53
+ - lr_scheduler_warmup_ratio: 0.1
54
+ - num_epochs: 7
55
+
56
+ ### Training results
57
+
58
+ | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
59
+ |:-------------:|:-----:|:-----:|:---------------:|:---------:|:------:|:------:|:--------:|
60
+ | 0.1896 | 1.0 | 4750 | 0.1830 | 0.7113 | 0.7623 | 0.7359 | 0.9457 |
61
+ | 0.1468 | 2.0 | 9500 | 0.1514 | 0.7723 | 0.7932 | 0.7826 | 0.9532 |
62
+ | 0.1321 | 3.0 | 14250 | 0.1421 | 0.7700 | 0.8050 | 0.7871 | 0.9557 |
63
+ | 0.124 | 4.0 | 19000 | 0.1369 | 0.7771 | 0.8102 | 0.7933 | 0.9574 |
64
+ | 0.1243 | 5.0 | 23750 | 0.1380 | 0.7815 | 0.8152 | 0.798 | 0.9572 |
65
+ | 0.1129 | 6.0 | 28500 | 0.1371 | 0.7862 | 0.8188 | 0.8022 | 0.9577 |
66
+ | 0.1138 | 7.0 | 33250 | 0.1370 | 0.7851 | 0.8195 | 0.8020 | 0.9579 |
67
+
68
+
69
+ ### Framework versions
70
+
71
+ - Transformers 4.50.1
72
+ - Pytorch 2.5.1+cu124
73
+ - Datasets 3.4.1
74
+ - Tokenizers 0.21.1
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ff2f39cdfa2784a0eb5f657e9b6c41e3ccc98bc5a8ecade43724d084179812dd
3
  size 260828276
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f23b05264b6761378e9492396b2c52232d41059cb40c3efef459c1c0ec5d415d
3
  size 260828276