Commit a912ba0 by Fadli Aulawi · 1 parent: 610be27

Training in progress epoch 0

Files changed (3)
  1. README.md +16 -54
  2. config.json +0 -1
  3. tf_model.h5 +3 -0
README.md CHANGED
@@ -1,53 +1,22 @@
 ---
 license: apache-2.0
 tags:
-- generated_from_trainer
-datasets:
-- conll2003
-metrics:
-- precision
-- recall
-- f1
-- accuracy
+- generated_from_keras_callback
 model-index:
-- name: bert-finetuned-ner
-  results:
-  - task:
-      name: Token Classification
-      type: token-classification
-    dataset:
-      name: conll2003
-      type: conll2003
-      config: conll2003
-      split: validation
-      args: conll2003
-    metrics:
-    - name: Precision
-      type: precision
-      value: 0.9363531162175566
-    - name: Recall
-      type: recall
-      value: 0.9532144059239314
-    - name: F1
-      type: f1
-      value: 0.9447085313985489
-    - name: Accuracy
-      type: accuracy
-      value: 0.9864308000235474
+- name: fadliaulawi/bert-finetuned-ner
+  results: []
 ---
 
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
+<!-- This model card has been generated automatically according to the information Keras had access to. You should
+probably proofread and complete it, then remove this comment. -->
 
-# bert-finetuned-ner
+# fadliaulawi/bert-finetuned-ner
 
-This model is a fine-tuned version of [bert-base-cased](https://huggingface.co/bert-base-cased) on the conll2003 dataset.
+This model is a fine-tuned version of [bert-base-cased](https://huggingface.co/bert-base-cased) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0635
-- Precision: 0.9364
-- Recall: 0.9532
-- F1: 0.9447
-- Accuracy: 0.9864
+- Train Loss: 0.1753
+- Validation Loss: 0.0690
+- Epoch: 0
 
 ## Model description
 
@@ -66,26 +35,19 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
-- train_batch_size: 8
-- eval_batch_size: 8
-- seed: 42
-- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
-- lr_scheduler_type: linear
-- num_epochs: 3
+- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': 2634, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.01}
+- training_precision: mixed_float16
 
 ### Training results
 
-| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
-|:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
-| 0.0879 | 1.0 | 1756 | 0.0723 | 0.9264 | 0.9382 | 0.9323 | 0.9817 |
-| 0.0337 | 2.0 | 3512 | 0.0637 | 0.9298 | 0.9492 | 0.9394 | 0.9856 |
-| 0.0175 | 3.0 | 5268 | 0.0635 | 0.9364 | 0.9532 | 0.9447 | 0.9864 |
+| Train Loss | Validation Loss | Epoch |
+|:----------:|:---------------:|:-----:|
+| 0.1753 | 0.0690 | 0 |
 
 
 ### Framework versions
 
 - Transformers 4.30.2
-- Pytorch 2.0.1+cu118
+- TensorFlow 2.12.0
 - Datasets 2.13.1
 - Tokenizers 0.13.3
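
Note on the new optimizer entry: the README now records a `PolynomialDecay` learning-rate schedule with `power: 1.0` and `cycle: False`, which amounts to a linear ramp from `2e-05` down to `0.0` over 2634 steps. The sketch below reproduces that arithmetic in plain Python, assuming the usual non-cycling polynomial decay formula. The function name `polynomial_decay` is hypothetical and not part of this repository, and the weight-decay side of `AdamWeightDecay` is not modeled.

```python
def polynomial_decay(step, initial_lr=2e-05, decay_steps=2634, end_lr=0.0, power=1.0):
    """Learning rate at `step` for a non-cycling polynomial decay schedule.

    Default values are copied from the optimizer entry in the README diff.
    This is an illustrative re-implementation, not the training code itself.
    """
    # Without cycling, the step is clamped so the rate stays at end_lr
    # once decay_steps has been reached.
    step = min(step, decay_steps)
    remaining = 1.0 - step / decay_steps
    return (initial_lr - end_lr) * remaining ** power + end_lr


if __name__ == "__main__":
    # Start, midpoint, end of the schedule, and one step past the end.
    for step in (0, 1317, 2634, 3000):
        print(step, polynomial_decay(step))
```

With `power` set to 1.0 the rate halves at the midpoint (step 1317) and reaches zero at step 2634, so any training that continues past that step would run with a zero learning rate under this schedule.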
config.json CHANGED
@@ -40,7 +40,6 @@
   "num_hidden_layers": 12,
   "pad_token_id": 0,
   "position_embedding_type": "absolute",
-  "torch_dtype": "float32",
   "transformers_version": "4.30.2",
   "type_vocab_size": 2,
   "use_cache": true,
tf_model.h5 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:990f9b93adb4b41990d4cf0bcde88f596e26ddb7d89e8718858137f4b851bdda
+size 431179756
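
Note on `tf_model.h5`: the three added lines are a Git LFS pointer, not the weights themselves. Each line is a key followed by a space and a value. The sketch below shows one way such a pointer could be read to recover the hash algorithm, digest, and byte size. The helper `parse_lfs_pointer` is hypothetical and handles only the three keys that appear in this commit.

```python
# Pointer text copied from the tf_model.h5 diff in this commit.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:990f9b93adb4b41990d4cf0bcde88f596e26ddb7d89e8718858137f4b851bdda
size 431179756
"""


def parse_lfs_pointer(text):
    """Split a Git LFS pointer into its fields (illustrative helper)."""
    # Each non-empty line is "<key> <value>"; split on the first space only.
    fields = dict(line.split(" ", 1) for line in text.splitlines() if line)
    # The oid value has the form "<hash algorithm>:<hex digest>".
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


info = parse_lfs_pointer(POINTER)

if __name__ == "__main__":
    print(info["hash_algo"], len(info["digest"]))
    print(f"{info['size'] / 2**20:.1f} MiB")
```

The `size` field is the byte count of the real file stored in LFS, so it can be compared against a downloaded `tf_model.h5` as a quick integrity check before hashing.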