SofieElving commited on
Commit
6403f53
·
verified ·
1 Parent(s): f5c3244

End of training

Browse files
README.md CHANGED
@@ -19,9 +19,9 @@ should probably proofread and complete it, then remove this comment. -->
19
 
20
  This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
21
  It achieves the following results on the evaluation set:
22
- - Loss: 0.3019
23
- - Accuracy: 0.9193
24
- - F1: 0.9193
25
 
26
  ## Model description
27
 
@@ -41,24 +41,26 @@ More information needed
41
 
42
  The following hyperparameters were used during training:
43
  - learning_rate: 5e-05
44
- - train_batch_size: 32
45
- - eval_batch_size: 32
46
  - seed: 42
47
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
48
  - lr_scheduler_type: linear
49
- - num_epochs: 2
50
 
51
  ### Training results
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 |
54
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|
55
- | No log | 1.0 | 313 | 0.2786 | 0.9145 | 0.9142 |
56
- | 0.1372 | 2.0 | 626 | 0.3019 | 0.9193 | 0.9193 |
 
 
57
 
58
 
59
  ### Framework versions
60
 
61
- - Transformers 4.56.1
62
  - Pytorch 2.8.0+cu126
63
  - Datasets 4.0.0
64
- - Tokenizers 0.22.0
 
19
 
20
  This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
21
  It achieves the following results on the evaluation set:
22
+ - Loss: 0.2655
23
+ - Accuracy: 0.9195
24
+ - F1: 0.9194
25
 
26
  ## Model description
27
 
 
41
 
42
  The following hyperparameters were used during training:
43
  - learning_rate: 5e-05
44
+ - train_batch_size: 64
45
+ - eval_batch_size: 64
46
  - seed: 42
47
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
48
  - lr_scheduler_type: linear
49
+ - num_epochs: 4
50
 
51
  ### Training results
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 |
54
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|
55
+ | No log | 1.0 | 157 | 0.2740 | 0.9079 | 0.9075 |
56
+ | No log | 2.0 | 314 | 0.2392 | 0.9187 | 0.9187 |
57
+ | No log | 3.0 | 471 | 0.2531 | 0.9203 | 0.9202 |
58
+ | 0.236 | 4.0 | 628 | 0.2655 | 0.9195 | 0.9194 |
59
 
60
 
61
  ### Framework versions
62
 
63
+ - Transformers 4.57.1
64
  - Pytorch 2.8.0+cu126
65
  - Datasets 4.0.0
66
+ - Tokenizers 0.22.1
config.json CHANGED
@@ -9,17 +9,17 @@
9
  "dtype": "float32",
10
  "hidden_dim": 3072,
11
  "id2label": {
12
- "0": "World",
13
- "1": "Sports",
14
- "2": "Business",
15
- "3": "Sci/Tech"
16
  },
17
  "initializer_range": 0.02,
18
  "label2id": {
19
- "World": 0,
20
- "Sports": 1,
21
- "Business": 2,
22
- "Sci/Tech": 3
23
  },
24
  "max_position_embeddings": 512,
25
  "model_type": "distilbert",
@@ -31,6 +31,6 @@
31
  "seq_classif_dropout": 0.2,
32
  "sinusoidal_pos_embds": false,
33
  "tie_weights_": true,
34
- "transformers_version": "4.56.1",
35
  "vocab_size": 30522
36
  }
 
9
  "dtype": "float32",
10
  "hidden_dim": 3072,
11
  "id2label": {
12
+ "0": "LABEL_0",
13
+ "1": "LABEL_1",
14
+ "2": "LABEL_2",
15
+ "3": "LABEL_3"
16
  },
17
  "initializer_range": 0.02,
18
  "label2id": {
19
+ "LABEL_0": 0,
20
+ "LABEL_1": 1,
21
+ "LABEL_2": 2,
22
+ "LABEL_3": 3
23
  },
24
  "max_position_embeddings": 512,
25
  "model_type": "distilbert",
 
31
  "seq_classif_dropout": 0.2,
32
  "sinusoidal_pos_embds": false,
33
  "tie_weights_": true,
34
+ "transformers_version": "4.57.1",
35
  "vocab_size": 30522
36
  }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1c3424cb715e3ab6f7bd94103e9dc661a6c71c4932389fcc1c8f1bbd4ca0bd78
3
  size 267838720
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:652f467cffea2f93585b37d7e76b3739839a279175109fb22bc08ab6e34e6d0c
3
  size 267838720
runs/Oct24_10-14-49_dd37e4074e0b/events.out.tfevents.1761301147.dd37e4074e0b.8576.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:acca38c6e70ecc483d6121aed2399ef1cdfbde784361e150af67860ef147c3f8
3
+ size 6402
runs/Oct24_10-14-49_dd37e4074e0b/events.out.tfevents.1761301486.dd37e4074e0b.8576.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ea69f51ec9ae5b81f984e5246e73d438e898a6e5806e717748d7e67306109ed7
3
+ size 6465
runs/Oct24_10-32-25_dd37e4074e0b/events.out.tfevents.1761301951.dd37e4074e0b.8576.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a822b74309d4406397c57e9582e9228d7d67747ce4d41030a5934558a32ef40f
3
+ size 7140
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a47a2c9bf49d1624c0cae95bbcb702a238d64783e730c008185a4f0730dbfa60
3
- size 5777
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e2c5bba8ae96c6cb8ffd6f93dada8dcba8e194185ba4c7cda0eb980a69c820bc
3
+ size 5841