End of training

Files changed (9)

README.md +112 -0
config.json +98 -0
model.safetensors +3 -0
runs/Apr02_10-21-14_282a37b54bd0/events.out.tfevents.1743589275.282a37b54bd0.1070.0 +3 -0
special_tokens_map.json +7 -0
tokenizer.json +0 -0
tokenizer_config.json +56 -0
training_args.bin +3 -0
vocab.txt +0 -0

README.md ADDED

	@@ -0,0 +1,112 @@

+---
+library_name: transformers
+license: apache-2.0
+base_model: bert-base-uncased
+tags:
+- generated_from_trainer
+metrics:
+- accuracy
+- f1
+model-index:
+- name: absa
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# absa
+This model is a fine-tuned version of [bert-base-uncased](https://huggingface.co/bert-base-uncased) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 1.6213
+- Accuracy: 0.7059
+- F1: 0.4389
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 64
+- eval_batch_size: 64
+- seed: 42
+- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- lr_scheduler_type: linear
+- num_epochs: 50
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     |
+|:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|
+| 2.2394        | 1.0   | 149  | 1.5797          | 0.6125   | 0.1657 |
+| 1.505         | 2.0   | 298  | 1.3557          | 0.6370   | 0.1928 |
+| 1.2839        | 3.0   | 447  | 1.2337          | 0.6753   | 0.2603 |
+| 1.1167        | 4.0   | 596  | 1.1764          | 0.6868   | 0.2958 |
+| 0.9805        | 5.0   | 745  | 1.1334          | 0.6925   | 0.3355 |
+| 0.8614        | 6.0   | 894  | 1.1179          | 0.6954   | 0.3472 |
+| 0.7573        | 7.0   | 1043 | 1.1196          | 0.7011   | 0.3701 |
+| 0.6696        | 8.0   | 1192 | 1.1380          | 0.6992   | 0.3821 |
+| 0.5909        | 9.0   | 1341 | 1.1420          | 0.7031   | 0.3892 |
+| 0.5244        | 10.0  | 1490 | 1.1554          | 0.7021   | 0.3972 |
+| 0.4637        | 11.0  | 1639 | 1.1932          | 0.6978   | 0.3915 |
+| 0.4116        | 12.0  | 1788 | 1.2032          | 0.6968   | 0.4044 |
+| 0.3679        | 13.0  | 1937 | 1.2351          | 0.6892   | 0.3857 |
+| 0.3333        | 14.0  | 2086 | 1.2645          | 0.7035   | 0.4071 |
+| 0.303         | 15.0  | 2235 | 1.2936          | 0.6992   | 0.4183 |
+| 0.2743        | 16.0  | 2384 | 1.3310          | 0.7045   | 0.4082 |
+| 0.2504        | 17.0  | 2533 | 1.3357          | 0.7026   | 0.4339 |
+| 0.2295        | 18.0  | 2682 | 1.3274          | 0.7050   | 0.4347 |
+| 0.212         | 19.0  | 2831 | 1.3709          | 0.6949   | 0.4229 |
+| 0.1953        | 20.0  | 2980 | 1.3922          | 0.6949   | 0.4394 |
+| 0.1863        | 21.0  | 3129 | 1.4025          | 0.7011   | 0.4407 |
+| 0.1726        | 22.0  | 3278 | 1.4215          | 0.7040   | 0.4266 |
+| 0.1625        | 23.0  | 3427 | 1.4324          | 0.6887   | 0.4300 |
+| 0.1536        | 24.0  | 3576 | 1.4505          | 0.7040   | 0.4198 |
+| 0.1452        | 25.0  | 3725 | 1.4800          | 0.7064   | 0.4356 |
+| 0.1369        | 26.0  | 3874 | 1.4869          | 0.6988   | 0.4409 |
+| 0.1333        | 27.0  | 4023 | 1.5108          | 0.6978   | 0.4317 |
+| 0.1294        | 28.0  | 4172 | 1.4938          | 0.7021   | 0.4408 |
+| 0.1225        | 29.0  | 4321 | 1.5053          | 0.7021   | 0.4356 |
+| 0.1169        | 30.0  | 4470 | 1.5472          | 0.6959   | 0.4292 |
+| 0.1105        | 31.0  | 4619 | 1.5470          | 0.6988   | 0.4393 |
+| 0.1086        | 32.0  | 4768 | 1.5285          | 0.7007   | 0.4311 |
+| 0.1017        | 33.0  | 4917 | 1.5598          | 0.6949   | 0.4250 |
+| 0.1022        | 34.0  | 5066 | 1.5873          | 0.7059   | 0.4374 |
+| 0.0965        | 35.0  | 5215 | 1.5721          | 0.7045   | 0.4345 |
+| 0.0964        | 36.0  | 5364 | 1.5777          | 0.7055   | 0.4345 |
+| 0.0951        | 37.0  | 5513 | 1.5789          | 0.6940   | 0.4323 |
+| 0.0894        | 38.0  | 5662 | 1.5818          | 0.6954   | 0.4305 |
+| 0.0878        | 39.0  | 5811 | 1.5938          | 0.7083   | 0.4387 |
+| 0.0828        | 40.0  | 5960 | 1.6007          | 0.7064   | 0.4406 |
+| 0.0846        | 41.0  | 6109 | 1.6040          | 0.6983   | 0.4299 |
+| 0.0783        | 42.0  | 6258 | 1.6126          | 0.7055   | 0.4393 |
+| 0.0807        | 43.0  | 6407 | 1.6083          | 0.7002   | 0.4313 |
+| 0.0774        | 44.0  | 6556 | 1.6123          | 0.7059   | 0.4407 |
+| 0.0765        | 45.0  | 6705 | 1.6197          | 0.7055   | 0.4378 |
+| 0.0751        | 46.0  | 6854 | 1.6168          | 0.7055   | 0.4365 |
+| 0.0729        | 47.0  | 7003 | 1.6190          | 0.7074   | 0.4432 |
+| 0.0718        | 48.0  | 7152 | 1.6217          | 0.7055   | 0.4393 |
+| 0.0706        | 49.0  | 7301 | 1.6212          | 0.7069   | 0.4401 |
+| 0.0665        | 50.0  | 7450 | 1.6213          | 0.7059   | 0.4389 |
+### Framework versions
+- Transformers 4.50.2
+- Pytorch 2.6.0+cu124
+- Datasets 3.5.0
+- Tokenizers 0.21.1
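
The training results table in the README above shows validation loss bottoming out early and then climbing while training loss keeps falling. The following is a minimal sketch of how one might locate the best epoch per metric. It uses only a hand-copied subset of rows from the table (the subset includes the rows that hold the extreme values of the full table), and the `Row` type and `best_epoch` helper are hypothetical names introduced for illustration, not part of the training code.

```python
from collections import namedtuple

# Hypothetical container for one row of the training results table.
Row = namedtuple("Row", "epoch train_loss val_loss accuracy f1")

# Subset of rows copied from the table; not the full 50-epoch log.
ROWS = [
    Row(1, 2.2394, 1.5797, 0.6125, 0.1657),
    Row(6, 0.8614, 1.1179, 0.6954, 0.3472),
    Row(7, 0.7573, 1.1196, 0.7011, 0.3701),
    Row(39, 0.0878, 1.5938, 0.7083, 0.4387),
    Row(47, 0.0729, 1.6190, 0.7074, 0.4432),
    Row(50, 0.0665, 1.6213, 0.7059, 0.4389),
]


def best_epoch(rows, metric, higher_is_better):
    """Return the epoch whose value for `metric` is best."""
    pick = max if higher_is_better else min
    return pick(rows, key=lambda r: getattr(r, metric)).epoch


lowest_val_loss_epoch = best_epoch(ROWS, "val_loss", higher_is_better=False)
best_accuracy_epoch = best_epoch(ROWS, "accuracy", higher_is_better=True)
best_f1_epoch = best_epoch(ROWS, "f1", higher_is_better=True)

# Gap between validation and training loss at the final epoch.
final = ROWS[-1]
final_gap = round(final.val_loss - final.train_loss, 4)

print(lowest_val_loss_epoch, best_accuracy_epoch, best_f1_epoch, final_gap)
```

Validation loss is lowest at epoch 6 (1.1179), while accuracy and F1 peak much later (epochs 39 and 47), so which checkpoint counts as "best" depends on the metric chosen. The committed weights correspond to the final epoch, as the headline metrics in the README match the epoch 50 row.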

config.json ADDED

	@@ -0,0 +1,98 @@

+{
+  "architectures": [
+    "BertForSequenceClassification"
+  ],
+  "attention_probs_dropout_prob": 0.1,
+  "classifier_dropout": null,
+  "gradient_checkpointing": false,
+  "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.1,
+  "hidden_size": 768,
+  "id2label": {
+    "0": "LABEL_0",
+    "1": "LABEL_1",
+    "2": "LABEL_2",
+    "3": "LABEL_3",
+    "4": "LABEL_4",
+    "5": "LABEL_5",
+    "6": "LABEL_6",
+    "7": "LABEL_7",
+    "8": "LABEL_8",
+    "9": "LABEL_9",
+    "10": "LABEL_10",
+    "11": "LABEL_11",
+    "12": "LABEL_12",
+    "13": "LABEL_13",
+    "14": "LABEL_14",
+    "15": "LABEL_15",
+    "16": "LABEL_16",
+    "17": "LABEL_17",
+    "18": "LABEL_18",
+    "19": "LABEL_19",
+    "20": "LABEL_20",
+    "21": "LABEL_21",
+    "22": "LABEL_22",
+    "23": "LABEL_23",
+    "24": "LABEL_24",
+    "25": "LABEL_25",
+    "26": "LABEL_26",
+    "27": "LABEL_27",
+    "28": "LABEL_28",
+    "29": "LABEL_29",
+    "30": "LABEL_30",
+    "31": "LABEL_31",
+    "32": "LABEL_32",
+    "33": "LABEL_33"
+  },
+  "initializer_range": 0.02,
+  "intermediate_size": 3072,
+  "label2id": {
+    "LABEL_0": 0,
+    "LABEL_1": 1,
+    "LABEL_10": 10,
+    "LABEL_11": 11,
+    "LABEL_12": 12,
+    "LABEL_13": 13,
+    "LABEL_14": 14,
+    "LABEL_15": 15,
+    "LABEL_16": 16,
+    "LABEL_17": 17,
+    "LABEL_18": 18,
+    "LABEL_19": 19,
+    "LABEL_2": 2,
+    "LABEL_20": 20,
+    "LABEL_21": 21,
+    "LABEL_22": 22,
+    "LABEL_23": 23,
+    "LABEL_24": 24,
+    "LABEL_25": 25,
+    "LABEL_26": 26,
+    "LABEL_27": 27,
+    "LABEL_28": 28,
+    "LABEL_29": 29,
+    "LABEL_3": 3,
+    "LABEL_30": 30,
+    "LABEL_31": 31,
+    "LABEL_32": 32,
+    "LABEL_33": 33,
+    "LABEL_4": 4,
+    "LABEL_5": 5,
+    "LABEL_6": 6,
+    "LABEL_7": 7,
+    "LABEL_8": 8,
+    "LABEL_9": 9
+  },
+  "layer_norm_eps": 1e-12,
+  "max_position_embeddings": 512,
+  "model_type": "bert",
+  "num_attention_heads": 12,
+  "num_hidden_layers": 12,
+  "pad_token_id": 0,
+  "position_embedding_type": "absolute",
+  "problem_type": "single_label_classification",
+  "torch_dtype": "float32",
+  "transformers_version": "4.50.2",
+  "type_vocab_size": 2,
+  "use_cache": true,
+  "vocab_size": 30522
+}
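
The `id2label` block in the config above is listed in numeric order, while `label2id` puts `LABEL_10` directly after `LABEL_1`. A minimal sketch of one way this ordering can arise, assuming the mappings were held in memory with integer ids and then serialized with sorted keys (an assumption about how the file was written, not something the diff states):

```python
import json

NUM_LABELS = 34  # the config above lists LABEL_0 through LABEL_33

# In memory, ids are integers and label names are strings.
id2label = {i: f"LABEL_{i}" for i in range(NUM_LABELS)}
label2id = {name: i for i, name in id2label.items()}

# sort_keys=True sorts integer keys numerically and string keys
# lexicographically; integer keys are then written out as JSON strings.
dumped = json.dumps(
    {"id2label": id2label, "label2id": label2id},
    indent=2,
    sort_keys=True,
)

reloaded = json.loads(dumped)
id_keys = list(reloaded["id2label"])       # "0", "1", "2", ...
label_keys = list(reloaded["label2id"])    # "LABEL_0", "LABEL_1", "LABEL_10", ...

print(id_keys[:4])
print(label_keys[:4])
```

Note that after a round trip through JSON the `id2label` keys are strings, so code reading the raw file needs to convert them back with `int(...)` before indexing by predicted class id.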

model.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:524db4cf3331926733e0d9c3d0580696b8c3823e83387f45675068f93e4faf07
+size 438057080
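
The three lines above are a Git LFS pointer rather than the weights themselves: a spec version, a content hash, and a byte size. A minimal sketch of reading such a pointer follows; `parse_lfs_pointer` is a hypothetical helper written for illustration, and the parameter estimate assumes 4 bytes per value, in line with `"torch_dtype": "float32"` in config.json.

```python
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:524db4cf3331926733e0d9c3d0580696b8c3823e83387f45675068f93e4faf07
size 438057080
"""


def parse_lfs_pointer(text):
    """Split a Git LFS pointer file into its version, hash, and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


pointer = parse_lfs_pointer(POINTER_TEXT)

# Upper bound on the number of float32 values: the file also carries
# a metadata header, so the true parameter count is somewhat lower.
max_float32_values = pointer["size"] // 4

print(pointer["hash_algo"], pointer["size"], max_float32_values)
```

The resulting bound of roughly 109.5 million values is in the range expected for a BERT-base encoder with a small classification head.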

runs/Apr02_10-21-14_282a37b54bd0/events.out.tfevents.1743589275.282a37b54bd0.1070.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6a5b4665df7394ab09f2f166ffa0fa01dbb2a7b2b5d8ca73e6e559a399c3a8ee
+size 35784

special_tokens_map.json ADDED

	@@ -0,0 +1,7 @@

+{
+  "cls_token": "[CLS]",
+  "mask_token": "[MASK]",
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "unk_token": "[UNK]"
+}

tokenizer.json ADDED

The diff for this file is too large to render.

tokenizer_config.json ADDED

	@@ -0,0 +1,56 @@

+{
+  "added_tokens_decoder": {
+    "0": {
+      "content": "[PAD]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "100": {
+      "content": "[UNK]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "101": {
+      "content": "[CLS]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "102": {
+      "content": "[SEP]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "103": {
+      "content": "[MASK]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    }
+  },
+  "clean_up_tokenization_spaces": false,
+  "cls_token": "[CLS]",
+  "do_lower_case": true,
+  "extra_special_tokens": {},
+  "mask_token": "[MASK]",
+  "model_max_length": 512,
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "strip_accents": null,
+  "tokenize_chinese_chars": true,
+  "tokenizer_class": "BertTokenizer",
+  "unk_token": "[UNK]"
+}
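
The special-token ids in the tokenizer config above (`[PAD]` = 0, `[CLS]` = 101, `[SEP]` = 102) determine how a BERT input sequence is framed. The sketch below shows that framing in plain Python for a single sequence or a pair. It is not the tokenizer's own code: `build_inputs` is a hypothetical helper, the word-piece ids passed to it are placeholders, and the diff does not say whether this model was trained on single sentences or on pairs.

```python
PAD_ID, CLS_ID, SEP_ID = 0, 101, 102  # ids from added_tokens_decoder above
MODEL_MAX_LENGTH = 512                # model_max_length above


def build_inputs(first, second=None, pad_to=None):
    """Frame word-piece ids as [CLS] first [SEP] (second [SEP]) plus padding."""
    input_ids = [CLS_ID] + list(first) + [SEP_ID]
    token_type_ids = [0] * len(input_ids)      # segment 0 covers [CLS] .. first [SEP]
    if second is not None:
        tail = list(second) + [SEP_ID]
        input_ids += tail
        token_type_ids += [1] * len(tail)      # segment 1 covers second .. final [SEP]
    if len(input_ids) > MODEL_MAX_LENGTH:
        raise ValueError("sequence exceeds the 512-token limit")
    attention_mask = [1] * len(input_ids)      # 1 marks real tokens
    if pad_to is not None:
        pad = max(0, pad_to - len(input_ids))
        input_ids += [PAD_ID] * pad
        token_type_ids += [0] * pad
        attention_mask += [0] * pad            # 0 marks padding
    return {
        "input_ids": input_ids,
        "token_type_ids": token_type_ids,
        "attention_mask": attention_mask,
    }


# Placeholder ids standing in for two word pieces and one word piece.
encoded = build_inputs([7592, 2088], second=[2833], pad_to=8)
print(encoded)
```

Two segment ids suffice here because config.json sets `type_vocab_size` to 2.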

training_args.bin ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:2cab4c506efe9b2929974f23acb89723b54cc68b908288bdb562510d223bb13f
+size 5304

vocab.txt ADDED

The diff for this file is too large to render.