End of training

Files changed (15)

README.md +85 -0
config.json +40 -0
logs/events.out.tfevents.1713773368.afa6ace5267c.567.0 +3 -0
logs/events.out.tfevents.1713773617.afa6ace5267c.567.1 +3 -0
logs/events.out.tfevents.1713773731.afa6ace5267c.567.2 +3 -0
logs/events.out.tfevents.1713775242.afa6ace5267c.567.3 +3 -0
logs/events.out.tfevents.1713775712.afa6ace5267c.567.4 +3 -0
logs/events.out.tfevents.1713775828.afa6ace5267c.567.5 +3 -0
logs/events.out.tfevents.1713776185.afa6ace5267c.567.6 +3 -0
logs/events.out.tfevents.1713777038.afa6ace5267c.567.7 +3 -0
logs/events.out.tfevents.1713777131.afa6ace5267c.567.8 +3 -0
logs/events.out.tfevents.1713777274.afa6ace5267c.567.9 +3 -0
logs/events.out.tfevents.1713778184.afa6ace5267c.567.10 +3 -0
model.safetensors +3 -0
training_args.bin +3 -0

README.md ADDED

	@@ -0,0 +1,85 @@

+---
+license: apache-2.0
+base_model: projecte-aina/roberta-base-ca-v2-cawikitc
+tags:
+- generated_from_trainer
+metrics:
+- accuracy
+- precision
+- recall
+- f1
+model-index:
+- name: stocks
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# stocks
+This model is a fine-tuned version of [projecte-aina/roberta-base-ca-v2-cawikitc](https://huggingface.co/projecte-aina/roberta-base-ca-v2-cawikitc) on an unspecified dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.6639
+- Accuracy: 0.7637
+- Precision: 0.5304
+- Recall: 0.4710
+- F1: 0.4778
+- Ratio: 0.7903
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 10
+- eval_batch_size: 2
+- seed: 42
+- gradient_accumulation_steps: 2
+- total_train_batch_size: 20
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_ratio: 0.06
+- lr_scheduler_warmup_steps: 4
+- num_epochs: 1
+- label_smoothing_factor: 0.1
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1     | Ratio  |
+|:-------------:|:-----:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|:------:|
+| 0.8468        | 0.07  | 10   | 0.8350          | 0.6185   | 0.3093    | 0.5    | 0.3822 | 1.0    |
+| 0.7922        | 0.14  | 20   | 0.8314          | 0.6185   | 0.3093    | 0.5    | 0.3822 | 1.0    |
+| 0.8005        | 0.21  | 30   | 0.8059          | 0.6169   | 0.2060    | 0.3325 | 0.2544 | 0.9984 |
+| 0.8038        | 0.28  | 40   | 0.7907          | 0.6185   | 0.3093    | 0.5    | 0.3822 | 1.0    |
+| 0.7846        | 0.34  | 50   | 0.8060          | 0.6185   | 0.3093    | 0.5    | 0.3822 | 1.0    |
+| 0.7539        | 0.41  | 60   | 0.7573          | 0.6274   | 0.5024    | 0.3422 | 0.2763 | 0.9847 |
+| 0.725         | 0.48  | 70   | 0.8018          | 0.7435   | 0.4978    | 0.4906 | 0.4940 | 0.5847 |
+| 0.6842        | 0.55  | 80   | 0.8437          | 0.7419   | 0.5035    | 0.4795 | 0.4901 | 0.6444 |
+| 0.7415        | 0.62  | 90   | 0.7783          | 0.7468   | 0.5006    | 0.4832 | 0.4909 | 0.6444 |
+| 0.6303        | 0.69  | 100  | 0.7194          | 0.7452   | 0.5009    | 0.4723 | 0.4808 | 0.7040 |
+| 0.6844        | 0.76  | 110  | 0.7137          | 0.7702   | 0.5106    | 0.4996 | 0.5044 | 0.6468 |
+| 0.699         | 0.83  | 120  | 0.6666          | 0.7806   | 0.5159    | 0.5039 | 0.5084 | 0.6653 |
+| 0.7229        | 0.9   | 130  | 0.6636          | 0.7629   | 0.5233    | 0.4730 | 0.4799 | 0.775  |
+| 0.6555        | 0.97  | 140  | 0.6646          | 0.7637   | 0.5312    | 0.4707 | 0.4775 | 0.7919 |
+### Framework versions
+- Transformers 4.38.2
+- PyTorch 2.1.0+cu121
+- Datasets 2.19.0
+- Tokenizers 0.15.2
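
Note on the training results table above: the rows with Accuracy 0.6185, Precision 0.3093, Recall 0.5 and F1 0.3822 are consistent with macro-averaged metrics over two classes when every example is predicted as the majority class. The card does not say how the metrics were computed, so this is an interpretation, not a documented fact. The sketch below checks the arithmetic in plain Python. The helper `macro_scores` and the class counts (767 of 1240) are hypothetical, chosen only so that the accuracy rounds to 0.6185; the real evaluation set size is not stated.

```python
from collections import Counter


def macro_scores(y_true, y_pred, labels):
    """Return (accuracy, macro precision, macro recall, macro F1).

    A class with no predictions or no true examples contributes 0 to the
    corresponding average (zero division is treated as 0).
    """
    pairs = Counter(zip(y_true, y_pred))
    precisions, recalls, f1s = [], [], []
    for label in labels:
        tp = pairs[(label, label)]
        predicted = sum(n for (_, p), n in pairs.items() if p == label)
        actual = sum(n for (t, _), n in pairs.items() if t == label)
        precision = tp / predicted if predicted else 0.0
        recall = tp / actual if actual else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        precisions.append(precision)
        recalls.append(recall)
        f1s.append(f1)
    accuracy = sum(n for (t, p), n in pairs.items() if t == p) / len(y_true)
    k = len(labels)
    return accuracy, sum(precisions) / k, sum(recalls) / k, sum(f1s) / k


# Hypothetical evaluation set: 767 majority-class examples out of 1240.
y_true = [0] * 767 + [1] * 473
# A degenerate classifier that always predicts the majority class.
y_pred = [0] * len(y_true)

accuracy, precision, recall, f1 = macro_scores(y_true, y_pred, labels=[0, 1])
print(round(accuracy, 4), round(precision, 4), round(recall, 4), round(f1, 4))
```

Under these assumptions the macro precision is half the accuracy and the macro recall is exactly 0.5, which is why the early rows are identical: the model had not yet started predicting the minority class.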

config.json ADDED

	@@ -0,0 +1,40 @@

+{
+  "_name_or_path": "projecte-aina/roberta-base-ca-v2-cawikitc",
+  "architectures": [
+    "RobertaForSequenceClassification"
+  ],
+  "attention_probs_dropout_prob": 0.1,
+  "bos_token_id": 0,
+  "classifier_dropout": null,
+  "eos_token_id": 2,
+  "finetuning_task": "mnli",
+  "gradient_checkpointing": false,
+  "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.1,
+  "hidden_size": 768,
+  "id2label": {
+    "0": "entailment",
+    "1": "neutral",
+    "2": "contradiction"
+  },
+  "initializer_range": 0.02,
+  "intermediate_size": 3072,
+  "label2id": {
+    "contradiction": 2,
+    "entailment": 0,
+    "neutral": 1
+  },
+  "layer_norm_eps": 1e-05,
+  "max_position_embeddings": 514,
+  "model_type": "roberta",
+  "num_attention_heads": 12,
+  "num_hidden_layers": 12,
+  "pad_token_id": 1,
+  "position_embedding_type": "absolute",
+  "problem_type": "single_label_classification",
+  "torch_dtype": "float32",
+  "transformers_version": "4.38.2",
+  "type_vocab_size": 1,
+  "use_cache": true,
+  "vocab_size": 52253
+}
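
Note on the config above: `id2label` carries three MNLI-style labels (`entailment`, `neutral`, `contradiction`) and `finetuning_task` is `mnli`. These look inherited from the base checkpoint, though the commit does not confirm this or say whether the labels are meaningful for the `stocks` task. As a minimal sketch of how such a mapping is typically applied under `single_label_classification`, the snippet below picks the highest-scoring logit and looks up its label. The function `predicted_label` and the logit values are made up for illustration.

```python
import json

# Fragment copied from the config.json diff above.
config_fragment = json.loads("""
{
  "id2label": {"0": "entailment", "1": "neutral", "2": "contradiction"},
  "problem_type": "single_label_classification"
}
""")

# JSON object keys are strings, so convert them to integer class indices.
id2label = {int(k): v for k, v in config_fragment["id2label"].items()}


def predicted_label(logits):
    """Return the label whose logit is largest (single-label argmax)."""
    best = max(range(len(logits)), key=lambda i: logits[i])
    return id2label[best]


example_logits = [0.2, 1.7, -0.4]  # hypothetical model output for one input
print(predicted_label(example_logits))
```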

logs/events.out.tfevents.1713773368.afa6ace5267c.567.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:15e6b7dce40c2a1dd37e8d7af36c86dad7ec3dfdbfa1690597f1321f4846b255
+size 4861

logs/events.out.tfevents.1713773617.afa6ace5267c.567.1 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:94ed70b64d00aadb0ac05cfa690d35f91f05d5436377aef763e0142c527bd41c
+size 532

logs/events.out.tfevents.1713773731.afa6ace5267c.567.2 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:081f871aeca783bf0b10b3038889288cfa22ad280f483d44ac91f594835194d4
+size 16535

logs/events.out.tfevents.1713775242.afa6ace5267c.567.3 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5cd3d49221acc9401642e6fe159553cd42ca4b41e5c93f7e062f3b08fb8187ca
+size 8591

logs/events.out.tfevents.1713775712.afa6ace5267c.567.4 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:cf0451e55d6c04d4b115dc01a0a5780f259e91ec93576d8f1609a36e12364d8f
+size 5514

logs/events.out.tfevents.1713775828.afa6ace5267c.567.5 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:69450dff37a5b2457e9478fb21eeef96858739e39908d2832e2e6cbbee8e236a
+size 7668

logs/events.out.tfevents.1713776185.afa6ace5267c.567.6 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:de758e18d2a63c686f385e7a78ef226020770bee729c8025724ef64fe2f4244f
+size 8594

logs/events.out.tfevents.1713777038.afa6ace5267c.567.7 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:2e3e427c19769498283e4b57d93368a6b62cdcf5510e4e2af2b04390f993a25e
+size 5515

logs/events.out.tfevents.1713777131.afa6ace5267c.567.8 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:7d92f9a49c62c39ac8362d53d034bb4d79c6c263bd280516b2a218424245881f
+size 5305

logs/events.out.tfevents.1713777274.afa6ace5267c.567.9 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:2cf14b55e750f7c3f490c24d34ab169d63296faf49c66b4d0c12fb80a749bc7f
+size 20719

logs/events.out.tfevents.1713778184.afa6ace5267c.567.10 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:3b49a91822e938777b16bdfb4ea8eb7ffe882466e3f333cf42fe50a220da4a5a
+size 609

model.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a9a2a13463c68625927212ffa838a422a11db7f637648c2271071dcdb75528cf
+size 504723036

training_args.bin ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:015f38b8a13092dda89b76579d5a96d7c8db09dbd8504520133689881831c51d
+size 4856
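
Note on the binary files above: the log, weight and training-argument files appear in the diff as Git LFS pointers (three `key value` lines: `version`, `oid`, `size`) rather than as their actual contents. The sketch below parses one such pointer in plain Python; `parse_lfs_pointer` is a hypothetical helper, not part of any Git LFS tooling, and it assumes a well-formed pointer with exactly these three keys.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer into version, hash algorithm, digest and size."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", split on the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "oid_algo": algo,
        "oid": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# Pointer for model.safetensors, copied from the diff above.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:a9a2a13463c68625927212ffa838a422a11db7f637648c2271071dcdb75528cf
size 504723036
"""

info = parse_lfs_pointer(pointer)
size_mib = info["size"] / 2**20
print(info["oid_algo"], info["size"], f"{size_mib:.1f} MiB")
```

The `size` field gives the size of the stored object, so the weights file is roughly 481 MiB, while the pointer itself is only three lines (hence `+3 -0` in the file list).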