eovdedn commited on
Commit
4df07f4
·
verified ·
1 Parent(s): 81684f0

eovdedn/bert_essay

Browse files
README.md ADDED
@@ -0,0 +1,64 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: distilbert/distilbert-base-uncased
4
+ tags:
5
+ - generated_from_trainer
6
+ metrics:
7
+ - accuracy
8
+ model-index:
9
+ - name: essay_score
10
+ results: []
11
+ ---
12
+
13
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
+ should probably proofread and complete it, then remove this comment. -->
15
+
16
+ # essay_score
17
+
18
+ This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on the None dataset.
19
+ It achieves the following results on the evaluation set:
20
+ - Loss: 0.3566
21
+ - Mse: 0.3566
22
+ - Mae: 0.4629
23
+ - R2: 0.6715
24
+ - Accuracy: 0.2789
25
+
26
+ ## Model description
27
+
28
+ More information needed
29
+
30
+ ## Intended uses & limitations
31
+
32
+ More information needed
33
+
34
+ ## Training and evaluation data
35
+
36
+ More information needed
37
+
38
+ ## Training procedure
39
+
40
+ ### Training hyperparameters
41
+
42
+ The following hyperparameters were used during training:
43
+ - learning_rate: 2e-05
44
+ - train_batch_size: 16
45
+ - eval_batch_size: 16
46
+ - seed: 42
47
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
+ - lr_scheduler_type: linear
49
+ - num_epochs: 2
50
+
51
+ ### Training results
52
+
53
+ | Training Loss | Epoch | Step | Validation Loss | Mse | Mae | R2 | Accuracy |
54
+ |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:--------:|
55
+ | 0.6508 | 1.0 | 866 | 0.4269 | 0.4269 | 0.5067 | 0.6067 | 0.2917 |
56
+ | 0.3457 | 2.0 | 1732 | 0.3566 | 0.3566 | 0.4629 | 0.6715 | 0.2789 |
57
+
58
+
59
+ ### Framework versions
60
+
61
+ - Transformers 4.39.3
62
+ - Pytorch 2.1.2
63
+ - Datasets 2.18.0
64
+ - Tokenizers 0.15.2
config.json ADDED
@@ -0,0 +1,30 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "distilbert/distilbert-base-uncased",
3
+ "activation": "gelu",
4
+ "architectures": [
5
+ "DistilBertForSequenceClassification"
6
+ ],
7
+ "attention_dropout": 0.1,
8
+ "dim": 768,
9
+ "dropout": 0.1,
10
+ "hidden_dim": 3072,
11
+ "id2label": {
12
+ "0": "LABEL_0"
13
+ },
14
+ "initializer_range": 0.02,
15
+ "label2id": {
16
+ "LABEL_0": 0
17
+ },
18
+ "max_position_embeddings": 512,
19
+ "model_type": "distilbert",
20
+ "n_heads": 12,
21
+ "n_layers": 6,
22
+ "pad_token_id": 0,
23
+ "qa_dropout": 0.1,
24
+ "seq_classif_dropout": 0.2,
25
+ "sinusoidal_pos_embds": false,
26
+ "tie_weights_": true,
27
+ "torch_dtype": "float32",
28
+ "transformers_version": "4.39.3",
29
+ "vocab_size": 30522
30
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7176d6c7de9434d684ca581692730e1492885f1e5f9d60ffe3dd8e45bb19d179
3
+ size 267829484
runs/May01_21-43-11_84acb8edde1f/events.out.tfevents.1714599792.84acb8edde1f.33.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2f425660f7f38116e6a034e1253fb587b9ca38c4282621f850a496af9363202c
3
+ size 4525
runs/May02_13-29-13_1b559838f02c/events.out.tfevents.1714656554.1b559838f02c.34.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f3cb5adbdd9e77d0ad7a79c144bee6df606b6dfa504549f5c8ec254b77753934
3
+ size 4603
runs/May02_13-30-54_1b559838f02c/events.out.tfevents.1714656654.1b559838f02c.34.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:79ad7420175a7c39f5bebb26836d0f599830b3fcb2818a55c5abca4c56f4e84a
3
+ size 4603
runs/May02_13-34-16_1b559838f02c/events.out.tfevents.1714656857.1b559838f02c.34.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:931d4a0b8e3eb8cc1e4f101a01d4c04279a4d8741d7643351aa8b1ae51504d3b
3
+ size 5859
runs/May02_15-17-35_f87be7dee12d/events.out.tfevents.1714663056.f87be7dee12d.35.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:968254ae99807b9c282f9a4460227be0749a309f285bc1c97ca2d6ff4ad736dd
3
+ size 6484
training_args.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f724c8de9170f4847cb1bf566886aa65f6d1c1b19bb311c6a7bedc0c4928eca2
3
+ size 4920