Clemylia committed on
Commit 55dabec · verified · 1 Parent(s): ada0197

Update README.md

Files changed (1)
  1. README.md +8 -79
README.md CHANGED
@@ -5,82 +5,11 @@ tags:
  model-index:
  - name: Tya
  results: []
- ---
-
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
- should probably proofread and complete it, then remove this comment. -->
-
- # Tya
-
- This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
- It achieves the following results on the evaluation set:
- - Loss: 9.0425
-
- ## Model description
-
- More information needed
-
- ## Intended uses & limitations
-
- More information needed
-
- ## Training and evaluation data
-
- More information needed
-
- ## Training procedure
-
- ### Training hyperparameters
-
- The following hyperparameters were used during training:
- - learning_rate: 5e-05
- - train_batch_size: 4
- - eval_batch_size: 4
- - seed: 42
- - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
- - lr_scheduler_type: linear
- - lr_scheduler_warmup_steps: 50
- - num_epochs: 30
-
- ### Training results
-
- | Training Loss | Epoch | Step | Validation Loss |
- |:-------------:|:-----:|:----:|:---------------:|
- | No log | 1.0 | 4 | 10.2476 |
- | No log | 2.0 | 8 | 10.2346 |
- | 10.2242 | 3.0 | 12 | 10.2122 |
- | 10.2242 | 4.0 | 16 | 10.1811 |
- | 10.1914 | 5.0 | 20 | 10.1409 |
- | 10.1914 | 6.0 | 24 | 10.0923 |
- | 10.1914 | 7.0 | 28 | 10.0357 |
- | 10.0766 | 8.0 | 32 | 9.9716 |
- | 10.0766 | 9.0 | 36 | 9.9017 |
- | 9.9563 | 10.0 | 40 | 9.8282 |
- | 9.9563 | 11.0 | 44 | 9.7509 |
- | 9.9563 | 12.0 | 48 | 9.6713 |
- | 9.7542 | 13.0 | 52 | 9.5897 |
- | 9.7542 | 14.0 | 56 | 9.5149 |
- | 9.54 | 15.0 | 60 | 9.4476 |
- | 9.54 | 16.0 | 64 | 9.3876 |
- | 9.54 | 17.0 | 68 | 9.3340 |
- | 9.417 | 18.0 | 72 | 9.2862 |
- | 9.417 | 19.0 | 76 | 9.2429 |
- | 9.2368 | 20.0 | 80 | 9.2049 |
- | 9.2368 | 21.0 | 84 | 9.1715 |
- | 9.2368 | 22.0 | 88 | 9.1426 |
- | 9.1577 | 23.0 | 92 | 9.1179 |
- | 9.1577 | 24.0 | 96 | 9.0972 |
- | 9.1191 | 25.0 | 100 | 9.0803 |
- | 9.1191 | 26.0 | 104 | 9.0667 |
- | 9.1191 | 27.0 | 108 | 9.0563 |
- | 9.0687 | 28.0 | 112 | 9.0488 |
- | 9.0687 | 29.0 | 116 | 9.0443 |
- | 9.021 | 30.0 | 120 | 9.0425 |
-
-
- ### Framework versions
-
- - Transformers 4.57.1
- - Pytorch 2.8.0+cu126
- - Datasets 4.0.0
- - Tokenizers 0.22.1
+ license: mit
+ datasets:
+ - Nora-006/Wtf-spam
+ language:
+ - en
+ - nl
+ pipeline_tag: text-generation
+ ---