grexit-d commited on
Commit
b896439
·
verified ·
1 Parent(s): 4749a49

End of training

Browse files
Files changed (2) hide show
  1. README.md +70 -0
  2. model.safetensors +1 -1
README.md ADDED
@@ -0,0 +1,70 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ base_model: Musixmatch/umberto-commoncrawl-cased-v1
4
+ tags:
5
+ - generated_from_trainer
6
+ metrics:
7
+ - accuracy
8
+ - precision
9
+ - recall
10
+ - f1
11
+ model-index:
12
+ - name: multipride_umberto_sent_label
13
+ results: []
14
+ ---
15
+
16
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
17
+ should probably proofread and complete it, then remove this comment. -->
18
+
19
+ # multipride_umberto_sent_label
20
+
21
+ This model is a fine-tuned version of [Musixmatch/umberto-commoncrawl-cased-v1](https://huggingface.co/Musixmatch/umberto-commoncrawl-cased-v1) on the None dataset.
22
+ It achieves the following results on the evaluation set:
23
+ - Loss: 0.2326
24
+ - Accuracy: 0.9325
25
+ - Precision: 0.8816
26
+ - Recall: 0.9090
27
+ - F1: 0.8943
28
+
29
+ ## Model description
30
+
31
+ More information needed
32
+
33
+ ## Intended uses & limitations
34
+
35
+ More information needed
36
+
37
+ ## Training and evaluation data
38
+
39
+ More information needed
40
+
41
+ ## Training procedure
42
+
43
+ ### Training hyperparameters
44
+
45
+ The following hyperparameters were used during training:
46
+ - learning_rate: 5e-05
47
+ - train_batch_size: 8
48
+ - eval_batch_size: 8
49
+ - seed: 42
50
+ - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
51
+ - lr_scheduler_type: linear
52
+ - num_epochs: 5
53
+
54
+ ### Training results
55
+
56
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 |
57
+ |:-------------:|:-----:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|
58
+ | 0.3403 | 1.0 | 95 | 0.2707 | 0.9141 | 0.9285 | 0.7865 | 0.8346 |
59
+ | 0.1875 | 2.0 | 190 | 0.2311 | 0.9202 | 0.8902 | 0.8397 | 0.8618 |
60
+ | 0.1657 | 3.0 | 285 | 0.2787 | 0.9325 | 0.8736 | 0.9337 | 0.8989 |
61
+ | 0.1117 | 4.0 | 380 | 0.2278 | 0.9325 | 0.9026 | 0.8719 | 0.8862 |
62
+ | 0.0448 | 5.0 | 475 | 0.2326 | 0.9325 | 0.8816 | 0.9090 | 0.8943 |
63
+
64
+
65
+ ### Framework versions
66
+
67
+ - Transformers 4.57.2
68
+ - Pytorch 2.9.0+cu126
69
+ - Datasets 4.0.0
70
+ - Tokenizers 0.22.1
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:858231b79843e333b6d75460d6ebfaf091739a72a75cfce01324c839837ed85b
3
  size 442518104
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2096424239c5b8b611b193111e5ec1e4ce219b37fb1d610cf69fa848c07ca9e5
3
  size 442518104