irudachirath commited on
Commit
be0b8d4
·
verified ·
1 Parent(s): 6e22073

End of training

Browse files
Files changed (2) hide show
  1. README.md +67 -0
  2. model.safetensors +1 -1
README.md ADDED
@@ -0,0 +1,67 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ language:
4
+ - si
5
+ license: apache-2.0
6
+ base_model: google/mt5-base
7
+ tags:
8
+ - generated_from_trainer
9
+ datasets:
10
+ - SPEAK-ASR/akura-sinhala-dyslexia-corrected
11
+ model-index:
12
+ - name: SPEAK-ASR/mt5-base-si
13
+ results: []
14
+ ---
15
+
16
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
17
+ should probably proofread and complete it, then remove this comment. -->
18
+
19
+ # SPEAK-ASR/mt5-base-si
20
+
21
+ This model is a fine-tuned version of [google/mt5-base](https://huggingface.co/google/mt5-base) on the SPEAK-ASR/akura-sinhala-dyslexia-corrected dataset.
22
+ It achieves the following results on the evaluation set:
23
+ - Loss: 0.0735
24
+
25
+ ## Model description
26
+
27
+ More information needed
28
+
29
+ ## Intended uses & limitations
30
+
31
+ More information needed
32
+
33
+ ## Training and evaluation data
34
+
35
+ More information needed
36
+
37
+ ## Training procedure
38
+
39
+ ### Training hyperparameters
40
+
41
+ The following hyperparameters were used during training:
42
+ - learning_rate: 5e-05
43
+ - train_batch_size: 32
44
+ - eval_batch_size: 32
45
+ - seed: 42
46
+ - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
47
+ - lr_scheduler_type: linear
48
+ - lr_scheduler_warmup_steps: 500
49
+ - num_epochs: 5
50
+
51
+ ### Training results
52
+
53
+ | Training Loss | Epoch | Step | Validation Loss |
54
+ |:-------------:|:------:|:----:|:---------------:|
55
+ | 0.9075 | 0.9921 | 500 | 0.6071 |
56
+ | 0.3976 | 1.9841 | 1000 | 0.1844 |
57
+ | 0.0881 | 2.9762 | 1500 | 0.0740 |
58
+ | 0.2419 | 3.9683 | 2000 | 0.0734 |
59
+ | 0.1458 | 4.9603 | 2500 | 0.0735 |
60
+
61
+
62
+ ### Framework versions
63
+
64
+ - Transformers 4.57.3
65
+ - Pytorch 2.9.0+cu126
66
+ - Datasets 4.4.2
67
+ - Tokenizers 0.22.2
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f593217a3c5e01370692eb71ba7f5ce25c543091bc2f8eeef0292c51c041cc23
3
  size 2329638768
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c30d1f07af811cecdc099e6ddb1299a37820bb25ec3b4e5d7a60ed3c46390d5c
3
  size 2329638768