kiritan commited on
Commit
17bf8cc
·
verified ·
1 Parent(s): 8db9c4d

训练结束,上传最终模型

Browse files
Files changed (5) hide show
  1. README.md +15 -14
  2. all_results.json +4 -4
  3. model.safetensors +1 -1
  4. train_results.json +4 -4
  5. trainer_state.json +0 -0
README.md CHANGED
@@ -16,8 +16,9 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model was trained from scratch on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 1.1515
20
- - Wer: inf
 
21
 
22
  ## Model description
23
 
@@ -50,18 +51,18 @@ The following hyperparameters were used during training:
50
 
51
  ### Training results
52
 
53
- | Training Loss | Epoch | Step | Validation Loss | Wer |
54
- |:-------------:|:-------:|:-----:|:---------------:|:---:|
55
- | 1.834 | 1.6507 | 1000 | 1.9117 | inf |
56
- | 0.9955 | 3.3006 | 2000 | 1.2766 | inf |
57
- | 0.7584 | 4.9513 | 3000 | 1.1081 | inf |
58
- | 0.5473 | 6.6012 | 4000 | 1.0569 | inf |
59
- | 0.4191 | 8.2510 | 5000 | 1.0568 | inf |
60
- | 0.3167 | 9.9017 | 6000 | 1.0609 | inf |
61
- | 0.2322 | 11.5516 | 7000 | 1.0933 | inf |
62
- | 0.1913 | 13.2015 | 8000 | 1.1227 | inf |
63
- | 0.1424 | 14.8522 | 9000 | 1.1418 | inf |
64
- | 0.1615 | 16.5021 | 10000 | 1.1515 | inf |
65
 
66
 
67
  ### Framework versions
 
16
 
17
  This model was trained from scratch on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 1.1519
20
+ - Wer: 96.7751
21
+ - Cer: 49.6435
22
 
23
  ## Model description
24
 
 
51
 
52
  ### Training results
53
 
54
+ | Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
55
+ |:-------------:|:-------:|:-----:|:---------------:|:-------:|:-------:|
56
+ | 1.8339 | 1.6507 | 1000 | 1.9115 | 99.6794 | 93.6205 |
57
+ | 0.9948 | 3.3006 | 2000 | 1.2763 | 97.3503 | 59.4213 |
58
+ | 0.7577 | 4.9513 | 3000 | 1.1085 | 96.6431 | 53.3468 |
59
+ | 0.5464 | 6.6012 | 4000 | 1.0575 | 95.4927 | 48.2507 |
60
+ | 0.4182 | 8.2510 | 5000 | 1.0574 | 96.2376 | 47.2929 |
61
+ | 0.3164 | 9.9017 | 6000 | 1.0616 | 96.3885 | 49.4417 |
62
+ | 0.2319 | 11.5516 | 7000 | 1.0929 | 96.2565 | 49.5535 |
63
+ | 0.1899 | 13.2015 | 8000 | 1.1223 | 97.2749 | 48.6737 |
64
+ | 0.1425 | 14.8522 | 9000 | 1.1422 | 96.6148 | 48.4484 |
65
+ | 0.161 | 16.5021 | 10000 | 1.1519 | 96.7751 | 49.6435 |
66
 
67
 
68
  ### Framework versions
all_results.json CHANGED
@@ -1,8 +1,8 @@
1
  {
2
  "epoch": 16.50206440957886,
3
  "total_flos": 1.949150849531904e+19,
4
- "train_loss": 0.8644906110763549,
5
- "train_runtime": 22776.5193,
6
- "train_samples_per_second": 21.074,
7
- "train_steps_per_second": 0.439
8
  }
 
1
  {
2
  "epoch": 16.50206440957886,
3
  "total_flos": 1.949150849531904e+19,
4
+ "train_loss": 0.8641470371723176,
5
+ "train_runtime": 21435.9254,
6
+ "train_samples_per_second": 22.392,
7
+ "train_steps_per_second": 0.467
8
  }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f1172332b4f4cf2ce6af9c26f710924116b9939c47b97977e6e6ac30b8704561
3
  size 223144592
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a1695d6ce8bd5b7d2355edbbd0898111b45ff3435a1ab08b8365805bc3f3ad00
3
  size 223144592
train_results.json CHANGED
@@ -1,8 +1,8 @@
1
  {
2
  "epoch": 16.50206440957886,
3
  "total_flos": 1.949150849531904e+19,
4
- "train_loss": 0.8644906110763549,
5
- "train_runtime": 22776.5193,
6
- "train_samples_per_second": 21.074,
7
- "train_steps_per_second": 0.439
8
  }
 
1
  {
2
  "epoch": 16.50206440957886,
3
  "total_flos": 1.949150849531904e+19,
4
+ "train_loss": 0.8641470371723176,
5
+ "train_runtime": 21435.9254,
6
+ "train_samples_per_second": 22.392,
7
+ "train_steps_per_second": 0.467
8
  }
trainer_state.json CHANGED
The diff for this file is too large to render. See raw diff