Training in progress, step 1000
Files changed:
- README.md +27 -25
- config.json +1 -1
- model.safetensors +1 -1
- training_args.bin +1 -1

README.md CHANGED
@@ -1,33 +1,23 @@
 ---
-license: apache-2.0
-base_model: zainulhakim/240626-wav2vec2-ASR_Global
 tags:
 - generated_from_trainer
 metrics:
 - wer
 model-index:
-- name:
+- name: fl_asr_speech_recognition
   results: []
 ---

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->

-
-
-
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/projektarbeit_dalim/huggingface/runs/wobjnlgw)
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/projektarbeit_dalim/huggingface/runs/wobjnlgw)
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/projektarbeit_dalim/huggingface/runs/wobjnlgw)
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/projektarbeit_dalim/huggingface/runs/wobjnlgw)
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/projektarbeit_dalim/huggingface/runs/theziq77)
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/projektarbeit_dalim/huggingface/runs/s1l6rff5)
-# 240801-wav2vec2-ASR-Global-All-Clients
-
-This model is a fine-tuned version of [zainulhakim/240626-wav2vec2-ASR_Global](https://huggingface.co/zainulhakim/240626-wav2vec2-ASR_Global) on the None dataset.
+# fl_asr_speech_recognition
+
+This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss:
-- Wer:
+- Loss: 0.2947
+- Wer: 0.1449
+- Cer: 0.0451

 ## Model description

@@ -46,28 +36,40 @@ More information needed
 ### Training hyperparameters

 The following hyperparameters were used during training:
-- learning_rate: 0.
+- learning_rate: 0.0001
 - train_batch_size: 4
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- num_epochs:
+- num_epochs: 200
 - mixed_precision_training: Native AMP

 ### Training results

-| Training Loss | Epoch
-|
-|
-|
-|
+| Training Loss | Epoch    | Step  | Validation Loss | Wer    | Cer    |
+|:-------------:|:--------:|:-----:|:---------------:|:------:|:------:|
+| 0.625         | 12.6582  | 1000  | 0.4832          | 0.5625 | 0.1090 |
+| 0.3037        | 25.3165  | 2000  | 0.3879          | 0.3665 | 0.0686 |
+| 0.2127        | 37.9747  | 3000  | 0.4096          | 0.2926 | 0.0617 |
+| 0.1767        | 50.6329  | 4000  | 0.3967          | 0.25   | 0.0552 |
+| 0.1238        | 63.2911  | 5000  | 0.3024          | 0.2273 | 0.0529 |
+| 0.0868        | 75.9494  | 6000  | 0.3768          | 0.2330 | 0.0487 |
+| 0.0823        | 88.6076  | 7000  | 0.2742          | 0.2244 | 0.0420 |
+| 0.0696        | 101.2658 | 8000  | 0.2792          | 0.2074 | 0.0383 |
+| 0.0496        | 113.9241 | 9000  | 0.3362          | 0.1591 | 0.0359 |
+| 0.0413        | 126.5823 | 10000 | 0.3061          | 0.1562 | 0.0400 |
+| 0.0286        | 139.2405 | 11000 | 0.3264          | 0.1591 | 0.0406 |
+| 0.0294        | 151.8987 | 12000 | 0.3046          | 0.1648 | 0.0424 |
+| 0.0183        | 164.5570 | 13000 | 0.3083          | 0.1506 | 0.0400 |
+| 0.0159        | 177.2152 | 14000 | 0.2947          | 0.1449 | 0.0451 |
+| 0.009         | 189.8734 | 15000 | 0.3198          | 0.1477 | 0.0411 |


 ### Framework versions

-- Transformers 4.
+- Transformers 4.43.3
 - Pytorch 2.3.1+cu121
 - Datasets 2.19.2
 - Tokenizers 0.19.1
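The hyperparameter bullets in the updated card map one-to-one onto `transformers` `TrainingArguments` fields. A minimal sketch of how this run might be reconfigured; the output directory, evaluation cadence, and save cadence are assumptions (the diff does not include the training script), and the listed Adam betas/epsilon are simply the library defaults:

```python
# Sketch: TrainingArguments matching the hyperparameters listed in the card.
# output_dir and the steps-based eval/save cadence are assumptions inferred
# from the results table (one eval row per 1000 steps).
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="fl_asr_speech_recognition",  # assumed
    learning_rate=1e-4,                 # learning_rate: 0.0001
    per_device_train_batch_size=4,      # train_batch_size: 4
    per_device_eval_batch_size=8,       # eval_batch_size: 8
    seed=42,                            # seed: 42
    lr_scheduler_type="linear",         # lr_scheduler_type: linear
    warmup_steps=500,                   # lr_scheduler_warmup_steps: 500
    num_train_epochs=200,               # num_epochs: 200
    fp16=True,                          # mixed_precision_training: Native AMP
    eval_strategy="steps",              # assumed: evaluate every 1000 steps
    eval_steps=1000,
    save_steps=1000,
    # Adam betas=(0.9, 0.999) and epsilon=1e-08 are the defaults, so no
    # explicit optimizer arguments are needed.
)
```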
config.json CHANGED
@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "
+  "_name_or_path": "240801-wav2vec2-ASR-Global-All-Clients",
   "activation_dropout": 0.0,
   "adapter_attn_dim": null,
   "adapter_kernel_size": 3,
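The `_name_or_path` update records the checkpoint this run continues from; the surrounding fields (`activation_dropout`, `adapter_attn_dim`) mark this as a wav2vec 2.0 CTC config. For reference, a minimal sketch of greedy CTC inference with a checkpoint like this one; the repo id and audio file below are placeholders, not taken from the diff:

```python
# Sketch: greedy CTC decoding with a wav2vec 2.0 ASR checkpoint.
# "your-namespace/fl_asr_speech_recognition" and "sample.wav" are placeholders.
import torch
import librosa
from transformers import Wav2Vec2Processor, Wav2Vec2ForCTC

model_id = "your-namespace/fl_asr_speech_recognition"  # placeholder repo id
processor = Wav2Vec2Processor.from_pretrained(model_id)
model = Wav2Vec2ForCTC.from_pretrained(model_id).eval()

# wav2vec 2.0 expects 16 kHz mono audio.
speech, _ = librosa.load("sample.wav", sr=16_000, mono=True)
inputs = processor(speech, sampling_rate=16_000, return_tensors="pt")

with torch.no_grad():
    logits = model(inputs.input_values).logits  # (batch, time, vocab)
pred_ids = torch.argmax(logits, dim=-1)         # greedy CTC path
print(processor.batch_decode(pred_ids)[0])      # collapses repeats and blanks
```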
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:2c9caaf8b6b2fdfb62d92a919208fc03efd5f67ed4502715c259b154e153fce9
 size 377611120
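These three lines are a Git LFS pointer, not the weights themselves: `oid` is the SHA-256 of the actual blob and `size` its byte count. A quick sketch for checking that a fully downloaded model.safetensors matches this pointer:

```python
# Sketch: verify a downloaded file against its Git LFS pointer fields.
import hashlib
from pathlib import Path

path = Path("model.safetensors")
expected_oid = "2c9caaf8b6b2fdfb62d92a919208fc03efd5f67ed4502715c259b154e153fce9"
expected_size = 377611120

digest = hashlib.sha256()
with path.open("rb") as f:
    for chunk in iter(lambda: f.read(1 << 20), b""):  # hash in 1 MiB chunks
        digest.update(chunk)

assert path.stat().st_size == expected_size, "size mismatch"
assert digest.hexdigest() == expected_oid, "sha256 mismatch"
print("model.safetensors matches its LFS pointer")
```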
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:fff3e75b01a4269829fc0ee65716f57b1f52fabd7be1b69252d9ef3c67f77eb3
 size 5240
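`training_args.bin` is the pickled `TrainingArguments` object the `Trainer` saves alongside a checkpoint, so the full hyperparameter set behind the card can be recovered from it. A sketch, with the usual caveat that unpickling executes code and should only be done on files you trust:

```python
# Sketch: inspect the serialized TrainingArguments.
# torch.load unpickles arbitrary objects; only run this on trusted files.
import torch
from transformers import TrainingArguments  # class must be importable to unpickle

args: TrainingArguments = torch.load("training_args.bin", weights_only=False)
print(args.learning_rate)     # 0.0001 per the card
print(args.num_train_epochs)  # 200
print(args.fp16)              # Native AMP -> True
```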