Add resume_config.json
Browse files- resume_config.json +25 -0
resume_config.json
ADDED
@@ -0,0 +1,25 @@
{
  "model_name": "openai/whisper-tiny",
  "language": "urdu",
  "task": "transcribe",
  "batch_size": 8,
  "grad_accum_steps": 2,
  "effective_batch": 16,
  "learning_rate": 3e-06,
  "warmup_steps": 200,
  "num_epochs": 20,
  "early_stopping_patience": 3,
  "eval_every_steps": 500,
  "save_every_steps": 500,
  "target_sr": 16000,
  "max_audio_length_s": 30,
  "fp16": true,
  "frozen_layers": [
    "model.encoder.conv1",
    "model.encoder.conv2"
  ],
  "optimizer": "AdamW (via Seq2SeqTrainer)",
  "scheduler": "linear with warmup",
  "generation_max_length": 225,
  "notes": "To resume fine-tuning, load this model with WhisperForConditionalGeneration.from_pretrained(repo_id) and use the same Seq2SeqTrainer config. Re-freeze conv1/conv2 if continuing on the same dataset; unfreeze them if adapting to a new domain."
}
|