techiaith
/

whisper-tiny-ft-cy-en

Automatic Speech Recognition

Generated from Trainer

Model card Files Files and versions

Metrics Training metrics Community

DewiBrynJones commited on Mar 4, 2024

Commit

5dfd7cd

·

verified ·

1 Parent(s): 9a56b7c

Update README.md

Files changed (1) hide show

README.md +16 -12

README.md CHANGED Viewed

@@ -6,29 +6,33 @@ metrics:
 model-index:
 - name: whisper-tiny-ft-cy
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# whisper-tiny-ft-cy
-This model was trained from scratch on an unknown dataset.
-It achieves the following results on the evaluation set:
-- Loss: 0.7176
-- Wer: 53.1135
-## Model description
-More information needed
 ## Intended uses & limitations
-More information needed
 ## Training and evaluation data
-More information needed
 ## Training procedure
@@ -61,4 +65,4 @@ The following hyperparameters were used during training:
 - Transformers 4.37.2
 - Pytorch 2.2.0+cu121
 - Datasets 2.16.1
-- Tokenizers 0.15.1

 model-index:
 - name: whisper-tiny-ft-cy
   results: []
+license: apache-2.0
+language:
+- cy
+- en
+pipeline_tag: automatic-speech-recognition
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# whisper-tiny-ft-cy-en
+This model is a fine-tune of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) using custom splits from
+Common Voice 16.1 Welsh and English datasets as well as normalized verbatim transcriptions from
+[techiaith/banc-trawsgrifiadau-bangor](https://huggingface.co/datasets/techiaith/banc-trawsgrifiadau-bangor)
 ## Intended uses & limitations
+Due to its small size, this model is intended to be used as the basis for offline speech recognition on devices such as
+Android phones.
 ## Training and evaluation data
+It achieves the following results on the evaluation set:
+- Loss: 0.7176
+- Wer: 53.1135
 ## Training procedure
 - Transformers 4.37.2
 - Pytorch 2.2.0+cu121
 - Datasets 2.16.1
+- Tokenizers 0.15.1