Automatic Speech Recognition
Welsh
English
whisper.cpp
DewiBrynJones committed on
Commit fc4d854 · verified · 1 Parent(s): 39e35f3

Update README.md

Files changed (1)
  1. README.md +27 -3
README.md CHANGED
@@ -1,3 +1,27 @@
- ---
- license: apache-2.0
- ---
+ ---
+ license: apache-2.0
+ datasets:
+ - techiaith/commonvoice_18_0_cy_en
+ language:
+ - cy
+ - en
+ base_model:
+ - openai/whisper-base
+ pipeline_tag: automatic-speech-recognition
+ tags:
+ - whisper.cpp
+ ---
+
+ # whisper-base-ft-cv-cy-en-cpp
+
+ This model is a version of the [openai/whisper-base](https://huggingface.co/openai/whisper-base) model, fine-tuned on the
+ [techiaith/commonvoice_18_0_cy_en](https://huggingface.co/datasets/techiaith/commonvoice_18_0_cy_en) dataset, and then
+ [converted for use in whisper.cpp](https://github.com/ggerganov/whisper.cpp/tree/master/models#fine-tuned-models). whisper.cpp is
+ a C/C++ port of Whisper that provides high-performance offline inference on hardware such as desktops, laptops and mobile devices.
+
+ The model is smaller than the corresponding cloud-hosted model [techiaith/whisper-large-v3-ft-cv-cy-en](https://huggingface.co/techiaith/whisper-large-v3-ft-cv-cy-en).
+ As a smaller model, it trades some accuracy for size: it detects the correct language in 98.34% of cases,
+ and for transcription it achieves the following word error rate (WER) results:
+
+ - Welsh: 40.10
+ - English: 30.9
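
For readers unfamiliar with the metric, the sketch below shows how a word error rate is conventionally computed: the word-level edit distance between a reference transcript and the model's output, divided by the number of reference words. It is an illustration only. The README does not say which tool or text normalisation produced the figures above, or whether they are percentages; the function name `word_error_rate` and the example sentences are hypothetical.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER as a percentage: word-level edit distance / reference length.

    Assumption: plain whitespace tokenisation with no case or punctuation
    normalisation. Real evaluations usually normalise text first.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] = edit distance between the reference words seen so far
    # (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # substitution, or match when cost is 0
            ))
        prev = curr

    return 100.0 * prev[-1] / len(ref)


if __name__ == "__main__":
    # One of five reference words is dropped by the hypothesis.
    print(word_error_rate("mae hi yn braf heddiw", "mae hi braf heddiw"))
```

Under this definition a lower value is better, and a value above 100 is possible when the hypothesis contains many inserted words.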