digiphyte commited on
Commit
a5aa79b
·
verified ·
1 Parent(s): 779eaf4

Add README.md

Browse files
Files changed (1) hide show
  1. README.md +46 -0
README.md ADDED
@@ -0,0 +1,46 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ language:
4
+ - af
5
+ base_model: openai/whisper-tiny
6
+ library_name: ctranslate2
7
+ pipeline_tag: automatic-speech-recognition
8
+ tags:
9
+ - whisper
10
+ - fluister
11
+ - afrikaans
12
+ - south-africa
13
+ - faster-whisper
14
+ - ctranslate2
15
+ ---
16
+
17
+ # Fluister (tiny)
18
+
19
+ **Fluister** is an Afrikaans-optimised Whisper. ("Fluister" is Afrikaans for "to whisper".) It is a
20
+ fine-tune of OpenAI Whisper (openai/whisper-tiny), merged into the base weights and converted to CTranslate2
21
+ (int8) for use with faster-whisper. By DigiPhyte (Pty) Ltd, South Africa.
22
+
23
+ On Afrikaans audio it reduces Whisper's drift to Dutch-style spellings ("gebou" not "gebouw", "mense" not "mensen") compared to stock Whisper whisper-tiny. As one of the smallest Whisper sizes its overall accuracy is limited; please read Limitations below before using it.
24
+
25
+ ## Use (faster-whisper)
26
+
27
+ ```python
28
+ from faster_whisper import WhisperModel
29
+ model = WhisperModel("digiphyte/fluister-tiny", device="cuda", compute_type="int8_float16") # CPU: device="cpu", compute_type="int8"
30
+ segments, info = model.transcribe("audio.wav", language="af", beam_size=5)
31
+ for s in segments:
32
+ print(s.text)
33
+ ```
34
+
35
+ Pass `language="af"`; the Fluister models are tuned for Afrikaans and SA English and should be told
36
+ the language rather than relying on auto-detect.
37
+
38
+ ## Limitations
39
+
40
+ Fluister narrows one specific failure: Whisper spelling Afrikaans as Dutch. It does not turn a small model into a large one. Absolute accuracy is still bounded by the base size, language auto-detect can still mislabel the audio (tell it `language="af"`), and proper nouns, numbers, and rare or technical terms can still be wrong. For English-only audio, stock Whisper or a larger size is usually the better choice.
41
+
42
+ **This is the smallest, fastest tier (tiny), meant for very modest or CPU-only machines.** It cuts the Dutch drift relative to stock Whisper whisper-tiny, but it is the least accurate model in the Fluister family: expect noticeably more errors overall, weaker Afrikaans/English code-switching (some Afrikaans words can leak into English passages), and weaker proper nouns. It is a speed-and-size trade-off, not a quality model. If your machine can run them, prefer `fluister-medium` or `fluister-large-v3`.
43
+
44
+ ## Licence and attribution
45
+
46
+ MIT (see `LICENSE`). This is a derivative work; the base model (OpenAI Whisper, Apache-2.0) and the training data (`andreoosthuizen/afrikaans-30s`, CC-BY-4.0) (this size was fine-tuned by DigiPhyte directly on that dataset) are credited in `NOTICE`.