amitkot committed
Commit ccf7fe8 · verified · 1 Parent(s): c1690fe

Upload README.md with huggingface_hub

  1. README.md +41 -0
README.md ADDED
@@ -0,0 +1,41 @@
+ ---
+ language: he
+ tags:
+ - whisper
+ - hebrew
+ - acft
+ - speech-recognition
+ library_name: transformers
+ base_model: yoad/whisper-tiny
+ ---
+
+ # whisper-yoad-tiny-he-acft
+
+ ACFT (Audio Context Fine-Tuning) applied to [yoad/whisper-tiny](https://huggingface.co/yoad/whisper-tiny) for Hebrew speech recognition.
+
+ ACFT aligns partial-context encoder representations with full-context ones, improving short-utterance inference (e.g., keyboard dictation).
+
+ ## Evaluation
+
+ WER on ivrit-ai/whisper-training test split (2000 samples, no normalization):
+
+ | Model | WER |
+ |-------|-----|
+ | yoad/whisper-tiny (base) | 0.3978 |
+ | yoad/whisper-tiny + ACFT (this model) | 0.3993 |
+
+ ## Training
+
+ - **Method:** ACFT (encoder MSE alignment)
+ - **Dataset:** google/fleurs he_il
+ - **Epochs:** 8
+ - **Device:** Apple MPS (M4 Pro)
+
+ ## Usage
+
+ ```python
+ from transformers import WhisperForConditionalGeneration, WhisperProcessor
+
+ model = WhisperForConditionalGeneration.from_pretrained("amitkot/whisper-yoad-tiny-he-acft")
+ processor = WhisperProcessor.from_pretrained("amitkot/whisper-yoad-tiny-he-acft")
+ ```
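
The Evaluation table in the added README reports WER with "no normalization". As a rough illustration of what that metric measures, here is a minimal sketch of word error rate computed as word-level edit distance divided by the number of reference words. The function names `word_error_rate` and `corpus_wer` are hypothetical, and the README does not say which tool produced its numbers or whether they were aggregated per corpus or per utterance, so this should not be read as the author's evaluation script.

```python
from typing import Iterable, List, Tuple


def _edit_distance(ref: List[str], hyp: List[str]) -> int:
    """Word-level Levenshtein distance (substitutions, insertions, deletions)."""
    # prev[j] holds the distance between the reference prefix seen so far and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER for one utterance, splitting on whitespace with no text normalization."""
    ref = reference.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    return _edit_distance(ref, hypothesis.split()) / len(ref)


def corpus_wer(pairs: Iterable[Tuple[str, str]]) -> float:
    """Assumed corpus-level aggregation: total word edits over total reference words."""
    edits = 0
    words = 0
    for reference, hypothesis in pairs:
        ref = reference.split()
        edits += _edit_distance(ref, hypothesis.split())
        words += len(ref)
    if words == 0:
        raise ValueError("references must contain at least one word")
    return edits / words
```

Because nothing is normalized, differences in casing or punctuation count as word errors: `word_error_rate("Hello world.", "hello world")` scores both words as wrong. Scores computed this way are therefore not directly comparable with WER figures reported after text normalization.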