mudler committed on
Commit
1b7d7f8
·
verified ·
1 Parent(s): f9e092b

Upload README.md with huggingface_hub

  1. README.md +83 -0
README.md ADDED
@@ -0,0 +1,83 @@
1
+ ---
2
+ language: it
3
+ license: mit
4
+ tags:
5
+ - whisper
6
+ - whisper.cpp
7
+ - ggml
8
+ - automatic-speech-recognition
9
+ - italian
10
+ - localai
11
+ - quantized
12
+ datasets:
13
+ - mozilla-foundation/common_voice_25_0
14
+ - facebook/multilingual_librispeech
15
+ - facebook/voxpopuli
16
+ - google/fleurs
17
+ base_model: openai/whisper-tiny
18
+ pipeline_tag: automatic-speech-recognition
19
+ ---
20
+
21
+ # whisper-tiny-it-multi-ggml
22
+
23
+ [whisper.cpp](https://github.com/ggerganov/whisper.cpp) GGML quantizations of [LocalAI-io/whisper-tiny-it-multi](https://huggingface.co/LocalAI-io/whisper-tiny-it-multi) for fast CPU/GPU inference.
24
+
25
+ **Author:** Ettore Di Giacinto
26
+
27
+ Brought to you by the [LocalAI](https://github.com/mudler/LocalAI) team. These models can be used directly with [LocalAI](https://localai.io) and any whisper.cpp-based runtime.
28
+
29
+ ## Files
30
+
31
+ | File | Quantization | Description |
32
+ |------|--------------|-------------|
33
+ | `ggml-model-f16.bin` | float16 | Half precision (no quantization), highest quality |
34
+ | `ggml-model-q8_0.bin` | int8 | 8-bit quantization — minimal quality loss |
35
+ | `ggml-model-q5_1.bin` | int5 | 5-bit quantization with a per-block scale and minimum; slightly larger and more precise than `q5_0` |
36
+ | `ggml-model-q5_0.bin` | int5 | 5-bit quantization — smallest size |
37
+
38
+ ## Training
39
+
40
+ Fine-tuned [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) (39M params) on **Common Voice 25.0 + MLS + VoxPopuli + FLEURS Italian**.
41
+
42
+ See [LocalAI-io/whisper-tiny-it-multi](https://huggingface.co/LocalAI-io/whisper-tiny-it-multi) for the full safetensors model and detailed WER results.
43
+
44
+ ## Usage
45
+
46
+ ### whisper.cpp
47
+
48
+ ```bash
49
+ # Download a quant
50
+ huggingface-cli download LocalAI-io/whisper-tiny-it-multi-ggml ggml-model-q5_0.bin --local-dir .
51
+
52
+ # Run
53
+ ./whisper-cli -m ggml-model-q5_0.bin -f audio.wav -l it
54
+ ```
55
+
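The whisper.cpp CLI has traditionally accepted only 16-bit WAV input sampled at 16 kHz (mono or stereo); newer builds may be more permissive. A minimal sketch for checking a file before running the command above, using only the Python standard library. The helper name `is_whisper_ready` is hypothetical and not part of whisper.cpp:

```python
import wave


def is_whisper_ready(path: str) -> bool:
    """Return True if the WAV file is 16 kHz, 16-bit PCM, mono or stereo.

    Assumption: these are the constraints the whisper.cpp CLI has
    historically enforced; check your build if it accepts other formats.
    """
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == 16000      # 16 kHz sample rate
            and wav.getsampwidth() == 2      # 2 bytes per sample = 16-bit
            and wav.getnchannels() in (1, 2)  # mono or stereo
        )


if __name__ == "__main__":
    import sys

    for name in sys.argv[1:]:
        print(name, "ok" if is_whisper_ready(name) else "needs conversion")
```

Files that fail the check can be resampled first, for example with `ffmpeg -i input.mp3 -ar 16000 -ac 1 -c:a pcm_s16le audio.wav`.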
56
+ ### whisper.cpp Python bindings (pywhispercpp)
57
+
58
+ ```python
59
+ from pywhispercpp.model import Model
60
+
61
+ model = Model("ggml-model-q5_0.bin", language="it")
62
+ segments = model.transcribe("audio.wav")
63
+ for seg in segments:
64
+     print(seg.text)
65
+ ```
66
+
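If timestamps are needed alongside the text, the segments can be formatted as below. This is a sketch under two assumptions: that each segment exposes `t0`, `t1`, and `text` attributes, and that `t0`/`t1` are in units of 10 ms, as whisper.cpp reports them. `FakeSegment`, `fmt`, and `timestamped_lines` are hypothetical names used only for illustration; verify the attribute names against your installed pywhispercpp version.

```python
from dataclasses import dataclass


@dataclass
class FakeSegment:
    """Hypothetical stand-in for a transcription segment (t0, t1, text)."""
    t0: int   # start time, assumed to be in units of 10 ms
    t1: int   # end time, assumed to be in units of 10 ms
    text: str


def fmt(ticks: int) -> str:
    """Convert 10 ms ticks to an MM:SS.mmm string."""
    total_ms = ticks * 10
    minutes, rest = divmod(total_ms, 60_000)
    seconds, ms = divmod(rest, 1000)
    return f"{minutes:02d}:{seconds:02d}.{ms:03d}"


def timestamped_lines(segments):
    """Return one '[start -> end] text' line per segment."""
    return [f"[{fmt(s.t0)} -> {fmt(s.t1)}] {s.text.strip()}" for s in segments]


if __name__ == "__main__":
    demo = [FakeSegment(0, 150, " Buongiorno a tutti."), FakeSegment(150, 320, " Come state?")]
    print("\n".join(timestamped_lines(demo)))
```

The same `timestamped_lines` function should work on the list returned by `model.transcribe(...)` if the assumptions above hold.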
67
+ ### LocalAI
68
+
69
+ ```yaml
70
+ # In your LocalAI model config
71
+ name: whisper-tiny-it-multi
72
+ backend: whisper
73
+ parameters:
74
+   model: ggml-model-q5_0.bin
75
+ ```
76
+
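Once the model config above is loaded, LocalAI should serve it through its OpenAI-compatible transcription endpoint. The sketch below builds the multipart request with the standard library only. It assumes LocalAI is listening on `http://localhost:8080` and that the response is JSON with a `text` field; the function names are hypothetical, so adjust them and the URL to your deployment.

```python
import json
import urllib.request
import uuid


def build_transcription_request(audio_path, model="whisper-tiny-it-multi",
                                base_url="http://localhost:8080"):
    """Build a multipart/form-data POST for /v1/audio/transcriptions.

    Assumptions: base_url points at a running LocalAI instance and
    `model` matches the `name` field in the model config.
    """
    boundary = uuid.uuid4().hex
    with open(audio_path, "rb") as f:
        audio = f.read()
    parts = [
        # Form field carrying the model name
        (f'--{boundary}\r\n'
         f'Content-Disposition: form-data; name="model"\r\n\r\n'
         f'{model}\r\n').encode(),
        # Form field carrying the audio file bytes
        (f'--{boundary}\r\n'
         f'Content-Disposition: form-data; name="file"; filename="audio.wav"\r\n'
         f'Content-Type: audio/wav\r\n\r\n').encode() + audio + b"\r\n",
        # Closing boundary
        f'--{boundary}--\r\n'.encode(),
    ]
    return urllib.request.Request(
        f"{base_url}/v1/audio/transcriptions",
        data=b"".join(parts),
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
        method="POST",
    )


def transcribe(audio_path, **kwargs):
    """Send the request and return the transcribed text (needs a live server)."""
    request = build_transcription_request(audio_path, **kwargs)
    with urllib.request.urlopen(request) as response:
        return json.load(response)["text"]
```

Calling `transcribe("audio.wav")` requires a running server; `build_transcription_request` can be inspected offline to confirm the URL and form fields.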
77
+ ## Links
78
+
79
+ - **HF Safetensors:** [LocalAI-io/whisper-tiny-it-multi](https://huggingface.co/LocalAI-io/whisper-tiny-it-multi)
80
+ - **CTranslate2 INT8:** [LocalAI-io/whisper-tiny-it-multi-ct2-int8](https://huggingface.co/LocalAI-io/whisper-tiny-it-multi-ct2-int8)
81
+ - **Code:** [github.com/localai-org/whisper-it](https://github.com/localai-org/whisper-it)
82
+ - **whisper.cpp:** [github.com/ggerganov/whisper.cpp](https://github.com/ggerganov/whisper.cpp)
83
+ - **LocalAI:** [github.com/mudler/LocalAI](https://github.com/mudler/LocalAI)