kcvmk commited on
Commit
f3e4faa
·
verified ·
1 Parent(s): c1ae423

Upload folder using huggingface_hub

Browse files
README.md ADDED
@@ -0,0 +1,33 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ tags:
4
+ - mlx
5
+ - 4bit
6
+ - whisper
7
+ - stt
8
+ - metalrt
9
+ - apple-silicon
10
+ ---
11
+
12
+ # Whisper Small — MLX 4-bit Quantized
13
+
14
+ Custom MLX 4-bit quantization of [OpenAI Whisper Small](https://huggingface.co/openai/whisper-small) optimized for [MetalRT](https://github.com/RunanywhereAI/metalrt-binaries) GPU inference on Apple Silicon.
15
+
16
+ ## Usage
17
+
18
+ Used by [RCLI](https://github.com/RunanywhereAI/RCLI) with the MetalRT engine for speech-to-text:
19
+
20
+ ```bash
21
+ rcli setup # select MetalRT or Both engines
22
+ ```
23
+
24
+ > **Note**: Whisper Small is in GPU beta. Whisper Tiny is recommended for production use.
25
+
26
+ ## License
27
+
28
+ Model weights: [MIT](https://github.com/openai/whisper/blob/main/LICENSE) (OpenAI)
29
+ MetalRT engine: [Proprietary](https://github.com/RunanywhereAI/metalrt-binaries/blob/main/LICENSE) (RunAnywhere, Inc.)
30
+
31
+ ## Contact
32
+
33
+ founder@runanywhere.ai | https://runanywhere.ai
config.json ADDED
@@ -0,0 +1,17 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "n_mels": 80,
3
+ "n_audio_ctx": 1500,
4
+ "n_audio_state": 768,
5
+ "n_audio_head": 12,
6
+ "n_audio_layer": 12,
7
+ "n_vocab": 51865,
8
+ "n_text_ctx": 448,
9
+ "n_text_state": 768,
10
+ "n_text_head": 12,
11
+ "n_text_layer": 12,
12
+ "quantization": {
13
+ "group_size": 64,
14
+ "bits": 4
15
+ },
16
+ "model_type": "whisper"
17
+ }
whisper-small-mlx-4bit/.gitattributes ADDED
@@ -0,0 +1,35 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ *.7z filter=lfs diff=lfs merge=lfs -text
2
+ *.arrow filter=lfs diff=lfs merge=lfs -text
3
+ *.bin filter=lfs diff=lfs merge=lfs -text
4
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
5
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
6
+ *.ftz filter=lfs diff=lfs merge=lfs -text
7
+ *.gz filter=lfs diff=lfs merge=lfs -text
8
+ *.h5 filter=lfs diff=lfs merge=lfs -text
9
+ *.joblib filter=lfs diff=lfs merge=lfs -text
10
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
11
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
12
+ *.model filter=lfs diff=lfs merge=lfs -text
13
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
14
+ *.npy filter=lfs diff=lfs merge=lfs -text
15
+ *.npz filter=lfs diff=lfs merge=lfs -text
16
+ *.onnx filter=lfs diff=lfs merge=lfs -text
17
+ *.ot filter=lfs diff=lfs merge=lfs -text
18
+ *.parquet filter=lfs diff=lfs merge=lfs -text
19
+ *.pb filter=lfs diff=lfs merge=lfs -text
20
+ *.pickle filter=lfs diff=lfs merge=lfs -text
21
+ *.pkl filter=lfs diff=lfs merge=lfs -text
22
+ *.pt filter=lfs diff=lfs merge=lfs -text
23
+ *.pth filter=lfs diff=lfs merge=lfs -text
24
+ *.rar filter=lfs diff=lfs merge=lfs -text
25
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
26
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
27
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
28
+ *.tar filter=lfs diff=lfs merge=lfs -text
29
+ *.tflite filter=lfs diff=lfs merge=lfs -text
30
+ *.tgz filter=lfs diff=lfs merge=lfs -text
31
+ *.wasm filter=lfs diff=lfs merge=lfs -text
32
+ *.xz filter=lfs diff=lfs merge=lfs -text
33
+ *.zip filter=lfs diff=lfs merge=lfs -text
34
+ *.zst filter=lfs diff=lfs merge=lfs -text
35
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
whisper-small-mlx-4bit/README.md ADDED
@@ -0,0 +1,16 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: mlx
3
+ ---
4
+
5
+ # whisper-small-mlx-4bit
6
+ This model was converted to MLX format from [`small`]().
7
+
8
+ ## Use with mlx
9
+ ```bash
10
+ git clone https://github.com/ml-explore/mlx-examples.git
11
+ cd mlx-examples/whisper/
12
+ pip install -r requirements.txt
13
+
14
+ >> import whisper
15
+ >> whisper.transcribe("FILE_NAME")
16
+ ```
whisper-small-mlx-4bit/config.json ADDED
@@ -0,0 +1,17 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "n_mels": 80,
3
+ "n_audio_ctx": 1500,
4
+ "n_audio_state": 768,
5
+ "n_audio_head": 12,
6
+ "n_audio_layer": 12,
7
+ "n_vocab": 51865,
8
+ "n_text_ctx": 448,
9
+ "n_text_state": 768,
10
+ "n_text_head": 12,
11
+ "n_text_layer": 12,
12
+ "quantization": {
13
+ "group_size": 64,
14
+ "bits": 4
15
+ },
16
+ "model_type": "whisper"
17
+ }
whisper-small-mlx-4bit/model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5798338ac817b4c1a57c7862f40d0bcb20aef3a94efca1289c1df11d5ed47103
3
+ size 196370700
whisper-small-mlx-4bit/tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
whisper-small-mlx-4bit/weights.npz ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ca6659298fe7550468ff0fc49dea7442615d9a53d1ce087aaded1b7627451998
3
+ size 196537352