Upload 15 files

Files changed (16)

.gitattributes +13 -0
LICENSE +21 -0
README.md +119 -0
whisper-base-q4_0.gguf +3 -0
whisper-base-q4_1.gguf +3 -0
whisper-base-q8_0.gguf +3 -0
whisper.cpp/whisper-base-q2_k.gguf +3 -0
whisper.cpp/whisper-base-q3_k.gguf +3 -0
whisper.cpp/whisper-base-q4_0.gguf +3 -0
whisper.cpp/whisper-base-q4_1.gguf +3 -0
whisper.cpp/whisper-base-q4_k.gguf +3 -0
whisper.cpp/whisper-base-q5_0.gguf +3 -0
whisper.cpp/whisper-base-q5_1.gguf +3 -0
whisper.cpp/whisper-base-q5_k.gguf +3 -0
whisper.cpp/whisper-base-q6_k.gguf +3 -0
whisper.cpp/whisper-base-q8_0.gguf +3 -0

.gitattributes CHANGED

@@ -33,3 +33,16 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+whisper-base-q4_0.gguf filter=lfs diff=lfs merge=lfs -text
+whisper-base-q4_1.gguf filter=lfs diff=lfs merge=lfs -text
+whisper-base-q8_0.gguf filter=lfs diff=lfs merge=lfs -text
+whisper.cpp/whisper-base-q2_k.gguf filter=lfs diff=lfs merge=lfs -text
+whisper.cpp/whisper-base-q3_k.gguf filter=lfs diff=lfs merge=lfs -text
+whisper.cpp/whisper-base-q4_0.gguf filter=lfs diff=lfs merge=lfs -text
+whisper.cpp/whisper-base-q4_1.gguf filter=lfs diff=lfs merge=lfs -text
+whisper.cpp/whisper-base-q4_k.gguf filter=lfs diff=lfs merge=lfs -text
+whisper.cpp/whisper-base-q5_0.gguf filter=lfs diff=lfs merge=lfs -text
+whisper.cpp/whisper-base-q5_1.gguf filter=lfs diff=lfs merge=lfs -text
+whisper.cpp/whisper-base-q5_k.gguf filter=lfs diff=lfs merge=lfs -text
+whisper.cpp/whisper-base-q6_k.gguf filter=lfs diff=lfs merge=lfs -text
+whisper.cpp/whisper-base-q8_0.gguf filter=lfs diff=lfs merge=lfs -text

LICENSE ADDED

	@@ -0,0 +1,21 @@

+MIT License
+Copyright (c) 2022 OpenAI
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.

README.md ADDED

	@@ -0,0 +1,119 @@

+---
+license: mit
+language:
+- multilingual
+- en
+- ru
+tags:
+- whisper
+- gguf
+- quantized
+- speech-recognition
+- rust
+base_model:
+- openai/whisper-base
+---
+# WHISPER-BASE - GGUF Quantized Models
+Quantized versions of [openai/whisper-base](https://huggingface.co/openai/whisper-base) in GGUF format.
+## Directory Structure
+```
+base/
+├── whisper-base-q*.gguf       # Candle-compatible GGUF models (root)
+├── config.json                # Model configuration for Candle
+├── tokenizer.json             # Tokenizer for Candle
+└── whisper.cpp/               # whisper.cpp-compatible models
+    └── whisper-base-q*.gguf
+```
+### Format Compatibility
+- **Root directory** (`whisper-base-*.gguf`): Use with **Candle** (Rust ML framework)
+  - Tensor names include `model.` prefix (e.g., `model.encoder.conv1.weight`)
+  - Compatible with Neurolang application
+  - Requires `config-base.json` and `tokenizer-base.json`
+- **whisper.cpp/** directory: Use with **whisper.cpp** (C++ implementation)
+  - Tensor names without `model.` prefix (e.g., `encoder.conv1.weight`)
+  - Compatible with whisper.cpp CLI tools
+  - Both directories contain `.gguf` files, not `.bin` files
+## Available Formats
+| Format | Quality | Use Case |
+|--------|---------|----------|
+| q2_k | Smallest | Extreme compression |
+| q3_k | Small | Mobile devices |
+| q4_0 | Good | Legacy compatibility |
+| q4_k | Good | **Recommended for production** |
+| q4_1 | Good+ | Legacy with bias |
+| q5_0 | Very Good | Legacy compatibility |
+| q5_k | Very Good | High quality |
+| q5_1 | Very Good+ | Legacy with bias |
+| q6_k | Excellent | Near-lossless |
+| q8_0 | Excellent | Minimal loss, benchmarking |
+## Usage
+### With Candle (Rust)
+> To run this model, you need to modify the Candle example code. For a quicker and easier way to try Whisper in Candle, use the tiny model instead: https://huggingface.co/oxide-lab/whisper-tiny-GGUF
+**Command line example:**
+```bash
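+# Run the Candle Whisper example with the quantized model from the Hugging Face Hub.
+# Cargo options such as --features go before the `--` separator; example options go after it.
+cargo run --example whisper --release --features symphonia -- \
+  --quantized \
+  --model base \
+  --model-id oxide-lab/whisper-base-GGUF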
+```
+### With whisper.cpp (C++)
+```bash
+# Use models from whisper.cpp/ subdirectory
+./whisper.cpp/build/bin/whisper-cli \
+  --model models/openai/base/whisper.cpp/whisper-base-q4_k.gguf \
+  --file audio.wav
+```
+### Recommended Format
+For most use cases, we recommend the **q4_k** format, which offers the best balance of:
+- Size reduction (~65% smaller)
+- Quality (minimal degradation)
+- Speed (faster inference than higher-bit quantizations)
+## Quantization Details
+- **Source Model**: [openai/whisper-base](https://huggingface.co/openai/whisper-base)
+- **Quantization Methods**:
+  - **Candle GGUF** (root directory): Python-based quantization via `convert_whisper_to_gguf.py`
+    - Adds `model.` prefix to tensor names for Candle compatibility
+  - **whisper.cpp GGML** (whisper.cpp/ subdirectory): whisper-quantize tool
+    - Uses original tensor names without prefix
+- **Format**: GGUF (GGML Universal Format) for both directories
+- **Total Formats**: 10 quantization levels (q2_k through q8_0)
+## License
+Same as the original Whisper model (MIT License).
+## Citation
+```bibtex
+@misc{radford2022whisper,
+  doi = {10.48550/ARXIV.2212.04356},
+  url = {https://arxiv.org/abs/2212.04356},
+  author = {Radford, Alec and Kim, Jong Wook and Xu, Tao and Brockman, Greg and McLeavey, Christine and Sutskever, Ilya},
+  title = {Robust Speech Recognition via Large-Scale Weak Supervision},
+  publisher = {arXiv},
+  year = {2022},
+  copyright = {arXiv.org perpetual, non-exclusive license}
+}
+```
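
The README's Format Compatibility section says the two sets of files differ only in tensor naming: the Candle files carry a `model.` prefix and the whisper.cpp files do not. The sketch below illustrates that mapping, assuming it is a plain string prefix. The conversion script `convert_whisper_to_gguf.py` is not part of this commit, so the helper names here are hypothetical and not taken from it.

```python
# Hypothetical helpers illustrating the naming difference described in the README.
# Assumption: the only difference is a literal "model." prefix on each tensor name.
CANDLE_PREFIX = "model."


def to_candle_name(name: str) -> str:
    """Add the `model.` prefix unless it is already present."""
    return name if name.startswith(CANDLE_PREFIX) else CANDLE_PREFIX + name


def to_whisper_cpp_name(name: str) -> str:
    """Strip the `model.` prefix if present, leaving other names unchanged."""
    return name[len(CANDLE_PREFIX):] if name.startswith(CANDLE_PREFIX) else name


print(to_candle_name("encoder.conv1.weight"))
print(to_whisper_cpp_name("model.encoder.conv1.weight"))
```

Both helpers are idempotent, so applying one twice to the same name does not double or over-strip the prefix.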

whisper-base-q4_0.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:22809041141b46886636a0fc7797c4c0115b511813d587c9d5173648807498d3
+size 42225024
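
Each `.gguf` entry in this commit is a Git LFS pointer recording the SHA-256 digest and byte size of the real file. A downloaded model can be checked against its pointer roughly as follows. This is a minimal sketch using only the Python standard library; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, and the pointer text is copied from the `whisper-base-q4_0.gguf` entry above.

```python
import hashlib
from pathlib import Path

# Pointer contents for the root whisper-base-q4_0.gguf, as shown in this commit.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:22809041141b46886636a0fc7797c4c0115b511813d587c9d5173648807498d3
size 42225024
"""


def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer into its version, hash algorithm, digest and size."""
    fields = {}
    for line in text.splitlines():
        line = line.lstrip("+").strip()  # tolerate diff-style '+' prefixes
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and digest the pointer records."""
    file_path = Path(path)
    if file_path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with file_path.open("rb") as handle:
        # Read in 1 MiB chunks so large model files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])
```

With the model downloaded locally, `matches_pointer("whisper-base-q4_0.gguf", pointer)` would report whether the file is intact. The size check runs first because it is cheap and catches truncated downloads before any hashing.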

whisper-base-q4_1.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:31bdfaeab20436cd45ba0dfb85c6b0d2c1fe200ecf8d6c048d74cfa45dfa4982
+size 46705312

whisper-base-q8_0.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:7073e51db7ab02b38cc4fceeac39adc2d7a19beb98badf66aa708f4f0ac71aa9
+size 78067328

whisper.cpp/whisper-base-q2_k.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b4b31be3e5502ba7448e9eeac62060350b78e00030b601e2bd2092fdf90b3f25
+size 29925346

whisper.cpp/whisper-base-q3_k.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1574ac666e7f91c3eb9d41f9817ec22863db00cf9a9edfe8769aef8adbb925fb
+size 37095158

whisper.cpp/whisper-base-q4_0.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e5ff00771a552622fede4e1414f774807977e954c979e726663f1e1f0c09dea3
+size 46471066

whisper.cpp/whisper-base-q4_1.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6beda1cd50bb57fb715047ad30db9a15bae104ca48a25075c57b6f23a49573fe
+size 50883258

whisper.cpp/whisper-base-q4_k.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1cc4b7f3abe855fd0ad772dc1f76728837a8cfe704f8026e79af73a997e7e7d2
+size 46471066

whisper.cpp/whisper-base-q5_0.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:7221da9b3d43a7367ae87cda6cdde2728ea70f086bf93939537bba7d9996729a
+size 55295450

whisper.cpp/whisper-base-q5_1.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:36a6bf6281261fec331fcc65235e0f466d855ea4dae1bbbfb28ed4c0cf72d8c0
+size 59707642

whisper.cpp/whisper-base-q5_k.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:98319420145d1890e89814fd670de1986ed7e97f3fc847f9c820ae4df932ba12
+size 55295450

whisper.cpp/whisper-base-q6_k.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:aee213144524dd1d6595b828063528b3ce5415422e3a7be069827860a2585dd8
+size 64671358

whisper.cpp/whisper-base-q8_0.gguf ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:cced49a244b3f776f42cfb4c41920037a4bc00058f41ef9579dee7f327d746de
+size 81768602
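
The README quotes a size reduction for q4_k without naming the baseline. The LFS pointers above do record exact byte sizes, so relative sizes within the `whisper.cpp/` set can be tabulated directly. The sketch below compares each format against q8_0. That baseline is an assumption made here for illustration, so the percentages are not comparable to the README's figure.

```python
# Sizes in bytes, copied from the Git LFS pointers for whisper.cpp/ in this commit.
WHISPER_CPP_SIZES = {
    "q2_k": 29925346,
    "q3_k": 37095158,
    "q4_0": 46471066,
    "q4_1": 50883258,
    "q4_k": 46471066,
    "q5_0": 55295450,
    "q5_1": 59707642,
    "q5_k": 55295450,
    "q6_k": 64671358,
    "q8_0": 81768602,
}


def reduction_vs(baseline: str, sizes: dict) -> dict:
    """Return each format's percentage size reduction relative to `baseline`."""
    base = sizes[baseline]
    return {name: 100.0 * (1 - size / base) for name, size in sizes.items()}


# Assumption: q8_0 is used as the reference; the README does not state its baseline.
reductions = reduction_vs("q8_0", WHISPER_CPP_SIZES)
for name in sorted(WHISPER_CPP_SIZES, key=WHISPER_CPP_SIZES.get):
    mib = WHISPER_CPP_SIZES[name] / 2**20
    print(f"{name}: {mib:6.1f} MiB, {reductions[name]:5.1f}% smaller than q8_0")
```

In this commit the q4_0 and q4_k pointers record the same size, as do q5_0 and q5_k.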