Instructions to use mohitsha/whisper-tiny-smooth-quant with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use mohitsha/whisper-tiny-smooth-quant with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="mohitsha/whisper-tiny-smooth-quant")# Load model directly from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq processor = AutoProcessor.from_pretrained("mohitsha/whisper-tiny-smooth-quant") model = AutoModelForSpeechSeq2Seq.from_pretrained("mohitsha/whisper-tiny-smooth-quant") - Notebooks
- Google Colab
- Kaggle
Upload decoder_with_past_model_quantized.onnx with huggingface_hub
Browse files
decoder_with_past_model_quantized.onnx
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6bf6fafef624f3bc890424124e7629a898a654cd927bb57f26211d7b2a2613a8
|
| 3 |
+
size 89097428
|