RedHatAI/whisper-large-v3-turbo-FP8-dynamic Automatic Speech Recognition • 0.9B • Updated Apr 22, 2025 • 4.97k • 6
RedHatAI/whisper-large-v3-turbo-quantized.w8a8 Automatic Speech Recognition • 0.9B • Updated Apr 22, 2025 • 614 • 4
RedHatAI/whisper-large-v3-turbo-quantized.w4a16 Automatic Speech Recognition • 0.9B • Updated 19 days ago • 844 • 8
RedHatAI/whisper-large-v3-quantized.w4a16 Automatic Speech Recognition • 0.3B • Updated Apr 22, 2025 • 971 • 3
RedHatAI/whisper-large-v3-FP8-dynamic Automatic Speech Recognition • 2B • Updated Apr 22, 2025 • 681 • 4
RedHatAI/whisper-large-v3-quantized.w8a8 Automatic Speech Recognition • 2B • Updated Apr 22, 2025 • 132 • 1
RedHatAI/whisper-medium-quantized.w8a8 Automatic Speech Recognition • 0.8B • Updated Apr 22, 2025 • 161
RedHatAI/whisper-tiny-quantized.w8a8 Automatic Speech Recognition • 57.8M • Updated Apr 22, 2025 • 63 • 1
RedHatAI/whisper-small-quantized.w8a8 Automatic Speech Recognition • 0.3B • Updated Apr 22, 2025 • 272
RedHatAI/whisper-large-v2-quantized.w8a8 Automatic Speech Recognition • 2B • Updated Apr 22, 2025 • 37
RedHatAI/whisper-medium-quantized.w4a16 Automatic Speech Recognition • 0.2B • Updated Apr 22, 2025 • 39
RedHatAI/whisper-small-quantized.w4a16 Automatic Speech Recognition • 77M • Updated Apr 22, 2025 • 43 • 1
RedHatAI/whisper-large-v2-quantized.w4a16 Automatic Speech Recognition • 0.3B • Updated Apr 22, 2025 • 45 • 1
RedHatAI/whisper-large-v2-quantized.w4a16 Automatic Speech Recognition • 0.3B • Updated Apr 22, 2025 • 45 • 1
The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language Models Paper • 2203.07259 • Published Mar 14, 2022 • 4