Neural Magic Enterprise

company

AI & ML interests

LLMs, optimization, compression, sparsification, quantization, pruning, distillation, NLP, CV

updated 17 models over 1 year ago

RedHatAI/whisper-large-v3-turbo-FP8-dynamic

Automatic Speech Recognition • 0.9B • Updated Apr 22, 2025 • 1.48k • 6

RedHatAI/whisper-large-v3-turbo-quantized.w8a8

Automatic Speech Recognition • 0.9B • Updated Apr 22, 2025 • 1.29k • 4

RedHatAI/whisper-large-v3-turbo-quantized.w4a16

Automatic Speech Recognition • 0.9B • Updated Apr 28 • 2.87k • 9

RedHatAI/whisper-large-v3-quantized.w4a16

Automatic Speech Recognition • 2B • Updated Apr 22, 2025 • 3.04k • 4

RedHatAI/whisper-large-v3-FP8-dynamic

Automatic Speech Recognition • 2B • Updated Apr 22, 2025 • 2.56k • 5

RedHatAI/whisper-large-v3-quantized.w8a8

Automatic Speech Recognition • 2B • Updated Apr 22, 2025 • 2.13k • 1

RedHatAI/whisper-medium-quantized.w8a8

Automatic Speech Recognition • 0.8B • Updated Apr 22, 2025 • 29

RedHatAI/whisper-medium-FP8-dynamic

Automatic Speech Recognition • 0.8B • Updated Apr 22, 2025 • 20

RedHatAI/whisper-tiny-quantized.w8a8

Automatic Speech Recognition • 57.8M • Updated Apr 22, 2025 • 36 • 1

RedHatAI/whisper-small-FP8-Dynamic

Automatic Speech Recognition • 0.3B • Updated Apr 22, 2025 • 19

RedHatAI/whisper-small-quantized.w8a8

Automatic Speech Recognition • 0.3B • Updated Apr 22, 2025 • 909

RedHatAI/whisper-large-v2-quantized.w8a8

Automatic Speech Recognition • 2B • Updated Apr 22, 2025 • 27

RedHatAI/whisper-medium-quantized.w4a16

Automatic Speech Recognition • 0.8B • Updated Apr 22, 2025 • 24

RedHatAI/whisper-tiny-FP8-Dynamic

Automatic Speech Recognition • 57.8M • Updated Apr 22, 2025 • 21

RedHatAI/whisper-small-quantized.w4a16

Automatic Speech Recognition • 0.3B • Updated Apr 22, 2025 • 22 • 1

RedHatAI/whisper-large-v2-FP8-Dynamic

Automatic Speech Recognition • 2B • Updated Apr 22, 2025 • 61

RedHatAI/whisper-large-v2-quantized.w4a16

Automatic Speech Recognition • 2B • Updated Apr 22, 2025 • 26 • 1

published a model over 1 year ago

RedHatAI/whisper-large-v2-quantized.w4a16

Automatic Speech Recognition • 2B • Updated Apr 22, 2025 • 26 • 1

authored 2 papers almost 2 years ago

How Well Do Sparse Imagenet Models Transfer?

Paper • 2111.13445 • Published Nov 26, 2021 • 1

The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language Models

Paper • 2203.07259 • Published Mar 14, 2022 • 4