Whisper Small - MLX FP16

This is the OpenAI Whisper Small model converted to MLX format with FP16 precision, optimized for Apple Silicon inference.

Model Details

Property Value
Base Model openai/whisper-small
Parameters ~244M
Format MLX SafeTensors (FP16)
Model Size 458.92 MB
Sample Rate 16,000 Hz
Audio Layers 12
Text Layers 12
Hidden Size 768
Attention Heads 12
Vocabulary Size 51,865

Intended Use

This model is optimized for on-device automatic speech recognition (ASR) on Apple Silicon devices (Mac, iPhone, iPad). It is designed for use with the WhisperKit or MLX frameworks.

Files

  • config.json - Model configuration
  • model.safetensors - Model weights in SafeTensors format (FP16)
  • multilingual.tiktoken - Tokenizer

Usage

import mlx_whisper

result = mlx_whisper.transcribe(
    "audio.mp3",
    path_or_hf_repo="aitytech/Whisper-Small-MLX-FP16",
)
print(result["text"])

Original Model

Downloads last month
-
MLX
Hardware compatibility
Log In to add your hardware

Quantized

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for aitytech/Whisper-Small-MLX-FP16

Finetuned
(3306)
this model

Paper for aitytech/Whisper-Small-MLX-FP16