Whisper Tiny Cantonese - CoreML

CoreML conversion of Oblivion208/whisper-tiny-cantonese for use with WhisperKit on Apple devices.

Model Description

This is a CoreML-converted and quantized version of the Whisper Tiny model fine-tuned for Cantonese (粵語) speech recognition. The model outputs colloquial Cantonese (口語) rather than written Chinese (書面語).

Performance

  • Character Error Rate (CER): 11.10% on the MDCC test set (measured on the original model, before CoreML conversion)
  • Optimized for: Apple Neural Engine (ANE)
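For readers unfamiliar with the metric, CER is the character-level edit distance between the reference and the transcript, divided by the number of reference characters. The sketch below is a minimal illustration of that formula, not the evaluation script behind the figure above. The function names are hypothetical, and it applies no text normalization (punctuation, spacing, variant characters), which a real MDCC evaluation may well do.

```swift
/// Levenshtein edit distance between two character sequences
/// (insertions, deletions, and substitutions each cost 1).
func editDistance(_ a: [Character], _ b: [Character]) -> Int {
    if a.isEmpty { return b.count }
    if b.isEmpty { return a.count }
    // previous[j] holds the distance between a[0..<i-1] and b[0..<j].
    var previous = Array(0...b.count)
    for i in 1...a.count {
        var current = [i] + Array(repeating: 0, count: b.count)
        for j in 1...b.count {
            let substitution = previous[j - 1] + (a[i - 1] == b[j - 1] ? 0 : 1)
            current[j] = min(previous[j] + 1, current[j - 1] + 1, substitution)
        }
        previous = current
    }
    return previous[b.count]
}

/// CER = edit distance / number of reference characters.
/// Assumption: an empty reference scores 0 for an empty hypothesis, else 1.
func characterErrorRate(reference: String, hypothesis: String) -> Double {
    let ref = Array(reference)
    let hyp = Array(hypothesis)
    guard !ref.isEmpty else { return hyp.isEmpty ? 0 : 1 }
    return Double(editDistance(ref, hyp)) / Double(ref.count)
}

// One substituted character out of five reference characters.
let cer = characterErrorRate(reference: "我哋去食飯", hypothesis: "我們去食飯")
print(cer)  // 0.2
```

Because the model emits colloquial Cantonese, a reference written in formal Chinese would inflate this number even when the transcript is faithful to the audio, so the reference text should use the same register.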

License

This model is distributed under the Apache 2.0 License.

Attribution

This work is based on:

  1. OpenAI Whisper - Original Whisper model

  2. Oblivion208/whisper-tiny-cantonese - Fine-tuned Cantonese model

  3. WhisperKit - CoreML conversion tools

Changes Made

  • Converted from PyTorch to CoreML format using whisperkittools
  • Applied 8-bit mixed-precision quantization for reduced model size
  • Packaged with tokenizer and configuration files for WhisperKit compatibility
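Before pointing WhisperKit at a local copy, it can help to confirm that the download is complete. The sketch below checks a folder for a list of expected entries. The entry names are assumptions based on the usual layout of WhisperKit model folders and are not confirmed by this card, so adjust the list to match the files actually shipped in this repository.

```swift
import Foundation

// Assumed entry names (typical WhisperKit layout); not confirmed by this card.
let expectedEntries = [
    "AudioEncoder.mlmodelc",
    "MelSpectrogram.mlmodelc",
    "TextDecoder.mlmodelc",
    "config.json",
]

/// Returns the names from `expected` that do not exist inside `folder`.
func missingEntries(in folder: URL, expected: [String]) -> [String] {
    let fileManager = FileManager.default
    return expected.filter { name in
        !fileManager.fileExists(atPath: folder.appendingPathComponent(name).path)
    }
}

// Hypothetical local path; replace with the real download location.
let modelFolder = URL(fileURLWithPath: "path/to/whisper-tiny-cantonese")
let missing = missingEntries(in: modelFolder, expected: expectedEntries)
print(missing.isEmpty ? "Model folder looks complete" : "Missing: \(missing)")
```

Checking up front gives a clearer message than a model-loading failure at initialization time.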

Usage

This model is designed for use with WhisperKit on iOS/macOS:

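import WhisperKit

// Load the converted model from a local folder and run the audio encoder
// on the CPU and Neural Engine (the Core ML case is .cpuAndNeuralEngine).
let whisperKit = try await WhisperKit(
    modelFolder: "path/to/whisper-tiny-cantonese",
    computeOptions: .init(audioEncoderCompute: .cpuAndNeuralEngine)
)

// Transcribe a local audio file.
let result = try await whisperKit.transcribe(audioPath: "audio.wav")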

Disclaimer

This model is provided "as is" without warranty of any kind. Performance may vary based on audio quality, accent, and speaking style.
