YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Whisper Tiny Cantonese - CoreML

CoreML conversion of Oblivion208/whisper-tiny-cantonese for use with WhisperKit on Apple devices.

Model Description

This is a CoreML-converted and quantized version of the Whisper Tiny model fine-tuned for Cantonese (粵語) speech recognition. The model outputs colloquial Cantonese (口語) rather than written Chinese (書面語).

Performance

  • Character Error Rate (CER): 11.10% on MDCC test set (original model)
  • Optimized for: Apple Neural Engine (ANE)

License

This model is distributed under the Apache 2.0 License.

Attribution

This work is based on:

  1. OpenAI Whisper - Original Whisper model

  2. Oblivion208/whisper-tiny-cantonese - Fine-tuned Cantonese model

  3. WhisperKit - CoreML conversion tools

Changes Made

  • Converted from PyTorch to CoreML format using whisperkittools
  • Applied 8-bit mixed-precision quantization for reduced model size
  • Packaged with tokenizer and configuration files for WhisperKit compatibility

Usage

This model is designed for use with WhisperKit on iOS/macOS:

import WhisperKit

let whisperKit = try await WhisperKit(
    modelFolder: "path/to/whisper-tiny-cantonese",
    computeOptions: .init(audioEncoderCompute: .cpuAndNE)
)

let result = try await whisperKit.transcribe(audioPath: "audio.wav")

Disclaimer

This model is provided "as is" without warranty of any kind. Performance may vary based on audio quality, accent, and speaking style.

Downloads last month
22
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support