Robust Speech Recognition via Large-Scale Weak Supervision
Paper • 2212.04356 • Published • 53
GGML conversion of xmzhu/whisper-tiny-zh-TW for use with whisper.cpp
This is a GGML-converted version of xmzhu/whisper-tiny-zh-TW.
Original model credit: xmzhu/whisper-tiny-zh-TW
All credit for the fine-tuning work goes to the original authors. This repository only provides the GGML conversion for use with whisper.cpp and mobile applications.
ggml-tiny-zh.binconvert-h5-to-ggml.py# Clone whisper.cpp
git clone https://github.com/ggerganov/whisper.cpp
cd whisper.cpp
# Build
make
# Download this model
wget https://huggingface.co/wabisabisocial/whisper-tiny-mandarin-ggml/resolve/main/ggml-tiny-zh.bin
# Run transcription
./main -m ggml-tiny-zh.bin -f audio.wav -l zh
import WhisperContext from 'whisper.rn';
const context = await WhisperContext.initWithUrl(
'https://huggingface.co/wabisabisocial/whisper-tiny-mandarin-ggml/resolve/main/ggml-tiny-zh.bin'
);
const { result } = await context.transcribe(audioPath, {
language: 'zh',
});
wget https://huggingface.co/wabisabisocial/whisper-tiny-mandarin-ggml/resolve/main/ggml-tiny-zh.bin
For performance metrics and evaluation results, please refer to the original model.
This model is released under the Apache 2.0 License, matching the original model's license.
Per Apache 2.0 requirements:
If you use this model, please cite both the original model and OpenAI Whisper:
@misc{whisper-zh-ggml-wabisabisocial,
author = {wabisabisocial},
title = {whisper-tiny-mandarin-ggml},
year = {2026},
publisher = {Hugging Face},
note = {GGML conversion of xmzhu/whisper-tiny-zh-TW},
howpublished = {\url{https://huggingface.co/wabisabisocial/whisper-tiny-mandarin-ggml}}
}
@article{radford2022whisper,
title = {Robust Speech Recognition via Large-Scale Weak Supervision},
author = {Radford, Alec and Kim, Jong Wook and Xu, Tao and Brockman, Greg and McLeavey, Christine and Sutskever, Ilya},
journal = {arXiv preprint arXiv:2212.04356},
year = {2022}
}
Base model
openai/whisper-base