fluister-tiny / NOTICE
digiphyte's picture
Add NOTICE
eeb8ad9 verified
Raw
History Blame Contribute Delete
1.64 kB
Afrikaans-tuned Whisper model by DigiPhyte (Pty) Ltd
====================================================
This model is a fine-tune / derivative work, distributed under the MIT License
(see LICENSE). The upstream works below are gratefully acknowledged and their
licences are preserved as required.
1. Base model: OpenAI Whisper
-----------------------------
openai/whisper-large-v3 (and large-v3-turbo / medium / small variants)
Copyright OpenAI. Licensed under the Apache License, Version 2.0.
https://huggingface.co/openai/whisper-large-v3
A copy of the Apache 2.0 licence is included as LICENSE-whisper-apache-2.0.txt.
Modifications by DigiPhyte: fine-tuned with a LoRA adapter for South African
Afrikaans, then merged into the base weights and converted to the CTranslate2
format for use with faster-whisper.
2. Training data: afrikaans-30s
-------------------------------
andreoosthuizen/afrikaans-30s (approx. 56 hours of Afrikaans speech).
Licensed under Creative Commons Attribution 4.0 International (CC BY 4.0).
https://huggingface.co/datasets/andreoosthuizen/afrikaans-30s
3. LoRA adapter (large-v3 model only)
-------------------------------------
andreoosthuizen/whisper-large-v3-afrikaans
Copyright (c) Andre Oosthuizen. Licensed under the MIT License.
https://huggingface.co/andreoosthuizen/whisper-large-v3-afrikaans
This attribution applies to the large-v3 Afrikaans model, which reuses this
adapter. The medium, large-v3-turbo, and small Afrikaans models were fine-tuned
by DigiPhyte directly on the afrikaans-30s dataset (item 2) and do not include
this adapter.