metadata
base_model:
- openai/whisper-large-v3-turbo
datasets:
- aether-raid/noisereduced_SGdataset
metrics:
- wer
pipeline_tag: automatic-speech-recognition
library_name: transformers
license: mit
Whisper Large V3 Turbo (WLV3t) trained on denoised-sgatc with
- The following Augmentations (HLBT):
- H: high pass
- L: low pass
- B: band pass
- T: tanh distortion
Code: to be added
Citation
If you use the data, please cite the following paper:
@misc{wee2025adaptingautomaticspeechrecognition,
title={Adapting Automatic Speech Recognition for Accented Air Traffic Control Communications},
author={Marcus Yu Zhe Wee and Justin Juin Hng Wong and Lynus Lim and Joe Yu Wei Tan and Prannaya Gupta and Dillion Lim and En Hao Tew and Aloysius Keng Siew Han and Yong Zhi Lim},
year={2025},
eprint={2502.20311},
archivePrefix={arXiv},
primaryClass={cs.LG},
url={https://arxiv.org/abs/2502.20311},
}