WS-nrSG-HLBT / README.md

nielsr HF Staff

Add library_name, license and link to code

a7e0e21 verified about 1 year ago

962 Bytes

base_model:
  - openai/whisper-large-v3-turbo
datasets:
  - aether-raid/noisereduced_SGdataset
metrics:
  - wer
pipeline_tag: automatic-speech-recognition
library_name: transformers
license: mit

Whisper Large V3 Turbo (WLV3t) trained on denoised-sgatc with

The following Augmentations (HLBT):
- H: high pass
- L: low pass
- B: band pass
- T: tanh distortion

Code: to be added

Citation

If you use the data, please cite the following paper:

@misc{wee2025adaptingautomaticspeechrecognition,
      title={Adapting Automatic Speech Recognition for Accented Air Traffic Control Communications}, 
      author={Marcus Yu Zhe Wee and Justin Juin Hng Wong and Lynus Lim and Joe Yu Wei Tan and Prannaya Gupta and Dillion Lim and En Hao Tew and Aloysius Keng Siew Han and Yong Zhi Lim},
      year={2025},
      eprint={2502.20311},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2502.20311}, 
}