---
datasets:
  - aether-raid/SGdataset
metrics:
  - wer
base_model:
  - openai/whisper-large-v3-turbo
pipeline_tag: automatic-speech-recognition
---

Whisper Large V3 Turbo (WLV3t) trained on `sgatc` with
- Loud Normalization (LN)
- The following Augmentations (HLBT):
  - T: time stretch
  - S: seven band parametric EQ
  - H: high pass
  - L: low pass
  - B: band pass
  - T: tanh distortion


## Citation
If you use the data, please cite the following paper:

```bibtex
@misc{wee2025adaptingautomaticspeechrecognition,
      title={Adapting Automatic Speech Recognition for Accented Air Traffic Control Communications}, 
      author={Marcus Yu Zhe Wee and Justin Juin Hng Wong and Lynus Lim and Joe Yu Wei Tan and Prannaya Gupta and Dillion Lim and En Hao Tew and Aloysius Keng Siew Han and Yong Zhi Lim},
      year={2025},
      eprint={2502.20311},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2502.20311}, 
}
```