ESPnet
English
audio
audio_captioning
shikhar7ssu's picture
Upload 19 files
e7d1aab verified

RESULTS

Environments

  • date: Fri Nov 29 20:06:53 EST 2024
  • python version: 3.9.20 (main, Oct 3 2024, 07:27:41) [GCC 11.2.0]
  • espnet version: espnet 202409
  • pytorch version: pytorch 2.4.0
  • Git hash: 65ea259e8effab5a43cdff87161a301dc0f20930
    • Commit date: Fri Nov 29 10:54:44 2024 -0500

exp/asr_ft

WER

dataset Snt Wrd Corr Sub Del Ins Err S.Err
inference_ctc_weight0.0_hugging_face_decoderTrue_asr_model_latest/evaluation 1045 0 0.0 0.0 0.0 0.0 0.0 100.0
inference_ctc_weight0.0_hugging_face_decoderTrue_asr_model_valid.acc.ave_5best/evaluation 1045 0 0.0 0.0 0.0 0.0 0.0 100.0

CER

dataset Snt Wrd Corr Sub Del Ins Err S.Err
inference_ctc_weight0.0_hugging_face_decoderTrue_asr_model_latest/evaluation 1045 0 0.0 0.0 0.0 0.0 0.0 100.0
inference_ctc_weight0.0_hugging_face_decoderTrue_asr_model_valid.acc.ave_5best/evaluation 1045 0 0.0 0.0 0.0 0.0 0.0 100.0

TER

dataset Snt Wrd Corr Sub Del Ins Err S.Err