ESPnet
English
audio
audio_captioning
File size: 1,132 Bytes
e7d1aab
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
<!-- Generated by scripts/utils/show_asr_result.sh -->
# RESULTS
## Environments
- date: `Fri Nov 29 20:06:53 EST 2024`
- python version: `3.9.20 (main, Oct  3 2024, 07:27:41)  [GCC 11.2.0]`
- espnet version: `espnet 202409`
- pytorch version: `pytorch 2.4.0`
- Git hash: `65ea259e8effab5a43cdff87161a301dc0f20930`
  - Commit date: `Fri Nov 29 10:54:44 2024 -0500`

## exp/asr_ft
### WER

|dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
|---|---|---|---|---|---|---|---|---|
|inference_ctc_weight0.0_hugging_face_decoderTrue_asr_model_latest/evaluation|1045|0|0.0|0.0|0.0|0.0|0.0|100.0|
|inference_ctc_weight0.0_hugging_face_decoderTrue_asr_model_valid.acc.ave_5best/evaluation|1045|0|0.0|0.0|0.0|0.0|0.0|100.0|

### CER

|dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
|---|---|---|---|---|---|---|---|---|
|inference_ctc_weight0.0_hugging_face_decoderTrue_asr_model_latest/evaluation|1045|0|0.0|0.0|0.0|0.0|0.0|100.0|
|inference_ctc_weight0.0_hugging_face_decoderTrue_asr_model_valid.acc.ave_5best/evaluation|1045|0|0.0|0.0|0.0|0.0|0.0|100.0|

### TER

|dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
|---|---|---|---|---|---|---|---|---|