tiny_transducer_ctc/large_bpe_500/decode_results/greedy_search/log-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam-2023-01-16-09-52-16
Tiance Wang
upload models and results
2c2e885
2023-01-16 09:52:16,827 INFO [decode.py:565] Decoding started
2023-01-16 09:52:16,827 INFO [decode.py:571] Device: cuda:0
2023-01-16 09:52:17,018 INFO [lexicon.py:168] Loading pre-compiled data/lang_bpe_500/Linv.pt
2023-01-16 09:52:17,057 INFO [decode.py:588] { 'activation': 'doubleswish',
  'avg': 1,
  'batch_idx_train': 0,
  'beam': 20.0,
  'beam_size': 4,
  'best_train_epoch': -1,
  'best_train_loss': inf,
  'best_valid_epoch': -1,
  'best_valid_loss': inf,
  'blank_id': 0,
  'bucketing_sampler': True,
  'channels': 400,
  'concatenate_cuts': False,
  'context_size': 2,
  'conv_layers': 18,
  'decoder_dim': 400,
  'decoding_method': 'greedy_search',
  'drop_last': True,
  'duration_factor': 1.0,
  'enable_musan': True,
  'enable_spec_aug': True,
  'encoder_dim': 400,
  'env_info': { 'IP address': '127.0.1.1',
                'hostname': 'kao-dgxa-f12-u17',
                'icefall-git-branch': 'tiny',
                'icefall-git-date': 'Fri Jan 13 07:21:29 2023',
                'icefall-git-sha1': '5c8e962-dirty',
                'icefall-path': '/home/jsong/git/icefall',
                'k2-build-type': 'Release',
                'k2-git-date': 'Fri Nov 25 08:23:51 2022',
                'k2-git-sha1': '1feafa064cf3b6c243e6b33b0192601224210937',
                'k2-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/k2/__init__.py',
                'k2-version': '1.23.2',
                'k2-with-cuda': True,
                'lhotse-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/lhotse/__init__.py',
                'lhotse-version': '1.7.0',
                'python-version': '3.9',
                'torch-cuda-available': True,
                'torch-cuda-version': '11.3',
                'torch-version': '1.12.0'},
  'epoch': 30,
  'exp_dir': PosixPath('tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug'),
  'feature_dim': 80,
  'full_libri': True,
  'gap': 1.0,
  'input_strategy': 'PrecomputedFeatures',
  'iter': 0,
  'joiner_dim': 400,
  'lang_dir': PosixPath('data/lang_bpe_500'),
  'log_interval': 500,
  'manifest_dir': PosixPath('data/fbank'),
  'max_contexts': 8,
  'max_duration': 600,
  'max_states': 64,
  'max_sym_per_frame': 1,
  'nbest_scale': 0.5,
  'ngram_lm_scale': 0.1,
  'num_buckets': 30,
  'num_paths': 100,
  'num_workers': 2,
  'on_the_fly_feats': False,
  'res_dir': PosixPath('tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/greedy_search'),
  'reset_interval': 200,
  'return_cuts': True,
  'shuffle': True,
  'skip_add': True,
  'spec_aug_time_warp_factor': 80,
  'subsampling_factor': 4,
  'suffix': 'epoch-30-avg-1-context-2-max-sym-per-frame-1-uam',
  'unk_id': 2,
  'use_averaged_model': True,
  'use_double_scores': True,
  'use_dscnn': True,
  'valid_interval': 9000,
  'vocab_size': 500,
  'warm_step': 5000}
2023-01-16 09:52:17,058 INFO [decode.py:590] About to create model
2023-01-16 09:52:19,589 INFO [train.py:426] Encoder MAC ops for 10 seconds of audio is 888.44M
2023-01-16 09:52:19,594 INFO [decode.py:659] Calculating the averaged model over epoch range from 29 (excluded) to 30
2023-01-16 09:52:19,977 INFO [decode.py:697] Number of model parameters: 4821330
2023-01-16 09:52:19,977 INFO [decode.py:698] Parameters for transducer decoding: 4219830
2023-01-16 09:52:19,977 INFO [asr_datamodule.py:449] About to get test-clean cuts
2023-01-16 09:52:19,978 INFO [asr_datamodule.py:456] About to get test-other cuts
2023-01-16 09:52:21,770 INFO [decode.py:459] batch 0/?, cuts processed until now is 43
2023-01-16 09:52:39,401 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/greedy_search/recogs-test-clean-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt
2023-01-16 09:52:39,467 INFO [utils.py:536] [test-clean-greedy_search] %WER 8.29% [4359 / 52576, 501 ins, 443 del, 3415 sub ]
2023-01-16 09:52:39,617 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/greedy_search/errs-test-clean-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt
2023-01-16 09:52:39,618 INFO [decode.py:508]
For test-clean, WER of different settings are:
greedy_search 8.29 best for test-clean
2023-01-16 09:52:40,281 INFO [decode.py:459] batch 0/?, cuts processed until now is 52
2023-01-16 09:52:56,315 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/greedy_search/recogs-test-other-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt
2023-01-16 09:52:56,386 INFO [utils.py:536] [test-other-greedy_search] %WER 21.11% [11052 / 52343, 1006 ins, 1534 del, 8512 sub ]
2023-01-16 09:52:56,547 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/greedy_search/errs-test-other-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt
2023-01-16 09:52:56,547 INFO [decode.py:508]
For test-other, WER of different settings are:
greedy_search 21.11 best for test-other
2023-01-16 09:52:56,547 INFO [decode.py:730] Done!
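
The %WER lines above report the error total together with its insertion, deletion and substitution counts, so the percentages can be re-derived from the log itself. The following is a minimal sketch for doing that, not part of icefall: the regular expression and the helper name `parse_wer` are assumptions made for illustration, and the pattern is written only against the two %WER lines in this log.

```python
import re

# Hypothetical pattern matching the %WER summary lines as they appear in this log, e.g.
# [test-clean-greedy_search] %WER 8.29% [4359 / 52576, 501 ins, 443 del, 3415 sub ]
WER_LINE = re.compile(
    r"\[(?P<name>[^\]]+)\] %WER (?P<wer>[\d.]+)% "
    r"\[(?P<errs>\d+) / (?P<words>\d+), "
    r"(?P<ins>\d+) ins, (?P<dels>\d+) del, (?P<subs>\d+) sub \]"
)


def parse_wer(line):
    """Return the fields of a %WER log line as a dict, or None if it is not one."""
    m = WER_LINE.search(line)
    if m is None:
        return None
    fields = {k: int(m.group(k)) for k in ("errs", "words", "ins", "dels", "subs")}
    fields["name"] = m.group("name")
    fields["wer_reported"] = float(m.group("wer"))
    # WER = (insertions + deletions + substitutions) / reference words, in percent.
    edits = fields["ins"] + fields["dels"] + fields["subs"]
    fields["wer_recomputed"] = 100.0 * edits / fields["words"]
    return fields


if __name__ == "__main__":
    # The two summary lines copied from the log above.
    lines = [
        "2023-01-16 09:52:39,467 INFO [utils.py:536] [test-clean-greedy_search] "
        "%WER 8.29% [4359 / 52576, 501 ins, 443 del, 3415 sub ]",
        "2023-01-16 09:52:56,386 INFO [utils.py:536] [test-other-greedy_search] "
        "%WER 21.11% [11052 / 52343, 1006 ins, 1534 del, 8512 sub ]",
    ]
    for line in lines:
        r = parse_wer(line)
        print(r["name"], r["wer_reported"], round(r["wer_recomputed"], 2))
```

For both test sets the three error counts sum to the reported error total (501 + 443 + 3415 = 4359 and 1006 + 1534 + 8512 = 11052), and dividing by the reference word count reproduces the logged percentages after rounding to two decimals.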