tiny_transducer_ctc
/
large_bpe_500
/decode_results
/greedy_search
/log-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam-2023-01-16-09-52-16
2023-01-16 09:52:16,827 INFO [decode.py:565] Decoding started | |
2023-01-16 09:52:16,827 INFO [decode.py:571] Device: cuda:0 | |
2023-01-16 09:52:17,018 INFO [lexicon.py:168] Loading pre-compiled data/lang_bpe_500/Linv.pt | |
2023-01-16 09:52:17,057 INFO [decode.py:588] { 'activation': 'doubleswish', | |
'avg': 1, | |
'batch_idx_train': 0, | |
'beam': 20.0, | |
'beam_size': 4, | |
'best_train_epoch': -1, | |
'best_train_loss': inf, | |
'best_valid_epoch': -1, | |
'best_valid_loss': inf, | |
'blank_id': 0, | |
'bucketing_sampler': True, | |
'channels': 400, | |
'concatenate_cuts': False, | |
'context_size': 2, | |
'conv_layers': 18, | |
'decoder_dim': 400, | |
'decoding_method': 'greedy_search', | |
'drop_last': True, | |
'duration_factor': 1.0, | |
'enable_musan': True, | |
'enable_spec_aug': True, | |
'encoder_dim': 400, | |
'env_info': { 'IP address': '127.0.1.1', | |
'hostname': 'kao-dgxa-f12-u17', | |
'icefall-git-branch': 'tiny', | |
'icefall-git-date': 'Fri Jan 13 07:21:29 2023', | |
'icefall-git-sha1': '5c8e962-dirty', | |
'icefall-path': '/home/jsong/git/icefall', | |
'k2-build-type': 'Release', | |
'k2-git-date': 'Fri Nov 25 08:23:51 2022', | |
'k2-git-sha1': '1feafa064cf3b6c243e6b33b0192601224210937', | |
'k2-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/k2/__init__.py', | |
'k2-version': '1.23.2', | |
'k2-with-cuda': True, | |
'lhotse-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/lhotse/__init__.py', | |
'lhotse-version': '1.7.0', | |
'python-version': '3.9', | |
'torch-cuda-available': True, | |
'torch-cuda-version': '11.3', | |
'torch-version': '1.12.0'}, | |
'epoch': 30, | |
'exp_dir': PosixPath('tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug'), | |
'feature_dim': 80, | |
'full_libri': True, | |
'gap': 1.0, | |
'input_strategy': 'PrecomputedFeatures', | |
'iter': 0, | |
'joiner_dim': 400, | |
'lang_dir': PosixPath('data/lang_bpe_500'), | |
'log_interval': 500, | |
'manifest_dir': PosixPath('data/fbank'), | |
'max_contexts': 8, | |
'max_duration': 600, | |
'max_states': 64, | |
'max_sym_per_frame': 1, | |
'nbest_scale': 0.5, | |
'ngram_lm_scale': 0.1, | |
'num_buckets': 30, | |
'num_paths': 100, | |
'num_workers': 2, | |
'on_the_fly_feats': False, | |
'res_dir': PosixPath('tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/greedy_search'), | |
'reset_interval': 200, | |
'return_cuts': True, | |
'shuffle': True, | |
'skip_add': True, | |
'spec_aug_time_warp_factor': 80, | |
'subsampling_factor': 4, | |
'suffix': 'epoch-30-avg-1-context-2-max-sym-per-frame-1-uam', | |
'unk_id': 2, | |
'use_averaged_model': True, | |
'use_double_scores': True, | |
'use_dscnn': True, | |
'valid_interval': 9000, | |
'vocab_size': 500, | |
'warm_step': 5000} | |
2023-01-16 09:52:17,058 INFO [decode.py:590] About to create model | |
2023-01-16 09:52:19,589 INFO [train.py:426] Encoder MAC ops for 10 seconds of audio is 888.44M | |
2023-01-16 09:52:19,594 INFO [decode.py:659] Calculating the averaged model over epoch range from 29 (excluded) to 30 | |
2023-01-16 09:52:19,977 INFO [decode.py:697] Number of model parameters: 4821330 | |
2023-01-16 09:52:19,977 INFO [decode.py:698] Parameters for transducer decoding: 4219830 | |
2023-01-16 09:52:19,977 INFO [asr_datamodule.py:449] About to get test-clean cuts | |
2023-01-16 09:52:19,978 INFO [asr_datamodule.py:456] About to get test-other cuts | |
2023-01-16 09:52:21,770 INFO [decode.py:459] batch 0/?, cuts processed until now is 43 | |
2023-01-16 09:52:39,401 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/greedy_search/recogs-test-clean-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt | |
2023-01-16 09:52:39,467 INFO [utils.py:536] [test-clean-greedy_search] %WER 8.29% [4359 / 52576, 501 ins, 443 del, 3415 sub ] | |
2023-01-16 09:52:39,617 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/greedy_search/errs-test-clean-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt | |
2023-01-16 09:52:39,618 INFO [decode.py:508] | |
For test-clean, WER of different settings are: | |
greedy_search 8.29 best for test-clean | |
2023-01-16 09:52:40,281 INFO [decode.py:459] batch 0/?, cuts processed until now is 52 | |
2023-01-16 09:52:56,315 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/greedy_search/recogs-test-other-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt | |
2023-01-16 09:52:56,386 INFO [utils.py:536] [test-other-greedy_search] %WER 21.11% [11052 / 52343, 1006 ins, 1534 del, 8512 sub ] | |
2023-01-16 09:52:56,547 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/greedy_search/errs-test-other-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt | |
2023-01-16 09:52:56,547 INFO [decode.py:508] | |
For test-other, WER of different settings are: | |
greedy_search 21.11 best for test-other | |
2023-01-16 09:52:56,547 INFO [decode.py:730] Done! | |