icefall

Author	SHA1	Message	Date
Lucky Wong	31686ac829	Fix not enough values to unpack error. (#533 )	2022-08-18 10:45:06 +08:00
marcoyang1998	c74cec59e9	propagate changes from #525 to other librispeech recipes (#531 ) * propagate changes from #525 to other librispeech recipes * refactor display_and_save_batch to utils * fixed typo * reformat code style	2022-08-17 17:18:15 +08:00
Fangjun Kuang	669401869d	Filter non-finite losses (#525 ) * Filter non-finite losses * Fixes after review	2022-08-17 12:22:43 +08:00
yangsuxia	951b03f6d7	Add function display_and_save_batch in wenetspeech/pruned_transducer_stateless2/train.py (#528 ) * Add function display_and_save_batch in egs/wenetspeech/ASR/pruned_transducer_stateless2/train.py * Modify function: display_and_save_batch * Delete empty line in pruned_transducer_stateless2/train.py * Modify code format	2022-08-13 11:09:54 +08:00
Wei Kang	5c17255eec	Sort results to make it more convenient to compare decoding results (#522 ) * Sort result to make it more convenient to compare decoding results * Add cut_id to recognition results * add cut_id to results for all recipes * Fix torch.jit.script * Fix comments * Minor fixes * Fix torch.jit.tracing for Pytorch version before v1.9.0	2022-08-12 07:12:50 +08:00
Fangjun Kuang	5149788cb2	Fix computing averaged loss in the aishell recipe. (#523 ) * Fix computing averaged loss in the aishell recipe. * Set find_unused_parameters optionally.	2022-08-09 10:53:31 +08:00
FNLPprojects	f24b76e64b	fix torchaudio version (#524 ) * fix torchaudio version * fix torchaudio version	2022-08-06 18:33:43 +08:00
Fangjun Kuang	1f7832b93c	Fix loading sampler state dict. (#421 ) * Fix loading sampler state dict. * skip scan_pessimistic_batches_for_oom if params.start_batch > 0	2022-08-06 10:00:08 +08:00
Yunusemre	7157f62af3	Merging onnx models (#518 ) * add export function of onnx-all-in-one to export.py * add onnx_check script for all-in-one onnx model * minor fix * remove unused arguments * add onnx-all-in-one test * fix style * fix style * fix requirements * fix input/output names * fix installing onnx_graphsurgeon * fix installing onnx_graphsurgeon * revert to previous requirements.txt * fix minor	2022-08-04 23:03:41 +08:00
Zengwei Yao	a4dd273776	fix about tensorboard (#516 ) * fix metricstracker * fix style	2022-08-04 19:57:12 +08:00
Mingshuang Luo	e538232485	change for pruned rnnt5 train.py (#519 )	2022-08-04 12:29:39 +08:00
Weiji Zhuang	36eacaccb2	Fix preparing char based lang and add multiprocessing for wenetspeech text segmentation (#513 ) * add multiprocessing for wenetspeech text segmentation * Fix preparing char based lang for wenetspeech * fix style Co-authored-by: WeijiZhuang <zhuangweiji@xiaomi.com>	2022-08-03 19:19:40 +08:00
Fangjun Kuang	6af5a82d8f	Convert ScaledEmbedding to nn.Embedding for inference. (#517 ) * Convert ScaledEmbedding to nn.Embedding for inference. * Fix CI style issues.	2022-08-03 15:34:55 +08:00
Fangjun Kuang	58a96e5b68	Support exporting to ONNX format (#501 ) * WIP: Support exporting to ONNX format * Minor fixes. * Combine encoder/decoder/joiner into a single file. * Revert merging three onnx models into a single one. It's quite time consuming to extract a sub-graph from the combined model. For instance, it takes more than one hour to extract the encoder model. * Update CI to test ONNX models. * Decode with exported models. * Fix typos. * Add more doc. * Remove ncnn as it is not fully tested yet. * Fix as_strided for streaming conformer.	2022-08-03 10:30:28 +08:00
LIyong.Guo	132132f52a	liear_fst_with_self_loops (#512 )	2022-08-02 22:28:12 +08:00
Wei Kang	2f75236c05	Support dynamic chunk streaming training in pruned_transducer_stateless5 (#454 ) * support dynamic chunk streaming training * Add simulate streaming decoding * Support streaming decoding * fix causal * Minor fixes * fix streaming decode; add results	2022-07-29 16:40:06 +08:00
Mingshuang Luo	1b478d3ac3	Add other decoding methods (nbest, nbest oracle, nbest LG) for wenetspeech pruned rnnt2 (#482 ) * add other decoding methods for wenetspeech * changes for RESULTS.md * add ngram-lm-scale=0.35 results * set ngram-lm-scale=0.35 as default * Update README.md * add nbest-scale for file name	2022-07-29 12:03:08 +08:00
Lucky Wong	34b4356bad	correction for get rank id. (#507 ) * Fix no attribute 'data' error. * minor fixes * correction for get rank id.	2022-07-29 11:28:52 +08:00
Fangjun Kuang	ec69967584	Set overwrite=True when extracting features in batches. (#487 )	2022-07-29 11:17:19 +08:00
Mingshuang Luo	389f9c77e5	correction for prepare.sh (#506 )	2022-07-28 17:01:46 +08:00
boji123	3c9e7f733b	[debug] raise remind when git-lfs not available (#504 ) * [debug] raise remind when git-lfs not available * modify comment	2022-07-28 16:17:49 +08:00
Mingshuang Luo	f26b62ac00	[WIP] Pruned-transducer-stateless5-for-WenetSpeech (offline and streaming) (#447 ) * pruned-rnnt5-for-wenetspeech * style check * style check * add streaming conformer * add streaming decode * changes codes for fast_beam_search and export cpu jit * add modified-beam-search for streaming decoding * add modified-beam-search for streaming decoding * change for streaming_beam_search.py * add README.md and RESULTS.md * change for style_check.yml * do some changes * do some changes for export.py * add some decode commands for usage * add streaming results on README.md	2022-07-28 12:54:27 +08:00
Fangjun Kuang	385645d533	Fix get_transducer_model() for aishell. (#497 ) PR #495 introduces an error. This commit fixes it.	2022-07-26 15:42:21 +08:00
Fangjun Kuang	d3fc4b031e	Support using aidatatang_200zh optionally in aishell training (#495 ) * Use aidatatang_200zh optionally in aishell training.	2022-07-26 11:25:01 +08:00
Fangjun Kuang	4612b03947	Fix using G before assignment in pruned_transducer_stateless/decode.py (#494 )	2022-07-26 10:37:02 +08:00
Wei Kang	b1d0956855	Add modified_beam_search for streaming decode (#489 ) * Add modified_beam_search for pruned_transducer_stateless/streaming_decode.py * refactor * modified beam search for stateless3,4 * Fix comments * Add real streaming ci	2022-07-25 16:53:23 +08:00
Zengwei Yao	8203d10be7	Add stats about duration and padding proportion (#485 ) * add stats about duration and padding proportion * add for utt_duration * add stats for other recipes * add stats for other 2 recipes * modify doc * minor change	2022-07-25 16:40:43 +08:00
Fangjun Kuang	d99796898c	Update doc to add a link to Nadira Povey's YouTube channel. (#492 ) * Update doc to add a link to Nadira Povey's YouTube channel. * fix a typo	2022-07-25 12:06:40 +08:00
Quandwang	116d0cf26d	CTC attention model with reworked Conformer encoder and reworked Transformer decoder (#462 ) * ctc attention model with reworked conformer encoder and reworked transformer decoder * remove unnecessary func * resolve flake8 conflicts * fix typos and modify the expr of ScaledEmbedding * use original beam size * minor changes to the scripts * add rnn lm decoding * minor changes * check whether q k v weight is None * check whether q k v weight is None * check whether q k v weight is None * style correction * update results * update results * upload the decoding results of rnn-lm to the RESULTS * upload the decoding results of rnn-lm to the RESULTS * Update egs/librispeech/ASR/RESULTS.md Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/librispeech/ASR/RESULTS.md Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/librispeech/ASR/RESULTS.md Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2022-07-22 15:31:25 +08:00
Mingshuang Luo	3d2986b4c2	Update conformer.py for aishell4 (#484 ) * update conformer.py for aishell4 * update conformer.py * add strict=False when model.load_state_dict	2022-07-20 21:32:53 +08:00
Daniel Povey	a8696b36fc	Merge pull request #483 from yaozengwei/fix_diagnostic Fix diagnostic	2022-07-18 23:33:45 -07:00
yaozengwei	a35b28cd8d	fix for case of None stats	2022-07-19 14:29:23 +08:00
ezerhouni	608473b4eb	Add RNN-LM rescoring in fast beam search (#475 )	2022-07-18 16:52:17 +08:00
Mingshuang Luo	aec222e2fe	add compile_lg.py for aishell2 recipe (#481 )	2022-07-18 14:36:40 +08:00
ezerhouni	ffca1ae7fb	[WIP] Rnn-T LM nbest rescoring (#471 )	2022-07-15 10:32:54 +08:00
Yuekai Zhang	c17233eca7	[Ready] [Recipes] add aishell2 (#465 ) * add aishell2 * fix aishell2 * add manifest stats * update prepare char dict * fix lint * setting max duration * lint * change context size to 1 * update result * update hf link * fix decoding comment * add more decoding methods * update result * change context-size 2 default	2022-07-14 14:46:56 +08:00
LIyong.Guo	f8d28f0998	update multi_quantization installation (#469 ) * update multi_quantization installation * Update egs/librispeech/ASR/pruned_transducer_stateless6/train.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2022-07-13 21:16:45 +08:00
Zengwei Yao	bc2882ddcc	Simplified memory bank for Emformer (#440 ) * init files * use average value as memory vector for each chunk * change tail padding length from right_context_length to chunk_length * correct the files, ln -> cp * fix bug in conv_emformer_transducer_stateless2/emformer.py * fix doc in conv_emformer_transducer_stateless/emformer.py * refactor init states for stream * modify .flake8 * fix bug about memory mask when memory_size==0 * add @torch.jit.export for init_states function * update RESULTS.md * minor change * update README.md * modify doc * replace torch.div() with << * fix bug, >> -> << * use i&i-1 to judge if it is a power of 2 * minor fix * fix error in RESULTS.md	2022-07-12 19:19:58 +08:00
Zengwei Yao	ce26495238	Rand combine update result (#467 ) * update RESULTS.md * fix test code in pruned_transducer_stateless5/conformer.py * minor fix * delete doc * fix style	2022-07-11 18:13:31 +08:00
Fangjun Kuang	6c69c4e253	Support running icefall outside of a git tracked directory. (#470 ) * Support running icefall outside of a git tracked directory. * Minor fixes.	2022-07-08 15:03:07 +08:00
Fangjun Kuang	e5fdbcd480	Revert changes to setup_logger. (#468 )	2022-07-08 09:15:37 +08:00
Fangjun Kuang	8761452a2c	Add multi_quantization to requirements.txt (#464 ) * Add multi_quantization to requirements.txt	2022-07-07 14:36:08 +08:00
Mingshuang Luo	8e0b7ea518	mv split cuts before computing feature (#461 )	2022-07-04 11:59:37 +08:00
Mingshuang Luo	10e8bc5b56	do a change (#460 )	2022-07-03 19:35:01 +08:00
Tiance Wang	ac9fe5342b	Fix TIMIT lexicon generation bug (#456 )	2022-06-30 19:13:46 +08:00
Zengwei Yao	d80f29e662	Modification about random combine (#452 ) * comment some lines, random combine from 1/3 layers, on linear layers in combiner * delete commented lines * minor change	2022-06-30 12:23:49 +08:00
Mingshuang Luo	c10aec5656	load_manifest_lazy for asr_datamodule.py (#453 )	2022-06-29 17:45:30 +08:00
Mingshuang Luo	29e407fd04	Code checks for pruned rnnt2 wenetspeech (#451 ) * code check * jq install	2022-06-28 18:57:53 +08:00
Mingshuang Luo	bfa8264697	code check (#450 )	2022-06-28 17:32:20 +08:00
Mingshuang Luo	2cb1618c95	[Ready to merge] Pruned transducer stateless5 recipe for tal_csasr (mix Chinese chars and English BPE) (#428 ) * add pruned transducer stateless5 recipe for tal_csasr * do some changes for merging * change for conformer.py * add wer and cer for Chinese and English respectively * fix an error for conformer.py	2022-06-28 11:02:10 +08:00

752 Commits