icefall

Author	SHA1	Message	Date
wgb14	72abd38f27	use KaldifeatFbank to compute fbank for musan	2022-01-17 18:54:16 -05:00
wgb14	6e5b189fc5	DynamicBucketingSampler	2021-12-29 15:22:46 -05:00
wgb14	bea78f6094	lazy loading and use SingleCutSampler	2021-12-17 00:38:52 -05:00
Guanbo Wang	532309bf72	Add conformer.py without pre-commit checking	2021-12-16 20:20:41 -05:00
wgb14	76a289126f	add conformer training recipe	2021-12-16 20:18:02 -05:00
Guanbo Wang	71ef6a9e11	Merge remote-tracking branch 'upstream/master' into gigaspeech_recipe	2021-12-16 19:13:14 -05:00
Wei Kang	76a51bf037	Fix aishell tdnn_lstm_ctc decoding (#149 )	2021-12-14 14:42:58 +08:00
Wei Kang	a183d5bfd7	Remove batchnorm (#147 ) * Remove batch normalization * Minor fixes * Fix typo * Fix comments * Add assertion for use_feat_batchnorm	2021-12-14 08:20:03 +08:00
Fangjun Kuang	95af039733	RNN-T training for yesno. (#141 ) * RNN-T training for yesno. * Rename Jointer to Joiner.	2021-12-07 21:44:37 +08:00
Fangjun Kuang	1aff64b708	Apply layer normalization to the output of each gate in LSTM/GRU. (#139 ) * Apply layer normalization to the output of each gate in LSTM. * Apply layer normalization to the output of each gate in GRU. * Add projection support to LayerNormLSTMCell. * Add GPU tests. * Use typeguard.check_argument_types() to validate type annotations. * Add typeguard as a requirement. * Minor fixes. * Fix CI. * Fix CI. * Fix test failures for torch 1.8.0 * Fix errors.	2021-12-07 18:38:03 +08:00
pingfengluo	d1adc25338	Update AIShell recipe result (#140 ) * add MMI to AIShell * fix MMI decode graph * export model * typo * fix code style * typo * fix data prepare to just use train text by uid * use a faster way to get the intersection of train and aishell_transcript_v0.8.txt * update AIShell result * update * typo	2021-12-04 14:43:04 +08:00
wgb14	4316ec43d7	small fix	2021-12-03 16:34:36 -05:00
pingfengluo	89b84208aa	add phone based LF-MMI training to AIShell recipe (#137 ) * add MMI to AIShell * fix MMI decode graph * export model * typo * fix code style * typo	2021-12-02 12:32:23 +08:00
wgb14	64bd3f7df4	set audio duration mismatch tolerance to 0.01	2021-12-01 17:49:46 -05:00
Wang, Guanbo	b8beb00ecc	Merge pull request #2 from csukuangfj/fix-giga Split manifests into 2000 pieces.	2021-11-30 00:28:58 -05:00
Fangjun Kuang	8109c2b913	Split manifests into 2000 pieces.	2021-11-30 12:04:15 +08:00
Fangjun Kuang	ec591698b0	Associate a cut with token alignment (without repeats) (#125 ) * WIP: Associate a cut with token alignment (without repeats) * Save framewise alignments with/without repeats. * Minor fixes.	2021-11-29 18:50:54 +08:00
Wang, Guanbo	ee7c56c7d9	Merge pull request #1 from csukuangfj/fix-giga Compute features for GigaSpeech by splitting the manifest.	2021-11-28 02:19:57 -05:00
Fangjun Kuang	4351e1ea14	Fixes after review.	2021-11-28 15:10:55 +08:00
Fangjun Kuang	317f5ec64e	Compute features for GigaSpeech by splitting the manifest.	2021-11-28 13:24:05 +08:00
Fangjun Kuang	243fb9723c	Fix an error introduced while supporting torchscript. (#134 ) Should be `G.dummy = 1`, not `G["dummy"] = 1`.	2021-11-27 09:07:04 +08:00
Fangjun Kuang	0e541f5b5d	Print hostname and IP address to the log. (#131 ) We are using multiple machines to do various experiments. It makes life easier to know which experiment is running on which machine if we also log the IP and hostname of the machine.	2021-11-26 11:25:59 +08:00
LIyong.Guo	00e2f0ade8	Draft streaming decoding (#89 ) * reusable parts from conformer_ctc * streaming conformer code * a trained model	2021-11-24 19:35:18 +08:00
Lucky Wong	769a9791ec	Fix no attribute 'data' error. (#129 )	2021-11-22 18:31:04 +08:00
Wei Kang	e2c9c728d9	Update aishell tensorboard log for new LabelSmoothing loss (#128 ) * Update aishell tensorboard log for new LabelSmoothing loss * Minor fixes	2021-11-22 12:26:44 +08:00
Wei Kang	4151cca147	Add torch script support for Aishell and update documents (#124 ) * Add aishell recipe * Remove unnecessary code and update docs * adapt to k2 v1.7, add docs and results * Update conformer ctc model * Update docs, pretrained.py & results * Fix code style * Fix code style * Fix code style * Minor fix * Minor fix * Fix pretrained.py * Update pretrained model & corresponding docs * Export torch script model for Aishell * Add C++ deployment docs * Minor fixes * Fix unit test * Update Readme	2021-11-19 16:37:05 +08:00
Wei Kang	30c43b7f69	Add aishell recipe (#30 ) * Add aishell recipe * Remove unnecessary code and update docs * adapt to k2 v1.7, add docs and results * Update conformer ctc model * Update docs, pretrained.py & results * Fix code style * Fix code style * Fix code style * Minor fix * Minor fix * Fix pretrained.py * Update pretrained model & corresponding docs	2021-11-18 10:00:47 +08:00
Fangjun Kuang	0660d12e4e	Fix computing WERs for empty hypotheses (#118 ) * Fix computing WERs when empty lattices are generated. * Minor fixes.	2021-11-17 19:25:47 +08:00
Fangjun Kuang	336283f872	New label smoothing (#109 ) * Modify label smoothing to match the one implemented in PyTorch. * Enable CI for torch 1.10 * Fix CI errors. * Fix CI installation errors. * Fix CI installation errors. * Minor fixes. * Minor fixes. * Minor fixes. * Minor fixes. * Minor fixes. * Fix CI errors.	2021-11-17 19:24:07 +08:00
Mingshuang Luo	10e46f3e1d	A little changes for timit recipe (#122 ) * Update train.py * Update train.py * Update train.py * Update tdnn_ligru_ctc.rst	2021-11-17 16:13:51 +08:00
Mingshuang Luo	2e0f255ada	Add timit recipe (including the code scripts and the docs) for icefall (#114 ) * add timit recipe for icefall * add shared file * update the docs for timit recipe * Delete shared * update the timit recipe and check style * Update model.py * Do some changes * Update model.py * Update model.py * Add README.md and RESULTS.md * Update RESULTS.md * Update README.md * update the docs for timit recipe	2021-11-17 11:23:45 +08:00
wgb14	fa734e01a3	chunked feature extraction by default	2021-11-16 20:23:12 -05:00
Fangjun Kuang	68506609ad	Set fsa.properties to None after changing its labels in-place. (#121 )	2021-11-16 23:11:30 +08:00
wgb14	89c0e2e7ff	small fix	2021-11-14 01:13:07 -05:00
wgb14	9d08b44b19	small fix	2021-11-14 00:44:30 -05:00
wgb14	16f1799ef3	support HLG for BPE	2021-11-13 23:59:50 -05:00
wgb14	3dbb15bda2	support BPE based lang	2021-11-13 23:27:45 -05:00
wgb14	1d58765bd5	on-the-fly feature extraction by default	2021-11-13 17:45:35 -05:00
wgb14	75860159a2	support download, data prep, and fbank	2021-11-12 14:43:19 -05:00
Daniel Povey	b9452235d5	Merge pull request #117 from csukuangfj/fix-empty-lattice Handle empty lattices in attention decoder rescoring.	2021-11-11 16:26:02 +08:00
Fangjun Kuang	5b10310bd1	Handle empty lattices in attention decoder rescoring.	2021-11-11 15:42:30 +08:00
Fangjun Kuang	8d679c3e74	Fix typos. (#115 )	2021-11-10 14:45:30 +08:00
Fangjun Kuang	21096e99d8	Update result for the librispeech recipe using vocab size 500 and att rate 0.8 (#113 ) * Update RESULTS using vocab size 500, att rate 0.8 * Update README. * Refactoring. Since FSAs in an Nbest object are linear in structure, we can add the scores of a path to compute the total scores. * Update documentation. * Change default vocab size from 5000 to 500.	2021-11-10 14:32:52 +08:00
wgb14	b7bda9eaf6	initial commit	2021-11-09 01:12:21 -05:00
Fangjun Kuang	04029871b6	Fix a bug in Nbest.compute_am_scores and Nbest.compute_lm_scores. (#111 )	2021-11-09 13:44:51 +08:00
Fangjun Kuang	91cfecebf2	Remove duplicated token seq in rescoring. (#108 ) * Remove duplicated token seq in rescoring. * Use a larger range for ngram_lm_scale and attention_scale	2021-11-06 08:54:45 +08:00
Fangjun Kuang	810b193dcc	Clarify the doc about ctc-decoding. (#104 )	2021-11-03 07:16:49 +08:00
Fangjun Kuang	42b437bea6	Use pre-sorted text to generate token ids for attention decoder. (#98 ) * Use pre-sorted text to generate token ids for attention decoder. See https://github.com/k2-fsa/icefall/issues/97 for more details. * Fix typos.	2021-10-29 13:46:41 +08:00
Fangjun Kuang	12d647d899	Add a note about the CUDA OOM error. (#94 ) * Add a note about the CUDA OOM error. Some users consider this kind of OOM as an error during decoding, but actually it is not. This pull request clarifies that. * Fix style issues.	2021-10-29 12:17:56 +08:00
Fangjun Kuang	8cb7f712e4	Use GPU for averaging checkpoints if possible. (#84 )	2021-10-26 17:10:04 +08:00

1 2 3

134 Commits