icefall

Author	SHA1	Message	Date
Fangjun Kuang	7100c33820	Add pruned RNN-T for aishell. (#436 ) * Add pruned RNN-T for aishell. * support torch script. * Update CI. * Minor fixes. * Add links to sherpa.	2022-06-21 21:17:22 +08:00
2xwwx2	91b2765cfd	Fixs spelling mistake (#438 )	2022-06-20 16:41:04 +08:00
Mingshuang Luo	998091ef52	do some changes for export.py (#437 )	2022-06-20 14:57:08 +08:00
Fangjun Kuang	bfeab319c9	Fix aishell. (#416 )	2022-06-10 11:47:43 +08:00
Fangjun Kuang	dbda1644b5	Replace load_manifest_lazy with load_manifest for MUSAN. (#412 )	2022-06-09 11:42:18 +08:00
Fangjun Kuang	ed66877694	Replace ChunkedLilcomHdf5Writer with LilcomChunkyWriter. (#411 )	2022-06-09 11:18:52 +08:00
Fangjun Kuang	1094a3cb37	Replace LilcomChunkyWriter with ChunkedLilcomHdf5Writer. (#404 )	2022-06-07 18:14:25 +08:00
Fangjun Kuang	f1abce72f8	Use jsonl for CutSet in the LibriSpeech recipe. (#397 ) * Use jsonl for cutsets in the librispeech recipe. * Use lazy cutset for all recipes. * More fixes to use lazy CutSet. * Remove force=True from logging to support Python < 3.8 * Minor fixes. * Fix style issues.	2022-06-06 10:19:16 +08:00
Ewald Enzinger	8c5722de8c	[egs] Add prefix when reading manifests due to recent lhotse changes (#382 ) * [egs] Add prefix when reading manifests due to recent lhotse changes * Fix wenetspeech * Fix style issues	2022-05-23 23:37:35 +08:00
Fangjun Kuang	aeb8986e35	Ignore padding frames during RNN-T decoding. (#358 ) * Ignore padding frames during RNN-T decoding. * Fix outdated decoding code. * Minor fixes.	2022-05-13 07:39:14 +08:00
Mingshuang Luo	f783e10dc8	Do some changes for aishell/ASR/transducer stateless/export.py (#347 ) * do some changes for aishell/ASR/transducer_stateless/export.py	2022-05-07 11:09:31 +08:00
Fangjun Kuang	78b8792d1d	Fix potential bugs in PyTorch that exist in label_smoothing. (#300 )	2022-04-08 13:41:33 +08:00
Wei Kang	cb3ba16f2b	Fix aishell prepare.sh when using pre-download data (#291 )	2022-04-05 10:22:49 +08:00
Fangjun Kuang	395a3f952b	Batch decoding for models trained with optimized_transducer (#267 ) * Add greedy search in batch mode. * Add modified beam search in batch mode.	2022-03-23 19:11:34 +08:00
Mingshuang Luo	d0d806560f	Change for asr_datamodule.py (#241 ) * change for asr_datamodule.py * fix style check * do a fix	2022-03-14 00:30:58 +08:00
Fangjun Kuang	2f0fbf430c	Remove duplicate files. (#236 )	2022-03-04 11:56:31 +08:00
Fangjun Kuang	3ec219dfa0	Add stateless transducer tutorial. (#235 ) * WIP: Add stateless transducer tutorial. * Add more doc. * Minor fixes.	2022-03-03 22:33:47 +08:00
Fangjun Kuang	50d2281524	Add modified transducer loss for AIShell dataset (#219 ) * Add modified transducer for aishell. * Minor fixes. * Add extra data in transducer training. The extra data is from http://www.openslr.org/62/ * Update export.py and pretrained.py * Update CI to install pretrained models with aishell. * Update results. * Update results. * Update README. * Use symlinks to avoid copies.	2022-03-02 16:02:38 +08:00
PF Luo	ac7c2d84bc	minor fix for aishell recipe (#223 ) * just remove unnecessary torch.sum * minor fixs for aishell	2022-02-23 08:33:20 +08:00
Fangjun Kuang	1c35ae1dba	Reset seed at the beginning of each epoch. (#221 ) * Reset seed at the beginning of each epoch. * Use a different seed for each epoch.	2022-02-21 15:16:39 +08:00
Fangjun Kuang	cbf8c18ebd	Minor fixes for aishell (#218 ) * Minor fixes to aishell. * Minor fixes.	2022-02-19 22:28:19 +08:00
PF Luo	277cc3f9bf	update aishell-1 recipe with k2.rnnt_loss (#215 ) * update aishell-1 recipe with k2.rnnt_loss * fix flak8 style * typo * add pretrained model link to result.md	2022-02-19 15:56:39 +08:00
Duo Ma	827b9df51a	Updated Aishell-1 transducer-stateless result (#217 ) * Update RESULTS.md * Update RESULTS.md	2022-02-19 15:56:04 +08:00
Wei Kang	5ae80dfca7	Minor fixes (#193 )	2022-01-27 18:01:17 +08:00
Lucky Wong	6caff5fd38	minor fixes (#169 ) * Fix no attribute 'data' error. * minor fixes	2022-01-06 10:24:16 +08:00
pingfengluo	ea8af0ee9a	add transducer_stateless with char unit to AIShell (#164 )	2022-01-01 18:32:08 +08:00
Wei Kang	76a51bf037	Fix aishell tdnn_lstm_ctc decoding (#149 )	2021-12-14 14:42:58 +08:00
pingfengluo	d1adc25338	Update AIShell recipe result (#140 ) * add MMI to AIShell * fix MMI decode graph * export model * typo * fix code style * typo * fix data prepare to just use train text by uid * use a faster way to get the intersection of train and aishell_transcript_v0.8.txt * update AIShell result * update * typo	2021-12-04 14:43:04 +08:00
pingfengluo	89b84208aa	add phone based LF-MMI training to AIShell recipe (#137 ) * add MMI to AIShell * fix MMI decode graph * export model * typo * fix code style * typo	2021-12-02 12:32:23 +08:00
Lucky Wong	769a9791ec	Fix no attribute 'data' error. (#129 )	2021-11-22 18:31:04 +08:00
Wei Kang	4151cca147	Add torch script support for Aishell and update documents (#124 ) * Add aishell recipe * Remove unnecessary code and update docs * adapt to k2 v1.7, add docs and results * Update conformer ctc model * Update docs, pretrained.py & results * Fix code style * Fix code style * Fix code style * Minor fix * Minor fix * Fix pretrained.py * Update pretrained model & corresponding docs * Export torch script model for Aishell * Add C++ deployment docs * Minor fixes * Fix unit test * Update Readme	2021-11-19 16:37:05 +08:00
Wei Kang	30c43b7f69	Add aishell recipe (#30 ) * Add aishell recipe * Remove unnecessary code and update docs * adapt to k2 v1.7, add docs and results * Update conformer ctc model * Update docs, pretrained.py & results * Fix code style * Fix code style * Fix code style * Minor fix * Minor fix * Fix pretrained.py * Update pretrained model & corresponding docs	2021-11-18 10:00:47 +08:00

82 Commits