icefall

Author	SHA1	Message	Date
Mingshuang Luo	bdd890bab9	Add files via upload	2021-09-29 19:23:11 +08:00
Mingshuang Luo	f837818af3	Add files via upload	2021-09-29 19:11:14 +08:00
Mingshuang Luo	cca1399e0f	Delete train.py	2021-09-29 19:10:49 +08:00
Mingshuang Luo	67762e308f	Add files via upload	2021-09-29 19:10:33 +08:00
Mingshuang Luo	49af342863	Delete train.py	2021-09-29 19:10:08 +08:00
Mingshuang Luo	7278b699b3	Add files via upload	2021-09-29 19:09:36 +08:00
Mingshuang Luo	3c5b49006e	Delete train.py	2021-09-29 19:09:11 +08:00
Mingshuang Luo	852efe1b87	Rename train_lossrecord.py to train.py	2021-09-29 17:37:21 +08:00
Mingshuang Luo	8b6c139623	Add files via upload	2021-09-29 17:36:52 +08:00
Mingshuang Luo	bc101093c6	Delete train.py	2021-09-29 17:36:30 +08:00
Mingshuang Luo	00b2d4c9c7	Rename train_lossrecord_tdnn.py to train.py	2021-09-29 17:35:58 +08:00
Mingshuang Luo	84eb064675	Add files via upload	2021-09-29 17:35:24 +08:00
Mingshuang Luo	f03fb67aa1	Delete train.py	2021-09-29 17:35:03 +08:00
Mingshuang Luo	426418c703	Rename train_lossrecord_con.py to train.py	2021-09-29 17:34:34 +08:00
Mingshuang Luo	787ca3b89c	Add files via upload	2021-09-29 17:33:56 +08:00
Mingshuang Luo	6de7f0c062	Delete train.py	2021-09-29 17:33:32 +08:00
Mingshuang Luo	de9b2a9cd1	Update train.py	2021-09-29 16:54:21 +08:00
Mingshuang Luo	1c0792796b	Update train.py	2021-09-29 12:57:23 +08:00
Mingshuang Luo	43cf016ae5	Update train.py	2021-09-29 12:56:47 +08:00
Mingshuang Luo	34e36a926b	Update train.py	2021-09-29 12:55:42 +08:00
Mingshuang Luo	597ff01158	Update train.py	2021-09-29 12:51:38 +08:00
Mingshuang Luo	0fa46bf68a	Update train.py	2021-09-29 12:49:59 +08:00
Mingshuang Luo	e74e75acc6	Use LossRecord to record and print loss for the training process	2021-09-29 10:08:38 +08:00
Mingshuang Luo	73f21a379b	Merge branch 'k2-fsa:master' into master	2021-09-27 15:34:08 +08:00
Fangjun Kuang	707d7017a7	Support pure ctc decoding requiring neither a lexicon nor an n-gram LM (#58 ) * Rename lattice_score_scale to nbest_scale. * Support pure CTC decoding requiring neither a lexicion nor an n-gram LM. * Fix style issues. * Fix a typo. * Minor fixes.	2021-09-26 14:21:49 +08:00
Mingshuang Luo	6c4a58273f	Fix some spelling errors.	2021-09-26 12:55:51 +08:00
Mingshuang Luo	6abd1bcd0a	Fix some spelling errors.	2021-09-26 12:54:35 +08:00
Fangjun Kuang	a80e58e15d	Refactor decode.py to make it more readable and more modular. (#44 ) * Refactor decode.py to make it more readable and more modular. * Fix an error. Nbest.fsa should always have token IDs as labels and word IDs as aux_labels. * Add nbest decoding. * Compute edit distance with k2. * Refactor nbest-oracle. * Add rescore with nbest lists. * Add whole-lattice rescoring. * Add rescoring with attention decoder. * Refactoring. * Fixes after refactoring. * Fix a typo. * Minor fixes. * Replace [] with () for shapes. * Use k2 v1.9 * Use Levenshtein graphs/alignment from k2 v1.9 * [doc] Require k2 >= v1.9 * Minor fixes.	2021-09-20 15:44:54 +08:00
Wei Kang	9a6e0489c8	update api for RaggedTensor (#45 ) * Fix code style * update k2 version in CI * fix compile hlg	2021-09-14 16:39:56 +08:00
Wei Kang	24656e9749	Update docs and remove unnecessary arguments (#42 ) * Fix typo in docs * Update docs and remove unnecessary arguments * Fix code style	2021-09-13 18:28:57 +08:00
Fangjun Kuang	f792b466bf	Change default value of lattice-score-scale from 1.0 to 0.5 (#41 ) * Change the default value of lattice-score-scale from 1.0 to 0.5 * Fix CI.	2021-09-13 10:49:18 +08:00
Fangjun Kuang	7f8e3a673a	Add commands for reproducing. (#40 ) * Add commands for reproducing. * Use --bucketing-sampler by default.	2021-09-09 13:50:31 +08:00
Fangjun Kuang	abadc71415	Use new APIs with k2.RaggedTensor (#38 ) * Use new APIs with k2.RaggedTensor * Fix style issues. * Update the installation doc, saying it requires at least k2 v1.7 * Use k2 v1.7	2021-09-08 14:55:30 +08:00
Fangjun Kuang	184dbb3ea5	Add documentation about code style and creating new recipes. (#27 )	2021-08-25 14:48:41 +08:00
Fangjun Kuang	96e7f5c7ea	Release v0.1 (#26 )	2021-08-24 21:30:30 +08:00
pkufool	f4223ee110	Add TDNN-LSTM-CTC Results (#25 ) * Add tdnn-lstm pretrained model and results * Add docs for TDNN-LSTM-CTC * Minor fix * Fix typo * Fix style checking	2021-08-24 21:09:27 +08:00
Fangjun Kuang	1bd5dcc8ac	WIP: Add doc for the LibriSpeech recipe. (#24 ) * WIP: Add doc for the LibriSpeech recipe. * Add more doc for LibriSpeech recipe. * Add more doc for the LibriSpeech recipe. * More doc.	2021-08-24 20:28:32 +08:00
Fangjun Kuang	01da00dca0	WIP: Add documentation. (#22 ) * Begin to add documentation. * WIP: Add documentation. * Fix a typo. * Add more doc for the recipe yesno. * Add more doc for the yesno recipe.	2021-08-24 14:28:08 +08:00
Fangjun Kuang	57cb611665	[yesno] Remove padding in TDNN (#21 ) * Disable SpecAug for yesno. Also replace Adam with SGD. * Remove padding in the model to make the results reproducible.	2021-08-23 15:59:36 +08:00
Fangjun Kuang	6c2c9b9d74	Add recipe for the yes_no dataset. (#16 ) * Add recipe for the yes_no dataset. * Refactoring: Remove unused code. * Add Colab notebook for the yesno dataset. * Add GitHub actions to run yesno. * Fix a typo. * Minor fixes. * Train more epochs for GitHub actions. * Minor fixes. * Minor fixes. * Fix style issues.	2021-08-23 11:36:29 +08:00
pkufool	19c4214958	Fix code style and add copyright. (#18 ) * Fix style and add copyright * Minor fix * Remove duplicate lines * Reformat conformer.py by black * Reformat code style with black. * Fix github workflows * Fix lhotse installation * Install icefall requirements * Update k2 version, remove lhotse from test workflow	2021-08-23 10:43:59 +08:00
Fangjun Kuang	8469f9ae0a	Refactor asr_datamodule. (#15 ) * WIP: Refactor asr_datamodule. * Fixes after review. * Minor fixes.	2021-08-21 09:53:46 +08:00
Fangjun Kuang	0b656e4e1c	Add a link to Colab. (#14 ) It demonstrates the usages of pre-trained models.	2021-08-20 15:43:25 +08:00
Fangjun Kuang	9d0cc9d829	Support computing nbest oracle WER. (#10 ) * Support computing nbest oracle WER. * Add scale to all nbest based decoding/rescoring methods. * Add script to run pretrained models. * Use torchaudio to extract features. * Support decoding multiple files at the same time. Also, use kaldifeat for feature extraction. * Support decoding with LM rescoring and attention-decoder rescoring. * Minor fixes. * Replace scale with lattice-score-scale. * Add usage example with a provided pretrained model.	2021-08-20 11:53:37 +08:00
pkufool	ef233486ae	The training script produce WER of 2.57% on librispeech test-clean (#13 ) * Add grad_clip and weight-decay, small fix of dataloader and masking * Add RESULTS.md	2021-08-20 10:08:08 +08:00
Fangjun Kuang	caa0b9e942	Fix an error in displaying decoding process. (#12 )	2021-08-19 14:54:01 +08:00
Fangjun Kuang	1c3b13c7eb	Minor fixes. (#9 )	2021-08-16 19:01:25 +08:00
Fangjun Kuang	12a2fd023e	Add doc about installation and usage (#7 ) * Add readme. * Add TOC. * fix typos * Minor fixes after review.	2021-08-12 12:44:04 +08:00
Fangjun Kuang	5a0b9bcb23	Refactoring (#4 ) * Fix an error in TDNN-LSTM training. * WIP: Refactoring * Refactor transformer.py * Remove unused code. * Minor fixes.	2021-08-04 14:53:02 +08:00
Fangjun Kuang	398ed80d7a	Minor fixes to support DDP training.	2021-07-31 15:26:57 +08:00

1 2

70 Commits