icefall

mirror of https://github.com/k2-fsa/icefall.git synced 2025-08-09 18:12:19 +00:00

Author	SHA1	Message	Date
Fangjun Kuang	6af15914fa	Validate generated manifest files. (#338 )	2022-05-03 07:02:54 +08:00
Wei Kang	021c79824e	Add LG decoding (#277 ) * Add LG decoding * Add log weight pushing * Minor fixes	2022-04-19 17:23:46 +08:00
Fangjun Kuang	2f4e71f433	Add force alignment for stateless transducer. (#239 ) * Add force alignment for stateless transducer. * Add more documentation. * Compute word starting time from framewise token alignment. * Update README to include force alignment information. * Fix typos. * Fix more typos. * Fixes after review.	2022-03-12 16:16:15 +08:00
Fangjun Kuang	53b79fafa7	Add MMI training with word pieces as modelling unit. (#6 ) * Fix an error in TDNN-LSTM training. * WIP: Refactoring * Refactor transformer.py * Remove unused code. * Minor fixes. * Fix decoder padding mask. * Add MMI training with word pieces. * Remove unused files. * Minor fixes. * Refactoring. * Minor fixes. * Use pre-computed alignments in LF-MMI training. * Minor fixes. * Update decoding script. * Add doc about how to check and use extracted alignments. * Fix style issues. * Fix typos. * Fix style issues. * Disable macOS tests for now.	2021-10-18 15:20:32 +08:00
Fangjun Kuang	4890e27b45	Extract framewise alignment information using CTC decoding (#39 ) * Use new APIs with k2.RaggedTensor * Fix style issues. * Update the installation doc, saying it requires at least k2 v1.7 * Extract framewise alignment information using CTC decoding. * Print environment information. Print information about k2, lhotse, PyTorch, and icefall. * Fix CI. * Fix CI. * Compute framewise alignment information of the LibriSpeech dataset. * Update comments for the time to compute alignments of train-960. * Preserve cut id in mix cut transformer. * Minor fixes. * Add doc about how to extract framewise alignments.	2021-10-18 14:24:33 +08:00
Mingshuang Luo	597c5efdb1	Use LossRecord to record and print the loss for the training process (#62 ) * Update index.rst (AS->ASR) * Update conformer_ctc.rst (pretraind->pretrained) * Fix some spelling errors. * Fix some spelling errors. * Use LossRecord to record and print loss in the training process * Change the name "LossRecord" to "MetricsTracker"	2021-10-12 15:58:03 +08:00
Fangjun Kuang	beb54ddb61	Support torch script. (#65 ) * WIP: Support torchscript. * Minor fixes. * Fix style issues. * Add documentation about how to deploy a trained model.	2021-10-12 14:55:05 +08:00
Fangjun Kuang	1c3b13c7eb	Minor fixes. (#9 )	2021-08-16 19:01:25 +08:00
Fangjun Kuang	5a0b9bcb23	Refactoring (#4 ) * Fix an error in TDNN-LSTM training. * WIP: Refactoring * Refactor transformer.py * Remove unused code. * Minor fixes.	2021-08-04 14:53:02 +08:00
Fangjun Kuang	acc63a9172	WIP: Add BPE training code.	2021-07-29 20:23:52 +08:00
Fangjun Kuang	4ccae509d3	WIP: Begin to add BPE decoding	2021-07-26 20:06:58 +08:00
Fangjun Kuang	00f8371f37	begin to add LM rescoring.	2021-07-24 18:24:04 +08:00
Fangjun Kuang	f3542c7793	Add CTC training.	2021-07-24 17:13:20 +08:00
Fangjun Kuang	a01d08f73c	Add self-loops to propagate disambiguation symbols.	2021-07-21 13:12:20 +08:00
Fangjun Kuang	e005ea062c	Minor fixes after review.	2021-07-20 10:02:20 +08:00
Fangjun Kuang	f25eedf2d4	Fixes after review.	2021-07-20 00:14:24 +08:00
Fangjun Kuang	0b19aa09c1	Compute features of librispeech and musan.	2021-07-19 23:35:32 +08:00
Fangjun Kuang	40eed74460	Download LM for LibriSpeech.	2021-07-15 21:09:14 +08:00

18 Commits