icefall

mirror of https://github.com/k2-fsa/icefall.git synced 2025-08-08 09:32:20 +00:00

Author	SHA1	Message	Date
Yifan Yang	89728dd4f8	Refactor data preparation for GigaSpeech recipe (#1986 )	2025-07-10 11:17:37 +08:00
Wei Kang	711d6bc462	Refactor prepare.sh in librispeech (#1493 ) * Refactor prepare.sh in librispeech, break it into three parts, prepare.sh (basic, minimal requirement for transducer), prepare_lm.sh (ngram & nnlm staff), prepare_mmi.sh (for MMI training).	2024-02-09 10:44:19 +08:00
zr_jin	1814bbb0e7	typo fixed (#1334 )	2023-10-25 00:03:33 +08:00
zr_jin	162ceaf4b3	fixes for data preparation (#1307 ) Issue: #1306	2023-10-12 17:05:41 +08:00
Fangjun Kuang	772ee3955b	Support HLG decoding using OpenFst with kaldi decoders (#1275 )	2023-09-27 14:49:27 +08:00
Fangjun Kuang	2318c3fbd0	Support CTC decoding on CPU using OpenFst and kaldi decoders. (#1244 )	2023-09-26 16:36:19 +08:00
Fangjun Kuang	c0de78d3c0	Add data preparation for the MuST-C speech translation corpus (#1107 )	2023-06-05 15:49:41 +08:00
Yuanhang Zhang	b0228c536e	Fix typo in librispeech OpenFST-based HLG preparation script (#1028 )	2023-04-28 19:52:32 +08:00
Fangjun Kuang	dfbcf606e7	small fixes to prepare.sh (#789 )	2022-12-27 09:25:42 +08:00
Fangjun Kuang	fbc8894804	Add comment for compile_hlg_using_openfst.py (#762 )	2022-12-14 13:47:23 +08:00
Fangjun Kuang	4501821fd9	Support using OpenFst to compile HLG. (#606 ) * Support using OpenFst to compile HLG. * Fix style issues	2022-12-09 16:46:44 +08:00
huangruizhe	6693d907d3	shuffle full Librispeech data (#574 ) * shuffled full/partial librispeech data * fixed the code style issue * Shuffled full librispeech data off-line * Fixed style, addressed comments, and removed redandunt codes * Used the suggested version of black * Propagated the changes to other folders for librispeech (except conformer_mmi and streaming_conformer_ctc)	2022-11-27 11:26:09 +08:00
marcoyang	53454701cb	fix segmentation fault	2022-11-22 11:39:21 +08:00
ezerhouni	0475d75d15	[Ready to be merged] Add RNN-LM to Conformer-CTC decoding (#439 )	2022-06-23 19:37:03 +08:00
Fangjun Kuang	f1abce72f8	Use jsonl for CutSet in the LibriSpeech recipe. (#397 ) * Use jsonl for cutsets in the librispeech recipe. * Use lazy cutset for all recipes. * More fixes to use lazy CutSet. * Remove force=True from logging to support Python < 3.8 * Minor fixes. * Fix style issues.	2022-06-06 10:19:16 +08:00
Fangjun Kuang	0f180b3ce2	Validate that there are no OOV tokens in BPE-based lexicons. (#359 ) * Validate that there are no OOV tokens in BPE-based lexicons. * Typo fixes.	2022-05-13 14:00:35 +08:00
Fangjun Kuang	6af15914fa	Validate generated manifest files. (#338 )	2022-05-03 07:02:54 +08:00
Wei Kang	021c79824e	Add LG decoding (#277 ) * Add LG decoding * Add log weight pushing * Minor fixes	2022-04-19 17:23:46 +08:00
Fangjun Kuang	2f4e71f433	Add force alignment for stateless transducer. (#239 ) * Add force alignment for stateless transducer. * Add more documentation. * Compute word starting time from framewise token alignment. * Update README to include force alignment information. * Fix typos. * Fix more typos. * Fixes after review.	2022-03-12 16:16:15 +08:00
Fangjun Kuang	53b79fafa7	Add MMI training with word pieces as modelling unit. (#6 ) * Fix an error in TDNN-LSTM training. * WIP: Refactoring * Refactor transformer.py * Remove unused code. * Minor fixes. * Fix decoder padding mask. * Add MMI training with word pieces. * Remove unused files. * Minor fixes. * Refactoring. * Minor fixes. * Use pre-computed alignments in LF-MMI training. * Minor fixes. * Update decoding script. * Add doc about how to check and use extracted alignments. * Fix style issues. * Fix typos. * Fix style issues. * Disable macOS tests for now.	2021-10-18 15:20:32 +08:00
Fangjun Kuang	4890e27b45	Extract framewise alignment information using CTC decoding (#39 ) * Use new APIs with k2.RaggedTensor * Fix style issues. * Update the installation doc, saying it requires at least k2 v1.7 * Extract framewise alignment information using CTC decoding. * Print environment information. Print information about k2, lhotse, PyTorch, and icefall. * Fix CI. * Fix CI. * Compute framewise alignment information of the LibriSpeech dataset. * Update comments for the time to compute alignments of train-960. * Preserve cut id in mix cut transformer. * Minor fixes. * Add doc about how to extract framewise alignments.	2021-10-18 14:24:33 +08:00
Mingshuang Luo	597c5efdb1	Use LossRecord to record and print the loss for the training process (#62 ) * Update index.rst (AS->ASR) * Update conformer_ctc.rst (pretraind->pretrained) * Fix some spelling errors. * Fix some spelling errors. * Use LossRecord to record and print loss in the training process * Change the name "LossRecord" to "MetricsTracker"	2021-10-12 15:58:03 +08:00
Fangjun Kuang	beb54ddb61	Support torch script. (#65 ) * WIP: Support torchscript. * Minor fixes. * Fix style issues. * Add documentation about how to deploy a trained model.	2021-10-12 14:55:05 +08:00
Fangjun Kuang	1c3b13c7eb	Minor fixes. (#9 )	2021-08-16 19:01:25 +08:00
Fangjun Kuang	5a0b9bcb23	Refactoring (#4 ) * Fix an error in TDNN-LSTM training. * WIP: Refactoring * Refactor transformer.py * Remove unused code. * Minor fixes.	2021-08-04 14:53:02 +08:00
Fangjun Kuang	acc63a9172	WIP: Add BPE training code.	2021-07-29 20:23:52 +08:00
Fangjun Kuang	4ccae509d3	WIP: Begin to add BPE decoding	2021-07-26 20:06:58 +08:00
Fangjun Kuang	00f8371f37	begin to add LM rescoring.	2021-07-24 18:24:04 +08:00
Fangjun Kuang	f3542c7793	Add CTC training.	2021-07-24 17:13:20 +08:00
Fangjun Kuang	a01d08f73c	Add self-loops to propagate disambiguation symbols.	2021-07-21 13:12:20 +08:00
Fangjun Kuang	e005ea062c	Minor fixes after review.	2021-07-20 10:02:20 +08:00
Fangjun Kuang	f25eedf2d4	Fixes after review.	2021-07-20 00:14:24 +08:00
Fangjun Kuang	0b19aa09c1	Compute features of librispeech and musan.	2021-07-19 23:35:32 +08:00
Fangjun Kuang	40eed74460	Download LM for LibriSpeech.	2021-07-15 21:09:14 +08:00

34 Commits