icefall

mirror of https://github.com/k2-fsa/icefall.git synced 2025-12-11 06:55:27 +00:00

Author	SHA1	Message	Date
Daniel Povey	d187ad8b73	Change max_frames from 0.2 to 0.15	2022-02-11 16:24:17 +08:00
Daniel Povey	4cd2c02fff	Fix num_time_masks code; revert 0.8 to 0.9	2022-02-10 15:53:11 +08:00
Daniel Povey	c170c53006	Change p=0.9 to p=0.8 in SpecAug	2022-02-10 14:59:14 +08:00
Daniel Povey	8aa50df4f0	Change p=0.5->0.9, mask_fraction 0.3->0.2	2022-02-09 22:52:53 +08:00
Daniel Povey	dd19a6a2b1	Fix to num_feature_masks bug I introduced; reduce max_frames_mask_fraction 0.4->0.3	2022-02-09 12:02:19 +08:00
Daniel Povey	bd36216e8c	Use much more aggressive SpecAug setup	2022-02-08 21:55:20 +08:00
Mingshuang Luo	3323cabf46	Experiments based on SpecAugment change	2022-02-08 14:25:31 +08:00
Fangjun Kuang	ec591698b0	Associate a cut with token alignment (without repeats) (#125) * WIP: Associate a cut with token alignment (without repeats) * Save framewise alignments with/without repeats. * Minor fixes.	2021-11-29 18:50:54 +08:00
Wei Kang	4151cca147	Add torch script support for Aishell and update documents (#124) * Add aishell recipe * Remove unnecessary code and update docs * adapt to k2 v1.7, add docs and results * Update conformer ctc model * Update docs, pretrained.py & results * Fix code style * Fix code style * Fix code style * Minor fix * Minor fix * Fix pretrained.py * Update pretrained model & corresponding docs * Export torch script model for Aishell * Add C++ deployment docs * Minor fixes * Fix unit test * Update Readme	2021-11-19 16:37:05 +08:00
Fangjun Kuang	68506609ad	Set fsa.properties to None after changing its labels in-place. (#121)	2021-11-16 23:11:30 +08:00
Fangjun Kuang	8cb7f712e4	Use GPU for averaging checkpoints if possible. (#84)	2021-10-26 17:10:04 +08:00
Fangjun Kuang	4890e27b45	Extract framewise alignment information using CTC decoding (#39) * Use new APIs with k2.RaggedTensor * Fix style issues. * Update the installation doc, saying it requires at least k2 v1.7 * Extract framewise alignment information using CTC decoding. * Print environment information. Print information about k2, lhotse, PyTorch, and icefall. * Fix CI. * Fix CI. * Compute framewise alignment information of the LibriSpeech dataset. * Update comments for the time to compute alignments of train-960. * Preserve cut id in mix cut transformer. * Minor fixes. * Add doc about how to extract framewise alignments.	2021-10-18 14:24:33 +08:00
Mingshuang Luo	391432b356	Update train.py ("10"--->"params.log_interval") (#76) * Update train.py * Update train.py * Update train.py	2021-10-12 21:30:31 +08:00
Mingshuang Luo	597c5efdb1	Use LossRecord to record and print the loss for the training process (#62) * Update index.rst (AS->ASR) * Update conformer_ctc.rst (pretraind->pretrained) * Fix some spelling errors. * Fix some spelling errors. * Use LossRecord to record and print loss in the training process * Change the name "LossRecord" to "MetricsTracker"	2021-10-12 15:58:03 +08:00
Piotr Żelasko	069ebaf9ba	Reformatting	2021-10-09 14:45:46 +00:00
Piotr Żelasko	b682467e4d	Use BucketingSampler for dev and test data	2021-10-08 22:32:13 -04:00
Fangjun Kuang	707d7017a7	Support pure ctc decoding requiring neither a lexicon nor an n-gram LM (#58) * Rename lattice_score_scale to nbest_scale. * Support pure CTC decoding requiring neither a lexicion nor an n-gram LM. * Fix style issues. * Fix a typo. * Minor fixes.	2021-09-26 14:21:49 +08:00
Fangjun Kuang	a80e58e15d	Refactor decode.py to make it more readable and more modular. (#44) * Refactor decode.py to make it more readable and more modular. * Fix an error. Nbest.fsa should always have token IDs as labels and word IDs as aux_labels. * Add nbest decoding. * Compute edit distance with k2. * Refactor nbest-oracle. * Add rescore with nbest lists. * Add whole-lattice rescoring. * Add rescoring with attention decoder. * Refactoring. * Fixes after refactoring. * Fix a typo. * Minor fixes. * Replace [] with () for shapes. * Use k2 v1.9 * Use Levenshtein graphs/alignment from k2 v1.9 * [doc] Require k2 >= v1.9 * Minor fixes.	2021-09-20 15:44:54 +08:00
Wei Kang	24656e9749	Update docs and remove unnecessary arguments (#42) * Fix typo in docs * Update docs and remove unnecessary arguments * Fix code style	2021-09-13 18:28:57 +08:00
Fangjun Kuang	f792b466bf	Change default value of lattice-score-scale from 1.0 to 0.5 (#41) * Change the default value of lattice-score-scale from 1.0 to 0.5 * Fix CI.	2021-09-13 10:49:18 +08:00
Fangjun Kuang	7f8e3a673a	Add commands for reproducing. (#40) * Add commands for reproducing. * Use --bucketing-sampler by default.	2021-09-09 13:50:31 +08:00
Fangjun Kuang	abadc71415	Use new APIs with k2.RaggedTensor (#38) * Use new APIs with k2.RaggedTensor * Fix style issues. * Update the installation doc, saying it requires at least k2 v1.7 * Use k2 v1.7	2021-09-08 14:55:30 +08:00
Fangjun Kuang	184dbb3ea5	Add documentation about code style and creating new recipes. (#27)	2021-08-25 14:48:41 +08:00
pkufool	f4223ee110	Add TDNN-LSTM-CTC Results (#25) * Add tdnn-lstm pretrained model and results * Add docs for TDNN-LSTM-CTC * Minor fix * Fix typo * Fix style checking	2021-08-24 21:09:27 +08:00
Fangjun Kuang	1bd5dcc8ac	WIP: Add doc for the LibriSpeech recipe. (#24) * WIP: Add doc for the LibriSpeech recipe. * Add more doc for LibriSpeech recipe. * Add more doc for the LibriSpeech recipe. * More doc.	2021-08-24 20:28:32 +08:00
Fangjun Kuang	6c2c9b9d74	Add recipe for the yes_no dataset. (#16) * Add recipe for the yes_no dataset. * Refactoring: Remove unused code. * Add Colab notebook for the yesno dataset. * Add GitHub actions to run yesno. * Fix a typo. * Minor fixes. * Train more epochs for GitHub actions. * Minor fixes. * Minor fixes. * Fix style issues.	2021-08-23 11:36:29 +08:00
pkufool	19c4214958	Fix code style and add copyright. (#18) * Fix style and add copyright * Minor fix * Remove duplicate lines * Reformat conformer.py by black * Reformat code style with black. * Fix github workflows * Fix lhotse installation * Install icefall requirements * Update k2 version, remove lhotse from test workflow	2021-08-23 10:43:59 +08:00
Fangjun Kuang	8469f9ae0a	Refactor asr_datamodule. (#15) * WIP: Refactor asr_datamodule. * Fixes after review. * Minor fixes.	2021-08-21 09:53:46 +08:00
Fangjun Kuang	caa0b9e942	Fix an error in displaying decoding process. (#12)	2021-08-19 14:54:01 +08:00
Fangjun Kuang	12a2fd023e	Add doc about installation and usage (#7) * Add readme. * Add TOC. * fix typos * Minor fixes after review.	2021-08-12 12:44:04 +08:00
Fangjun Kuang	5a0b9bcb23	Refactoring (#4) * Fix an error in TDNN-LSTM training. * WIP: Refactoring * Refactor transformer.py * Remove unused code. * Minor fixes.	2021-08-04 14:53:02 +08:00
Fangjun Kuang	acc63a9172	WIP: Add BPE training code.	2021-07-29 20:23:52 +08:00
Fangjun Kuang	f65854cca5	Add BPE decoding results.	2021-07-27 17:38:47 +08:00
Fangjun Kuang	d3101fb005	Fix loading checkpoint in DDP training.	2021-07-26 08:08:14 +08:00
Fangjun Kuang	78bb65ed78	Fix an error in DDP training.	2021-07-25 22:33:09 +08:00
Fangjun Kuang	8055bf31a0	Support DDP training.	2021-07-25 21:40:09 +08:00
Fangjun Kuang	4a66712406	Add LM rescoring.	2021-07-25 18:21:26 +08:00
Fangjun Kuang	6f9fe5b906	Refactor decoding code.	2021-07-24 22:23:50 +08:00
Fangjun Kuang	f3542c7793	Add CTC training.	2021-07-24 17:13:20 +08:00

39 Commits