zr_jin
d2bd0933b1
Compatibility with the latest Lhotse ( #1314 )
2023-10-17 21:22:32 +08:00
zr_jin
ef5da4824d
formatted the entire LibriSpeech recipe ( #1270 )
...
* formatted the entire librispeech recipe
* minor updates
2023-09-24 17:31:01 +08:00
zr_jin
7cc2dae940
Fixes to incorporate with the latest Lhotse release ( #1249 )
2023-09-13 12:39:49 +08:00
Fangjun Kuang
5b50ffda54
support using mini librispeech in training ( #1048 )
...
* support mini librispeech in training
* update onnx export doc
2023-05-09 15:10:06 +08:00
xiabingquan
cba6ecc1d1
Update README.md ( #894 )
2023-02-09 23:54:45 +08:00
huangruizhe
6693d907d3
shuffle full Librispeech data ( #574 )
...
* shuffled full/partial librispeech data
* fixed the code style issue
* Shuffled full librispeech data off-line
* Fixed style, addressed comments, and removed redundant code
* Used the suggested version of black
* Propagated the changes to other folders for librispeech (except conformer_mmi and streaming_conformer_ctc)
2022-11-27 11:26:09 +08:00
Desh Raj
d31db01037
manual correction of black formatting
2022-11-17 14:18:05 -05:00
Desh Raj
107df3b115
apply black on all files
2022-11-17 09:42:17 -05:00
Fangjun Kuang
60317120ca
Revert "Apply new Black style changes"
2022-11-17 20:19:32 +08:00
Desh Raj
d110b04ad3
apply new black formatting to all files
2022-11-16 13:06:43 -05:00
Fangjun Kuang
d1f16a04bd
fix type hints for decode.py ( #623 )
2022-10-18 06:56:12 +08:00
Wei Kang
5c17255eec
Sort results to make it more convenient to compare decoding results ( #522 )
...
* Sort result to make it more convenient to compare decoding results
* Add cut_id to recognition results
* add cut_id to results for all recipes
* Fix torch.jit.script
* Fix comments
* Minor fixes
* Fix torch.jit.tracing for PyTorch versions before v1.9.0
2022-08-12 07:12:50 +08:00
Zengwei Yao
a4dd273776
fix about tensorboard ( #516 )
...
* fix metricstracker
* fix style
2022-08-04 19:57:12 +08:00
Fangjun Kuang
dbda1644b5
Replace load_manifest_lazy with load_manifest for MUSAN. ( #412 )
2022-06-09 11:42:18 +08:00
Fangjun Kuang
f1abce72f8
Use jsonl for CutSet in the LibriSpeech recipe. ( #397 )
...
* Use jsonl for cutsets in the librispeech recipe.
* Use lazy cutset for all recipes.
* More fixes to use lazy CutSet.
* Remove force=True from logging to support Python < 3.8
* Minor fixes.
* Fix style issues.
2022-06-06 10:19:16 +08:00
Fangjun Kuang
fbfc98f1d3
Add streaming Emformer stateless RNN-T. ( #390 )
...
* Add streaming Emformer stateless RNN-T.
* Update results for streaming Emformer.
* Minor fixes.
2022-06-01 14:31:47 +08:00
LIyong.Guo
c4ee2bc0af
[Ready to merge] stateless6: stateless4 + hubert distillation. ( #387 )
...
* a copy of stateless4 as base
* distillation with hubert
* fix typo
* example usage
* usage
* Update egs/librispeech/ASR/pruned_transducer_stateless6/hubert_xlarge.py
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
* fix comment
* add results of 100 hours
* Update egs/librispeech/ASR/pruned_transducer_stateless6/hubert_xlarge.py
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
* Update egs/librispeech/ASR/pruned_transducer_stateless6/hubert_xlarge.py
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
* check fairseq and quantization
* a short intro to distillation framework
* Update egs/librispeech/ASR/pruned_transducer_stateless6/hubert_xlarge.py
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
* add intro of stateless6 in README
* fix type error of dst_manifest_dir
* Update egs/librispeech/ASR/pruned_transducer_stateless6/hubert_xlarge.py
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
* make export.py call stateless6/train.py instead of stateless2/train.py
* update results by stateless6
* adjust results format
* fix typo
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2022-05-28 12:37:50 +08:00
Fangjun Kuang
e7493ede90
Don't use a lambda for dataloader's worker_init_fn. ( #284 )
...
* Don't use a lambda for dataloader's worker_init_fn.
2022-03-31 20:32:00 +08:00
Fangjun Kuang
9a11808ed3
Set the seed for dataloader. ( #282 )
...
Also, suppress torch warnings about division by truncation.
2022-03-31 16:48:46 +08:00
Fangjun Kuang
ae564f91e6
Periodically saving checkpoint after processing given number of batches ( #259 )
...
* Periodically saving checkpoint after processing given number of batches.
2022-03-20 23:51:33 +08:00
Mingshuang Luo
d0d806560f
Change for asr_datamodule.py ( #241 )
...
* change for asr_datamodule.py
* fix style check
* do a fix
2022-03-14 00:30:58 +08:00
Fangjun Kuang
2f4e71f433
Add force alignment for stateless transducer. ( #239 )
...
* Add force alignment for stateless transducer.
* Add more documentation.
* Compute word starting time from framewise token alignment.
* Update README to include force alignment information.
* Fix typos.
* Fix more typos.
* Fixes after review.
2022-03-12 16:16:15 +08:00
Fangjun Kuang
1c35ae1dba
Reset seed at the beginning of each epoch. ( #221 )
...
* Reset seed at the beginning of each epoch.
* Use a different seed for each epoch.
2022-02-21 15:16:39 +08:00
Fangjun Kuang
ec591698b0
Associate a cut with token alignment (without repeats) ( #125 )
...
* WIP: Associate a cut with token alignment (without repeats)
* Save framewise alignments with/without repeats.
* Minor fixes.
2021-11-29 18:50:54 +08:00
Wei Kang
4151cca147
Add torch script support for Aishell and update documents ( #124 )
...
* Add aishell recipe
* Remove unnecessary code and update docs
* adapt to k2 v1.7, add docs and results
* Update conformer ctc model
* Update docs, pretrained.py & results
* Fix code style
* Fix code style
* Fix code style
* Minor fix
* Minor fix
* Fix pretrained.py
* Update pretrained model & corresponding docs
* Export torch script model for Aishell
* Add C++ deployment docs
* Minor fixes
* Fix unit test
* Update Readme
2021-11-19 16:37:05 +08:00
Fangjun Kuang
68506609ad
Set fsa.properties to None after changing its labels in-place. ( #121 )
2021-11-16 23:11:30 +08:00
Fangjun Kuang
8cb7f712e4
Use GPU for averaging checkpoints if possible. ( #84 )
2021-10-26 17:10:04 +08:00
Fangjun Kuang
4890e27b45
Extract framewise alignment information using CTC decoding ( #39 )
...
* Use new APIs with k2.RaggedTensor
* Fix style issues.
* Update the installation doc, saying it requires at least k2 v1.7
* Extract framewise alignment information using CTC decoding.
* Print environment information.
Print information about k2, lhotse, PyTorch, and icefall.
* Fix CI.
* Fix CI.
* Compute framewise alignment information of the LibriSpeech dataset.
* Update comments for the time to compute alignments of train-960.
* Preserve cut id in mix cut transformer.
* Minor fixes.
* Add doc about how to extract framewise alignments.
2021-10-18 14:24:33 +08:00
Mingshuang Luo
391432b356
Update train.py ("10"--->"params.log_interval") ( #76 )
...
* Update train.py
* Update train.py
* Update train.py
2021-10-12 21:30:31 +08:00
Mingshuang Luo
597c5efdb1
Use LossRecord to record and print the loss for the training process ( #62 )
...
* Update index.rst (AS->ASR)
* Update conformer_ctc.rst (pretraind->pretrained)
* Fix some spelling errors.
* Fix some spelling errors.
* Use LossRecord to record and print loss in the training process
* Change the name "LossRecord" to "MetricsTracker"
2021-10-12 15:58:03 +08:00
Piotr Żelasko
069ebaf9ba
Reformatting
2021-10-09 14:45:46 +00:00
Piotr Żelasko
b682467e4d
Use BucketingSampler for dev and test data
2021-10-08 22:32:13 -04:00
Fangjun Kuang
707d7017a7
Support pure ctc decoding requiring neither a lexicon nor an n-gram LM ( #58 )
...
* Rename lattice_score_scale to nbest_scale.
* Support pure CTC decoding requiring neither a lexicon nor an n-gram LM.
* Fix style issues.
* Fix a typo.
* Minor fixes.
2021-09-26 14:21:49 +08:00
Fangjun Kuang
a80e58e15d
Refactor decode.py to make it more readable and more modular. ( #44 )
...
* Refactor decode.py to make it more readable and more modular.
* Fix an error.
Nbest.fsa should always have token IDs as labels and
word IDs as aux_labels.
* Add nbest decoding.
* Compute edit distance with k2.
* Refactor nbest-oracle.
* Add rescore with nbest lists.
* Add whole-lattice rescoring.
* Add rescoring with attention decoder.
* Refactoring.
* Fixes after refactoring.
* Fix a typo.
* Minor fixes.
* Replace [] with () for shapes.
* Use k2 v1.9
* Use Levenshtein graphs/alignment from k2 v1.9
* [doc] Require k2 >= v1.9
* Minor fixes.
2021-09-20 15:44:54 +08:00
Wei Kang
24656e9749
Update docs and remove unnecessary arguments ( #42 )
...
* Fix typo in docs
* Update docs and remove unnecessary arguments
* Fix code style
2021-09-13 18:28:57 +08:00
Fangjun Kuang
f792b466bf
Change default value of lattice-score-scale from 1.0 to 0.5 ( #41 )
...
* Change the default value of lattice-score-scale from 1.0 to 0.5
* Fix CI.
2021-09-13 10:49:18 +08:00
Fangjun Kuang
7f8e3a673a
Add commands for reproducing. ( #40 )
...
* Add commands for reproducing.
* Use --bucketing-sampler by default.
2021-09-09 13:50:31 +08:00
Fangjun Kuang
abadc71415
Use new APIs with k2.RaggedTensor ( #38 )
...
* Use new APIs with k2.RaggedTensor
* Fix style issues.
* Update the installation doc, saying it requires at least k2 v1.7
* Use k2 v1.7
2021-09-08 14:55:30 +08:00
Fangjun Kuang
184dbb3ea5
Add documentation about code style and creating new recipes. ( #27 )
2021-08-25 14:48:41 +08:00
pkufool
f4223ee110
Add TDNN-LSTM-CTC Results ( #25 )
...
* Add tdnn-lstm pretrained model and results
* Add docs for TDNN-LSTM-CTC
* Minor fix
* Fix typo
* Fix style checking
2021-08-24 21:09:27 +08:00
Fangjun Kuang
1bd5dcc8ac
WIP: Add doc for the LibriSpeech recipe. ( #24 )
...
* WIP: Add doc for the LibriSpeech recipe.
* Add more doc for LibriSpeech recipe.
* Add more doc for the LibriSpeech recipe.
* More doc.
2021-08-24 20:28:32 +08:00
Fangjun Kuang
6c2c9b9d74
Add recipe for the yes_no dataset. ( #16 )
...
* Add recipe for the yes_no dataset.
* Refactoring: Remove unused code.
* Add Colab notebook for the yesno dataset.
* Add GitHub actions to run yesno.
* Fix a typo.
* Minor fixes.
* Train more epochs for GitHub actions.
* Minor fixes.
* Minor fixes.
* Fix style issues.
2021-08-23 11:36:29 +08:00
pkufool
19c4214958
Fix code style and add copyright. ( #18 )
...
* Fix style and add copyright
* Minor fix
* Remove duplicate lines
* Reformat conformer.py by black
* Reformat code style with black.
* Fix github workflows
* Fix lhotse installation
* Install icefall requirements
* Update k2 version, remove lhotse from test workflow
2021-08-23 10:43:59 +08:00
Fangjun Kuang
8469f9ae0a
Refactor asr_datamodule. ( #15 )
...
* WIP: Refactor asr_datamodule.
* Fixes after review.
* Minor fixes.
2021-08-21 09:53:46 +08:00
Fangjun Kuang
caa0b9e942
Fix an error in displaying decoding process. ( #12 )
2021-08-19 14:54:01 +08:00
Fangjun Kuang
12a2fd023e
Add doc about installation and usage ( #7 )
...
* Add readme.
* Add TOC.
* fix typos
* Minor fixes after review.
2021-08-12 12:44:04 +08:00
Fangjun Kuang
5a0b9bcb23
Refactoring ( #4 )
...
* Fix an error in TDNN-LSTM training.
* WIP: Refactoring
* Refactor transformer.py
* Remove unused code.
* Minor fixes.
2021-08-04 14:53:02 +08:00
Fangjun Kuang
acc63a9172
WIP: Add BPE training code.
2021-07-29 20:23:52 +08:00
Fangjun Kuang
f65854cca5
Add BPE decoding results.
2021-07-27 17:38:47 +08:00
Fangjun Kuang
d3101fb005
Fix loading checkpoint in DDP training.
2021-07-26 08:08:14 +08:00