Mingshuang Luo
bdd890bab9
Add files via upload
2021-09-29 19:23:11 +08:00
Mingshuang Luo
f837818af3
Add files via upload
2021-09-29 19:11:14 +08:00
Mingshuang Luo
cca1399e0f
Delete train.py
2021-09-29 19:10:49 +08:00
Mingshuang Luo
67762e308f
Add files via upload
2021-09-29 19:10:33 +08:00
Mingshuang Luo
49af342863
Delete train.py
2021-09-29 19:10:08 +08:00
Mingshuang Luo
7278b699b3
Add files via upload
2021-09-29 19:09:36 +08:00
Mingshuang Luo
3c5b49006e
Delete train.py
2021-09-29 19:09:11 +08:00
Mingshuang Luo
852efe1b87
Rename train_lossrecord.py to train.py
2021-09-29 17:37:21 +08:00
Mingshuang Luo
8b6c139623
Add files via upload
2021-09-29 17:36:52 +08:00
Mingshuang Luo
bc101093c6
Delete train.py
2021-09-29 17:36:30 +08:00
Mingshuang Luo
00b2d4c9c7
Rename train_lossrecord_tdnn.py to train.py
2021-09-29 17:35:58 +08:00
Mingshuang Luo
84eb064675
Add files via upload
2021-09-29 17:35:24 +08:00
Mingshuang Luo
f03fb67aa1
Delete train.py
2021-09-29 17:35:03 +08:00
Mingshuang Luo
426418c703
Rename train_lossrecord_con.py to train.py
2021-09-29 17:34:34 +08:00
Mingshuang Luo
787ca3b89c
Add files via upload
2021-09-29 17:33:56 +08:00
Mingshuang Luo
6de7f0c062
Delete train.py
2021-09-29 17:33:32 +08:00
Mingshuang Luo
de9b2a9cd1
Update train.py
2021-09-29 16:54:21 +08:00
Mingshuang Luo
1c0792796b
Update train.py
2021-09-29 12:57:23 +08:00
Mingshuang Luo
43cf016ae5
Update train.py
2021-09-29 12:56:47 +08:00
Mingshuang Luo
34e36a926b
Update train.py
2021-09-29 12:55:42 +08:00
Mingshuang Luo
597ff01158
Update train.py
2021-09-29 12:51:38 +08:00
Mingshuang Luo
0fa46bf68a
Update train.py
2021-09-29 12:49:59 +08:00
Mingshuang Luo
e74e75acc6
Use LossRecord to record and print loss for the training process
2021-09-29 10:08:38 +08:00
Mingshuang Luo
73f21a379b
Merge branch 'k2-fsa:master' into master
2021-09-27 15:34:08 +08:00
Fangjun Kuang
707d7017a7
Support pure ctc decoding requiring neither a lexicon nor an n-gram LM ( #58 )
...
* Rename lattice_score_scale to nbest_scale.
* Support pure CTC decoding requiring neither a lexicion nor an n-gram LM.
* Fix style issues.
* Fix a typo.
* Minor fixes.
2021-09-26 14:21:49 +08:00
Mingshuang Luo
6c4a58273f
Fix some spelling errors.
2021-09-26 12:55:51 +08:00
Mingshuang Luo
6abd1bcd0a
Fix some spelling errors.
2021-09-26 12:54:35 +08:00
Fangjun Kuang
a80e58e15d
Refactor decode.py to make it more readable and more modular. ( #44 )
...
* Refactor decode.py to make it more readable and more modular.
* Fix an error.
Nbest.fsa should always have token IDs as labels and
word IDs as aux_labels.
* Add nbest decoding.
* Compute edit distance with k2.
* Refactor nbest-oracle.
* Add rescore with nbest lists.
* Add whole-lattice rescoring.
* Add rescoring with attention decoder.
* Refactoring.
* Fixes after refactoring.
* Fix a typo.
* Minor fixes.
* Replace [] with () for shapes.
* Use k2 v1.9
* Use Levenshtein graphs/alignment from k2 v1.9
* [doc] Require k2 >= v1.9
* Minor fixes.
2021-09-20 15:44:54 +08:00
Wei Kang
9a6e0489c8
update api for RaggedTensor ( #45 )
...
* Fix code style
* update k2 version in CI
* fix compile hlg
2021-09-14 16:39:56 +08:00
Wei Kang
24656e9749
Update docs and remove unnecessary arguments ( #42 )
...
* Fix typo in docs
* Update docs and remove unnecessary arguments
* Fix code style
2021-09-13 18:28:57 +08:00
Fangjun Kuang
f792b466bf
Change default value of lattice-score-scale from 1.0 to 0.5 ( #41 )
...
* Change the default value of lattice-score-scale from 1.0 to 0.5
* Fix CI.
2021-09-13 10:49:18 +08:00
Fangjun Kuang
7f8e3a673a
Add commands for reproducing. ( #40 )
...
* Add commands for reproducing.
* Use --bucketing-sampler by default.
2021-09-09 13:50:31 +08:00
Fangjun Kuang
abadc71415
Use new APIs with k2.RaggedTensor ( #38 )
...
* Use new APIs with k2.RaggedTensor
* Fix style issues.
* Update the installation doc, saying it requires at least k2 v1.7
* Use k2 v1.7
2021-09-08 14:55:30 +08:00
Fangjun Kuang
184dbb3ea5
Add documentation about code style and creating new recipes. ( #27 )
2021-08-25 14:48:41 +08:00
Fangjun Kuang
96e7f5c7ea
Release v0.1 ( #26 )
2021-08-24 21:30:30 +08:00
pkufool
f4223ee110
Add TDNN-LSTM-CTC Results ( #25 )
...
* Add tdnn-lstm pretrained model and results
* Add docs for TDNN-LSTM-CTC
* Minor fix
* Fix typo
* Fix style checking
2021-08-24 21:09:27 +08:00
Fangjun Kuang
1bd5dcc8ac
WIP: Add doc for the LibriSpeech recipe. ( #24 )
...
* WIP: Add doc for the LibriSpeech recipe.
* Add more doc for LibriSpeech recipe.
* Add more doc for the LibriSpeech recipe.
* More doc.
2021-08-24 20:28:32 +08:00
Fangjun Kuang
01da00dca0
WIP: Add documentation. ( #22 )
...
* Begin to add documentation.
* WIP: Add documentation.
* Fix a typo.
* Add more doc for the recipe yesno.
* Add more doc for the yesno recipe.
2021-08-24 14:28:08 +08:00
Fangjun Kuang
57cb611665
[yesno] Remove padding in TDNN ( #21 )
...
* Disable SpecAug for yesno.
Also replace Adam with SGD.
* Remove padding in the model to make the results reproducible.
2021-08-23 15:59:36 +08:00
Fangjun Kuang
6c2c9b9d74
Add recipe for the yes_no dataset. ( #16 )
...
* Add recipe for the yes_no dataset.
* Refactoring: Remove unused code.
* Add Colab notebook for the yesno dataset.
* Add GitHub actions to run yesno.
* Fix a typo.
* Minor fixes.
* Train more epochs for GitHub actions.
* Minor fixes.
* Minor fixes.
* Fix style issues.
2021-08-23 11:36:29 +08:00
pkufool
19c4214958
Fix code style and add copyright. ( #18 )
...
* Fix style and add copyright
* Minor fix
* Remove duplicate lines
* Reformat conformer.py by black
* Reformat code style with black.
* Fix github workflows
* Fix lhotse installation
* Install icefall requirements
* Update k2 version, remove lhotse from test workflow
2021-08-23 10:43:59 +08:00
Fangjun Kuang
8469f9ae0a
Refactor asr_datamodule. ( #15 )
...
* WIP: Refactor asr_datamodule.
* Fixes after review.
* Minor fixes.
2021-08-21 09:53:46 +08:00
Fangjun Kuang
0b656e4e1c
Add a link to Colab. ( #14 )
...
It demonstrates the usages of pre-trained models.
2021-08-20 15:43:25 +08:00
Fangjun Kuang
9d0cc9d829
Support computing nbest oracle WER. ( #10 )
...
* Support computing nbest oracle WER.
* Add scale to all nbest based decoding/rescoring methods.
* Add script to run pretrained models.
* Use torchaudio to extract features.
* Support decoding multiple files at the same time.
Also, use kaldifeat for feature extraction.
* Support decoding with LM rescoring and attention-decoder rescoring.
* Minor fixes.
* Replace scale with lattice-score-scale.
* Add usage example with a provided pretrained model.
2021-08-20 11:53:37 +08:00
pkufool
ef233486ae
The training script produce WER of 2.57% on librispeech test-clean ( #13 )
...
* Add grad_clip and weight-decay, small fix of dataloader and masking
* Add RESULTS.md
2021-08-20 10:08:08 +08:00
Fangjun Kuang
caa0b9e942
Fix an error in displaying decoding process. ( #12 )
2021-08-19 14:54:01 +08:00
Fangjun Kuang
1c3b13c7eb
Minor fixes. ( #9 )
2021-08-16 19:01:25 +08:00
Fangjun Kuang
12a2fd023e
Add doc about installation and usage ( #7 )
...
* Add readme.
* Add TOC.
* fix typos
* Minor fixes after review.
2021-08-12 12:44:04 +08:00
Fangjun Kuang
5a0b9bcb23
Refactoring ( #4 )
...
* Fix an error in TDNN-LSTM training.
* WIP: Refactoring
* Refactor transformer.py
* Remove unused code.
* Minor fixes.
2021-08-04 14:53:02 +08:00
Fangjun Kuang
398ed80d7a
Minor fixes to support DDP training.
2021-07-31 15:26:57 +08:00