Fangjun Kuang
810b193dcc
Clarify the doc about ctc-decoding. ( #104 )
2021-11-03 07:16:49 +08:00
Fangjun Kuang
42b437bea6
Use pre-sorted text to generate token ids for attention decoder. ( #98 )
...
* Use pre-sorted text to generate token ids for attention decoder.
See https://github.com/k2-fsa/icefall/issues/97
for more details.
* Fix typos.
2021-10-29 13:46:41 +08:00
Fangjun Kuang
12d647d899
Add a note about the CUDA OOM error. ( #94 )
...
* Add a note about the CUDA OOM error.
Some users consider this kind of OOM as an error during decoding,
but actually it is not. This pull request clarifies that.
* Fix style issues.
2021-10-29 12:17:56 +08:00
Fangjun Kuang
8cb7f712e4
Use GPU for averaging checkpoints if possible. ( #84 )
2021-10-26 17:10:04 +08:00
Fangjun Kuang
712ead8207
Fix an error when attention decoder rescoring returns None. ( #90 )
2021-10-22 19:52:25 +08:00
Piotr Żelasko
902e0b238d
Merge pull request #82 from pzelasko/feature/find-pessimistic-batches
...
Find CUDA OOM batches before starting training
2021-10-19 11:26:13 -04:00
Piotr Żelasko
3cc99d2af2
make flake8 happy
2021-10-19 11:24:54 -04:00
cdxie
d30244e28f
add a docker file for some users ( #87 )
...
* add a docker file for some users
Ubuntu18.04-pytorch1.7.1-cuda11.0-cudnn8-python3.8
* add a file describing how to use the dockerfile
give some steps for using the dockerfile
2021-10-19 13:00:59 +08:00
Piotr Żelasko
86f3e0ef37
Make flake8 happy
2021-10-18 09:54:40 -04:00
Piotr Żelasko
6fbd7a287c
Refactor OOM batch scanning into a local function
2021-10-18 09:53:04 -04:00
Piotr Żelasko
d509d58f30
Merge branch 'master' into feature/find-pessimistic-batches
2021-10-18 09:47:21 -04:00
Fangjun Kuang
3effcb4225
Fix typos. ( #85 )
2021-10-18 16:17:14 +08:00
Fangjun Kuang
53b79fafa7
Add MMI training with word pieces as modelling unit. ( #6 )
...
* Fix an error in TDNN-LSTM training.
* WIP: Refactoring
* Refactor transformer.py
* Remove unused code.
* Minor fixes.
* Fix decoder padding mask.
* Add MMI training with word pieces.
* Remove unused files.
* Minor fixes.
* Refactoring.
* Minor fixes.
* Use pre-computed alignments in LF-MMI training.
* Minor fixes.
* Update decoding script.
* Add doc about how to check and use extracted alignments.
* Fix style issues.
* Fix typos.
* Fix style issues.
* Disable macOS tests for now.
2021-10-18 15:20:32 +08:00
Fangjun Kuang
4890e27b45
Extract framewise alignment information using CTC decoding ( #39 )
...
* Use new APIs with k2.RaggedTensor
* Fix style issues.
* Update the installation doc, saying it requires at least k2 v1.7
* Extract framewise alignment information using CTC decoding.
* Print environment information.
Print information about k2, lhotse, PyTorch, and icefall.
* Fix CI.
* Fix CI.
* Compute framewise alignment information of the LibriSpeech dataset.
* Update comments for the time to compute alignments of train-960.
* Preserve cut id in mix cut transformer.
* Minor fixes.
* Add doc about how to extract framewise alignments.
2021-10-18 14:24:33 +08:00
Jan "yenda" Trmal
bd7c2f7645
fix conformer typo in docs ( #83 )
2021-10-16 07:46:17 +08:00
Piotr Żelasko
403d1744ff
Introduce backprop in finding OOM batches
2021-10-15 10:05:13 -04:00
Piotr Żelasko
060117a9ff
Reformatting
2021-10-14 21:40:14 -04:00
Piotr Żelasko
1c7c79f2fc
Find CUDA OOM batches before starting training
2021-10-14 21:28:11 -04:00
Fangjun Kuang
fee1f84b20
Test pre-trained model in CI ( #80 )
...
* Add CI to run pre-trained models.
* Minor fixes.
* Install kaldifeat
* Install a CPU version of PyTorch.
* Fix CI errors.
* Disable decoder layers in pretrained.py if it is not used.
* Clone pre-trained model from GitHub.
* Minor fixes.
* Minor fixes.
* Minor fixes.
2021-10-15 00:41:33 +08:00
Mingshuang Luo
5401ce199d
Update ctc-decoding on pretrained.py and conformer_ctc.rst ( #78 )
2021-10-14 23:29:06 +08:00
Fangjun Kuang
f2387fe523
Fix a bug introduced while supporting torch script. ( #79 )
2021-10-14 20:09:38 +08:00
Fangjun Kuang
5016ee3c95
Give an informative message when users provide an unsupported decoding method ( #77 )
2021-10-14 16:20:35 +08:00
Mingshuang Luo
39bc8cae94
Add ctc decoding to pretrained.py on conformer_ctc ( #75 )
...
* Add ctc-decoding to pretrained.py
* update pretrained.py and conformer_ctc.rst
* update ctc-decoding for pretrained.py on conformer_ctc
* Update pretrained.py
* fix the style issue
* Update conformer_ctc.rst
* Update the running logs
2021-10-13 12:20:16 +08:00
Mingshuang Luo
391432b356
Update train.py ("10"--->"params.log_interval") ( #76 )
...
* Update train.py
* Update train.py
* Update train.py
2021-10-12 21:30:31 +08:00
Mingshuang Luo
597c5efdb1
Use LossRecord to record and print the loss for the training process ( #62 )
...
* Update index.rst (AS->ASR)
* Update conformer_ctc.rst (pretraind->pretrained)
* Fix some spelling errors.
* Fix some spelling errors.
* Use LossRecord to record and print loss in the training process
* Change the name "LossRecord" to "MetricsTracker"
2021-10-12 15:58:03 +08:00
Fangjun Kuang
beb54ddb61
Support torch script. ( #65 )
...
* WIP: Support torchscript.
* Minor fixes.
* Fix style issues.
* Add documentation about how to deploy a trained model.
2021-10-12 14:55:05 +08:00
Piotr Żelasko
d54828e73a
Merge pull request #73 from pzelasko/feature/bucketing-in-test
...
Use BucketingSampler for dev and test data
2021-10-09 10:58:29 -04:00
Piotr Żelasko
069ebaf9ba
Reformatting
2021-10-09 14:45:46 +00:00
Mingshuang Luo
6e43905d12
Update the documentation to include "ctc-decoding" ( #71 )
...
* Update conformer_ctc.rst
2021-10-09 11:56:25 +08:00
Piotr Żelasko
b682467e4d
Use BucketingSampler for dev and test data
2021-10-08 22:32:13 -04:00
Piotr Żelasko
adb068eb82
setup.py ( #64 )
2021-10-01 16:43:08 +08:00
Fangjun Kuang
707d7017a7
Support pure ctc decoding requiring neither a lexicon nor an n-gram LM ( #58 )
...
* Rename lattice_score_scale to nbest_scale.
* Support pure CTC decoding requiring neither a lexicon nor an n-gram LM.
* Fix style issues.
* Fix a typo.
* Minor fixes.
2021-09-26 14:21:49 +08:00
Fangjun Kuang
455693aede
Fix hasattr of AttributeDict. ( #52 )
2021-09-22 16:37:20 +08:00
Fangjun Kuang
a80e58e15d
Refactor decode.py to make it more readable and more modular. ( #44 )
...
* Refactor decode.py to make it more readable and more modular.
* Fix an error.
Nbest.fsa should always have token IDs as labels and
word IDs as aux_labels.
* Add nbest decoding.
* Compute edit distance with k2.
* Refactor nbest-oracle.
* Add rescore with nbest lists.
* Add whole-lattice rescoring.
* Add rescoring with attention decoder.
* Refactoring.
* Fixes after refactoring.
* Fix a typo.
* Minor fixes.
* Replace [] with () for shapes.
* Use k2 v1.9
* Use Levenshtein graphs/alignment from k2 v1.9
* [doc] Require k2 >= v1.9
* Minor fixes.
2021-09-20 15:44:54 +08:00
Fangjun Kuang
cc77cb3459
Fix decode.py to remove the correct axis. ( #50 )
...
* Fix decode.py to remove the correct axis.
* Run GitHub actions manually.
2021-09-17 16:49:03 +08:00
Wei Kang
9a6e0489c8
update api for RaggedTensor ( #45 )
...
* Fix code style
* update k2 version in CI
* fix compile hlg
2021-09-14 16:39:56 +08:00
Fangjun Kuang
a2be2896a9
Fix the link to k2's installation doc. ( #46 )
2021-09-14 13:39:52 +08:00
Wei Kang
24656e9749
Update docs and remove unnecessary arguments ( #42 )
...
* Fix typo in docs
* Update docs and remove unnecessary arguments
* Fix code style
2021-09-13 18:28:57 +08:00
Fangjun Kuang
f792b466bf
Change default value of lattice-score-scale from 1.0 to 0.5 ( #41 )
...
* Change the default value of lattice-score-scale from 1.0 to 0.5
* Fix CI.
2021-09-13 10:49:18 +08:00
Fangjun Kuang
7f8e3a673a
Add commands for reproducing. ( #40 )
...
* Add commands for reproducing.
* Use --bucketing-sampler by default.
2021-09-09 13:50:31 +08:00
Fangjun Kuang
abadc71415
Use new APIs with k2.RaggedTensor ( #38 )
...
* Use new APIs with k2.RaggedTensor
* Fix style issues.
* Update the installation doc, saying it requires at least k2 v1.7
* Use k2 v1.7
2021-09-08 14:55:30 +08:00
Fangjun Kuang
331e5eb7ab
[doc] Fix typos. ( #31 )
2021-09-02 07:12:37 +08:00
Mingshuang Luo
5baa6a9f1c
fix a spelling mistake (tourch->touch) ( #29 )
v1.0
2021-08-25 21:41:46 +08:00
Mingshuang Luo
eed3fc5610
Correct some spelling mistakes ( #28 )
...
* Update index.rst (AS->ASR)
* Update conformer_ctc.rst (pretraind->pretrained)
2021-08-25 17:48:34 +08:00
Fangjun Kuang
184dbb3ea5
Add documentation about code style and creating new recipes. ( #27 )
2021-08-25 14:48:41 +08:00
Fangjun Kuang
96e7f5c7ea
Release v0.1 ( #26 )
v0.1
2021-08-24 21:30:30 +08:00
pkufool
f4223ee110
Add TDNN-LSTM-CTC Results ( #25 )
...
* Add tdnn-lstm pretrained model and results
* Add docs for TDNN-LSTM-CTC
* Minor fix
* Fix typo
* Fix style checking
2021-08-24 21:09:27 +08:00
Fangjun Kuang
1bd5dcc8ac
WIP: Add doc for the LibriSpeech recipe. ( #24 )
...
* WIP: Add doc for the LibriSpeech recipe.
* Add more doc for LibriSpeech recipe.
* Add more doc for the LibriSpeech recipe.
* More doc.
2021-08-24 20:28:32 +08:00
Fangjun Kuang
01da00dca0
WIP: Add documentation. ( #22 )
...
* Begin to add documentation.
* WIP: Add documentation.
* Fix a typo.
* Add more doc for the recipe yesno.
* Add more doc for the yesno recipe.
2021-08-24 14:28:08 +08:00
Fangjun Kuang
57cb611665
[yesno] Remove padding in TDNN ( #21 )
...
* Disable SpecAug for yesno.
Also replace Adam with SGD.
* Remove padding in the model to make the results reproducible.
2021-08-23 15:59:36 +08:00