dohe0342
bdf46c269e
from local
2023-02-02 13:26:45 +09:00
dohe0342
ab58c3917c
from local
2023-02-02 13:26:41 +09:00
dohe0342
751dcedd0d
from local
2023-02-02 12:56:50 +09:00
dohe0342
485ca642c3
from local
2023-02-02 12:56:46 +09:00
dohe0342
c619461264
from local
2023-02-02 12:56:39 +09:00
dohe0342
325156e02b
from local
2023-02-02 12:39:41 +09:00
dohe0342
4aaa032b01
from local
2023-02-02 12:39:29 +09:00
dohe0342
f4fb761129
from local
2023-02-02 12:39:09 +09:00
dohe0342
de10e1d5fe
from local
2023-02-02 12:38:02 +09:00
dohe0342
2757bf757a
from local
2023-02-02 12:07:28 +09:00
dohe0342
9955a4715e
from local
2023-02-02 12:07:26 +09:00
dohe0342
6a4415006e
from local
2023-02-02 12:04:05 +09:00
dohe0342
86cd4511b2
from local
2023-02-02 12:04:00 +09:00
dohe0342
85055a16bc
from local
2023-01-31 22:55:13 +09:00
dohe0342
e7a5215ad7
from local
2023-01-31 22:54:48 +09:00
dohe0342
3210b7df1a
from local
2023-01-31 22:54:38 +09:00
marcoyang
53454701cb
fix segmentation fault
2022-11-22 11:39:21 +08:00
Desh Raj
d31db01037
manual correction of black formatting
2022-11-17 14:18:05 -05:00
Desh Raj
107df3b115
apply black on all files
2022-11-17 09:42:17 -05:00
Fangjun Kuang
60317120ca
Revert "Apply new Black style changes"
2022-11-17 20:19:32 +08:00
Desh Raj
d110b04ad3
apply new black formatting to all files
2022-11-16 13:06:43 -05:00
Fangjun Kuang
ff3f026381
Checkout the LM for aishell explicitly ( #642 )
2022-10-31 19:47:43 +08:00
Fangjun Kuang
d1f16a04bd
fix type hints for decode.py ( #623 )
2022-10-18 06:56:12 +08:00
LIyong.Guo
923b60a7c6
padding zeros ( #591 )
2022-09-28 21:20:33 +08:00
Fangjun Kuang
e18fa78c3a
Check that read_manifests_if_cached returns a non-empty dict. ( #555 )
2022-08-28 11:50:11 +08:00
Lucky Wong
9277c95bcd
Pruned transducer stateless2 for AISHELL-1 ( #536 )
* Fix not enough values to unpack error.
* [WIP] Pruned transducer stateless2 for AISHELL-1
* fix the style issue
* code format for black
* add pruned-transducer-stateless2 results for AISHELL-1
* simplify result
2022-08-22 10:17:26 +08:00
Lucky Wong
31686ac829
Fix not enough values to unpack error. ( #533 )
2022-08-18 10:45:06 +08:00
Wei Kang
5c17255eec
Sort results to make it more convenient to compare decoding results ( #522 )
* Sort result to make it more convenient to compare decoding results
* Add cut_id to recognition results
* add cut_id to results for all recipes
* Fix torch.jit.script
* Fix comments
* Minor fixes
* Fix torch.jit.tracing for Pytorch version before v1.9.0
2022-08-12 07:12:50 +08:00
Fangjun Kuang
5149788cb2
Fix computing averaged loss in the aishell recipe. ( #523 )
* Fix computing averaged loss in the aishell recipe.
* Set find_unused_parameters optionally.
2022-08-09 10:53:31 +08:00
boji123
3c9e7f733b
[debug] raise remind when git-lfs not available ( #504 )
* [debug] raise remind when git-lfs not available
* modify comment
2022-07-28 16:17:49 +08:00
Fangjun Kuang
385645d533
Fix get_transducer_model() for aishell. ( #497 )
PR #495 introduces an error. This commit fixes it.
2022-07-26 15:42:21 +08:00
Fangjun Kuang
d3fc4b031e
Support using aidatatang_200zh optionally in aishell training ( #495 )
* Use aidatatang_200zh optionally in aishell training.
2022-07-26 11:25:01 +08:00
Jun Wang
d792bdc9bc
fix typo ( #445 )
2022-06-25 11:00:53 +08:00
Fangjun Kuang
7100c33820
Add pruned RNN-T for aishell. ( #436 )
* Add pruned RNN-T for aishell.
* support torch script.
* Update CI.
* Minor fixes.
* Add links to sherpa.
2022-06-21 21:17:22 +08:00
2xwwx2
91b2765cfd
Fixs spelling mistake ( #438 )
2022-06-20 16:41:04 +08:00
Mingshuang Luo
998091ef52
do some changes for export.py ( #437 )
2022-06-20 14:57:08 +08:00
Fangjun Kuang
bfeab319c9
Fix aishell. ( #416 )
2022-06-10 11:47:43 +08:00
Fangjun Kuang
dbda1644b5
Replace load_manifest_lazy with load_manifest for MUSAN. ( #412 )
2022-06-09 11:42:18 +08:00
Fangjun Kuang
ed66877694
Replace ChunkedLilcomHdf5Writer with LilcomChunkyWriter. ( #411 )
2022-06-09 11:18:52 +08:00
Fangjun Kuang
1094a3cb37
Replace LilcomChunkyWriter with ChunkedLilcomHdf5Writer. ( #404 )
2022-06-07 18:14:25 +08:00
Fangjun Kuang
f1abce72f8
Use jsonl for CutSet in the LibriSpeech recipe. ( #397 )
* Use jsonl for cutsets in the librispeech recipe.
* Use lazy cutset for all recipes.
* More fixes to use lazy CutSet.
* Remove force=True from logging to support Python < 3.8
* Minor fixes.
* Fix style issues.
2022-06-06 10:19:16 +08:00
Ewald Enzinger
8c5722de8c
[egs] Add prefix when reading manifests due to recent lhotse changes ( #382 )
* [egs] Add prefix when reading manifests due to recent lhotse changes
* Fix wenetspeech
* Fix style issues
2022-05-23 23:37:35 +08:00
Fangjun Kuang
aeb8986e35
Ignore padding frames during RNN-T decoding. ( #358 )
* Ignore padding frames during RNN-T decoding.
* Fix outdated decoding code.
* Minor fixes.
2022-05-13 07:39:14 +08:00
Mingshuang Luo
f783e10dc8
Do some changes for aishell/ASR/transducer_stateless/export.py ( #347 )
* do some changes for aishell/ASR/transducer_stateless/export.py
2022-05-07 11:09:31 +08:00
Fangjun Kuang
78b8792d1d
Fix potential bugs in PyTorch that exist in label_smoothing. ( #300 )
2022-04-08 13:41:33 +08:00
Wei Kang
cb3ba16f2b
Fix aishell prepare.sh when using pre-download data ( #291 )
2022-04-05 10:22:49 +08:00
Fangjun Kuang
395a3f952b
Batch decoding for models trained with optimized_transducer ( #267 )
* Add greedy search in batch mode.
* Add modified beam search in batch mode.
2022-03-23 19:11:34 +08:00
Mingshuang Luo
d0d806560f
Change for asr_datamodule.py ( #241 )
* change for asr_datamodule.py
* fix style check
* do a fix
2022-03-14 00:30:58 +08:00
Fangjun Kuang
2f0fbf430c
Remove duplicate files. ( #236 )
2022-03-04 11:56:31 +08:00
Fangjun Kuang
3ec219dfa0
Add stateless transducer tutorial. ( #235 )
* WIP: Add stateless transducer tutorial.
* Add more doc.
* Minor fixes.
2022-03-03 22:33:47 +08:00