1243 Commits

Author SHA1 Message Date
Wei Kang
d74822d07b
Fix wenetspeech decoding speed (#953) 2023-03-21 21:35:32 +08:00
marcoyang1998
7948624a22
Support fine-tuning (#944)
* support finetune

* add files for decoding giga

* support initializing modules

* add a fine-tune bash script
2023-03-17 13:44:29 +08:00
Jason's Lab
6196b4a407
Add char-based language model training process for aishell. (#945)
* Add char-based language model training process for aishell.

Add soft link from librispeech/ASR/local/sort_lm_training_data.py to aishell/ASR/local/

---------

Co-authored-by: lichao <www.563042811@qq.com>
2023-03-16 09:52:11 +08:00
Yifan Yang
a48812ddb3
Ban the test_rnn.py in ci-test (#949) 2023-03-15 22:02:20 +08:00
Yifan Yang
cad6735e07
Modify make_pad_mask to support TensorRT (#943)
* Modify make_pad_mask to support TensorRT

* Fix for test
2023-03-10 19:28:59 +08:00
marcoyang1998
9ddd811925
Fix padding_idx (#942)
* fix padding_idx

* update RESULTS.md
2023-03-10 14:37:28 +08:00
Yifan Yang
28af269e5e
Fix for workflow (#934) 2023-03-09 17:38:15 +08:00
Fangjun Kuang
f5de2e90c6
Fix style issues. (#937) 2023-03-08 22:56:04 +08:00
pehonnet
07243d136a
remove key from result filename (#936)
Co-authored-by: pe-honnet <pe.honnet@telepathy.ai>
2023-03-08 21:06:07 +08:00
Fangjun Kuang
8aaa9761e4
Add doc about exporting streaming zipformer to sherpa-ncnn (#927) 2023-02-27 21:23:04 +08:00
Fangjun Kuang
b7c85968ae
Use standard apache 2.0 license (#919) 2023-02-22 11:15:58 +08:00
marcoyang1998
c51e6c5b9c
fix typo (#916) 2023-02-20 19:04:57 +08:00
nihui
4626c60c74
fix typo (#915) 2023-02-17 15:38:08 +08:00
Fangjun Kuang
52d7cdd1a6
Update doc about exporting LSTM models to ncnn (#914) 2023-02-17 12:50:13 +08:00
Fangjun Kuang
c01175679e
Add CI test for exporting csj pretrained zipformer to ncnn (#913) 2023-02-16 21:09:05 +08:00
Fangjun Kuang
6d7a55904c
export script to ncnn for csj (#912) 2023-02-16 19:47:54 +08:00
Zengwei Yao
4e832fa6b0
fix reduction conformer_ctc3/train.py (#908) 2023-02-14 20:45:38 +08:00
marcoyang
1aa2a930b4 fix codestyle 2023-02-14 16:23:42 +08:00
marcoyang
7d217e15ab updates pretrained*.py 2023-02-14 16:11:05 +08:00
marcoyang
f6f29b9321 reformat 2023-02-14 15:52:54 +08:00
Fangjun Kuang
c5e687ddf5
Export streaming zipformer to ncnn (#906) 2023-02-13 23:41:43 +08:00
Teo Wen Shen
e63a8c27f8
CSJ pruned_transducer_stateless7_streaming (#892)
* update manifest stats

* update transcript configs

* lang_char and compute_fbanks

* save cuts in fbank_dir

* add core codes

* update decode.py

* Create local/utils

* tidy up

* parse raw in prepare_lang_char.py

* working train

* Add compare_cer_transcript.py

* fix tokenizer decode, allow d2f only

* comment cleanup

* add export files and READMEs

* reword average column

* fix comments

* Update new results
2023-02-13 22:19:50 +08:00
Zengwei Yao
25ee50e27c
add ctc-greedy-search with timestamps (#905) 2023-02-13 19:45:09 +08:00
marcoyang
56c2474c0d fix 2023-02-13 18:19:52 +08:00
marcoyang
a57c54124a fix code style 2023-02-13 17:55:34 +08:00
marcoyang
dae3a310f4 fix code style 2023-02-13 17:49:32 +08:00
marcoyang
8fb81e27cb lstm_transducer_stateless3/streaming-ncnn-decode.py 2023-02-13 17:48:46 +08:00
marcoyang
142ea421b0 small fix 2023-02-13 16:21:21 +08:00
marcoyang
6018f222df update decoding files 2023-02-13 16:21:01 +08:00
marcoyang
b39ac0207e add necessary files 2023-02-13 16:20:42 +08:00
Desh Raj
6a8b649e56
Add small streaming Zipformer transducer model (#903) 2023-02-13 15:53:28 +08:00
marcoyang
461fe91dc3 copy file 2023-02-13 14:09:53 +08:00
Yifan Yang
c34ee67691
Update generate_model_from_checkpoint.py (#901) 2023-02-13 14:05:38 +08:00
marcoyang
b3fa59d68a Merge branch 'master' of github.com:marcoyang1998/icefall into add_lstm_transducer 2023-02-13 12:47:34 +08:00
marcoyang
eb9a5267a9 add args in streaming-ncnn-decode.py 2023-02-13 12:45:47 +08:00
marcoyang
c689b018d7 add new files 2023-02-13 12:45:22 +08:00
marcoyang
bff0822ffa small fix 2023-02-13 12:44:52 +08:00
marcoyang
04ce87e307 add export 2023-02-13 12:44:09 +08:00
marcoyang
8f5fcb2f52 add files 2023-02-13 12:18:58 +08:00
Fangjun Kuang
c102e7fbf0
more fixes for lstm3 to support exporting to ncnn (#902) 2023-02-13 12:16:43 +08:00
marcoyang
4d4a2c08f7 fix export for ncnn 2023-02-13 12:04:29 +08:00
marcoyang
72f5c5e17f add file 2023-02-13 11:54:23 +08:00
Fangjun Kuang
48c2c22dbe
Fix export to ncnn for lstm3 (#900) 2023-02-13 11:44:25 +08:00
marcoyang
5596da0704 add export ncnn 2023-02-13 10:46:18 +08:00
KajiMaCN
57604aac34
fix tal_csasr data pre-processing (#898) 2023-02-10 21:28:19 +08:00
marcoyang
7556811d64 modify train.py 2023-02-10 16:49:57 +08:00
marcoyang
d1a0668f68 add soft links 2023-02-10 16:41:59 +08:00
marcoyang
3558556906 copy file and soft links 2023-02-10 16:40:18 +08:00
xiabingquan
cba6ecc1d1
Update README.md (#894) 2023-02-09 23:54:45 +08:00
emilyluj
59ac8bfc70
fix mmi graph compiler bug. (#895) 2023-02-09 18:32:03 +08:00