marcoyang1998
45c13e90e4
RNNLM rescore + Low-order density ratio ( #1017 )
...
* add rnnlm rescore + LODR
* add LODR in decode.py
* update RESULTS
2023-04-24 15:00:02 +08:00
Yifan Yang
2096e69bda
Use CutSet.mux for multidataset ( #1020 )
...
* Use CutSet.mux
* Remove mischange
* Fix for style check
2023-04-23 18:41:44 +08:00
Yifan Yang
d67a49afe4
Add multidataset ( #1010 )
...
* Add Common Voice for multidataset
* Add prepare_multidataset.sh
* Add dataset mixing
* Update prepare_multidataset.sh
* Update prepare_giga_speech.sh
* update comments
* Add split and shuffle mechanism
* Add multi-dataset train
* Fix for deleting
* Fix for modifying
* Add comments
* Change type for perturb_speed
* Fix for style check
* Small fix
* Add filter
* Remove warning
2023-04-21 18:09:41 +08:00
marcoyang1998
57d6482a79
Streaming Zipformer with multi-dataset ( #984 )
...
* modify train.py
* add right padding option in decode.py
* update RESULTS.md
2023-04-21 15:43:28 +08:00
Wen Ding
78b9dcc936
Support exporting BS Zipformer models to ONNX, used in Triton Server ( #1008 )
...
* Support export BS Zipformer models to ONNX in Tritron
* Update copyright
* Update exporting codes for BS zipformer models
* Code format
* Update comments
* Update export_onnx.py
---------
Co-authored-by: Yifan Yang <64255737+yfyeung@users.noreply.github.com>
2023-04-18 17:05:08 +08:00
marcoyang1998
34d1b07c3d
Modified beam search with RNNLM rescoring ( #1002 )
...
* add RNNLM rescore
* add shallow fusion and lm rescore for streaming zipformer
* minor fix
* update RESULTS.md
* fix yesno workflow, change from ubuntu-18.04 to ubuntu-latest
2023-04-17 16:43:00 +08:00
Fangjun Kuang
e32658e620
Fix torch.jit.script() export for streaming zipformer. ( #1005 )
2023-04-17 16:13:30 +08:00
Zengwei Yao
7c7d9ab042
add @torch.jit.export for streaming_forward func in Zipformer class ( #1004 )
2023-04-17 12:03:52 +08:00
Zengwei Yao
5f066d3d53
support decoding and computing RTF on test sets with onnx models ( #995 )
...
* support decode and compute RTF on test sets with onnx models
* support onnx export and decode in pruned_transducer_stateless
2023-04-12 19:04:50 +08:00
Yifan Yang
33578cca48
Fix filter_cuts in compute_fbank_librispeech.py ( #993 )
2023-04-11 11:12:05 +08:00
Zengwei Yao
136aa94d57
remove duplicated lines ( #988 )
2023-04-06 17:47:33 +08:00
Yifan Yang
c90f57afdb
Remove simulate streaming from stateless8 ( #985 )
2023-04-04 11:04:00 +08:00
marcoyang1998
d337398d29
Shallow fusion for Aishell ( #954 )
...
* add shallow fusion and LODR for aishell
* update RESULTS
* add save by iterations
2023-04-03 16:20:29 +08:00
Yifan Yang
46bf6df62f
Remove simulate streaming from stateless7 ( #983 )
...
* Remove simulate streaming from stateless7
2023-04-03 14:55:45 +08:00
Fangjun Kuang
a632b24c35
Export int8 quantized models for non-streaming Zipformer. ( #977 )
...
* Export int8 quantized models for non-streaming Zipformer.
* Delete export-onnx.py
* Export int8 models for other folders
2023-03-31 22:46:19 +08:00
Zengwei Yao
2a5a75cb56
add option of using full attention for streaming model decoding ( #975 )
2023-03-30 14:30:13 +08:00
Zengwei Yao
bcc5923ab9
Support batch-wise forced-alignment ( #970 )
...
* support batch-wise forced-alignment based on beam search
* add length_norm to HypothesisList.topk()
* Use Hypothesis and HypothesisList instead
2023-03-28 23:24:24 +08:00
Fangjun Kuang
8c3ea93fc8
Save meta data to exported ONNX models ( #968 )
2023-03-27 11:39:29 +08:00
Zengwei Yao
7155769c19
minor fix, remove numel = p.numel() in optim.py ( #967 )
2023-03-24 15:30:29 +08:00
Peng He
f260a09ed4
remove if-branch at downsample pad in zipformer for onnx-export compatibility ( #965 )
2023-03-24 14:30:43 +08:00
marcoyang1998
7948624a22
Support fine-tuning ( #944 )
...
* support finetune
* add files for decoding giga
* support initializing modules
* add a fine-tune bash script
2023-03-17 13:44:29 +08:00
Yifan Yang
a48812ddb3
Ban the test_rnn.py in ci-test ( #949 )
2023-03-15 22:02:20 +08:00
Yifan Yang
cad6735e07
Modify make_pad_mask to support TensorRT ( #943 )
...
* Modify make_pad_mask to support TensorRT
* Fix for test
2023-03-10 19:28:59 +08:00
marcoyang1998
9ddd811925
Fix padding_idx ( #942 )
...
* fix padding_idx
* update RESULTS.md
2023-03-10 14:37:28 +08:00
Yifan Yang
28af269e5e
Fix for workflow ( #934 )
2023-03-09 17:38:15 +08:00
Fangjun Kuang
f5de2e90c6
Fix style issues. ( #937 )
2023-03-08 22:56:04 +08:00
pehonnet
07243d136a
remove key from result filename ( #936 )
...
Co-authored-by: pe-honnet <pe.honnet@telepathy.ai>
2023-03-08 21:06:07 +08:00
marcoyang1998
c51e6c5b9c
fix typo ( #916 )
2023-02-20 19:04:57 +08:00
Zengwei Yao
4e832fa6b0
fix reduction conformer_ctc3/train.py ( #908 )
2023-02-14 20:45:38 +08:00
Fangjun Kuang
c5e687ddf5
Export streaming zipformer to ncnn ( #906 )
2023-02-13 23:41:43 +08:00
Zengwei Yao
25ee50e27c
add ctc-greedy-search with timestamps ( #905 )
2023-02-13 19:45:09 +08:00
Desh Raj
6a8b649e56
Add small streaming Zipformer transducer model ( #903 )
2023-02-13 15:53:28 +08:00
Yifan Yang
c34ee67691
Update generate_model_from_checkpoint.py ( #901 )
2023-02-13 14:05:38 +08:00
Fangjun Kuang
c102e7fbf0
more fixes for lstm3 to support exporting to ncnn ( #902 )
2023-02-13 12:16:43 +08:00
Fangjun Kuang
48c2c22dbe
Fix export to ncnn for lstm3 ( #900 )
2023-02-13 11:44:25 +08:00
xiabingquan
cba6ecc1d1
Update README.md ( #894 )
2023-02-09 23:54:45 +08:00
Yifan Yang
5cd1636cb3
Fix a bug in decode.py ( #893 )
...
Co-authored-by: yifanyang <yifanyeung@yifanyangs-MacBook-Pro.local>
2023-02-09 12:12:23 +08:00
Karel Vesely
35e5a2475c
Librispeech, validate_manifest.py ( #890 )
2023-02-09 07:57:02 +08:00
Fangjun Kuang
2b995639b7
Add ONNX support for Zipformer and ConvEmformer ( #884 )
2023-02-09 00:02:38 +08:00
Zengwei Yao
af735eb75b
Get alignments using lhotse workflows align-with-torchaudio ( #888 )
...
* add lhotse workflow align-with-torchaudio
* modify related decode.py files
2023-02-08 21:54:35 +08:00
Zengwei Yao
d12e6f098c
Get (start, end) timestamps for CTC models ( #876 )
...
* parse timestamps and texts for BPE-based models
* parse timestamps (frame indexes) and texts for other cases
* add test functions
* add parse_fsa_timestamps_and_texts function, test in conformer_ctc3/decode.py
* calculate symbol delay for (start, end) timestamps
2023-02-07 21:43:16 +08:00
Fangjun Kuang
7ae03f6c88
Add onnx export support for pruned_transducer_stateless5 ( #883 )
2023-02-07 17:47:08 +08:00
Yifan Yang
ffbf6d9199
Add generate_averaged_model.py ( #882 )
2023-02-07 16:19:08 +08:00
Fangjun Kuang
8d3810e289
Simplify ONNX export ( #881 )
...
* Simplify ONNX export
* Fix ONNX CI tests
2023-02-07 15:01:59 +08:00
Fangjun Kuang
52f3a747be
Refactor onnx export for streaming zipformer ( #879 )
2023-02-07 12:12:26 +08:00
Zengwei Yao
5a05b95730
add params.hlg_scale ( #880 )
2023-02-06 23:21:46 +08:00
Yifan Yang
caf23546ed
No more T < S after frame_reducer ( #875 )
...
* No more T < S after frame_reducer
* Fix for style check
* Adjust the permissions
* Add support for inference to frame_reducer
* Fix for flake8 check
---------
Co-authored-by: yifanyang <yifanyeung@yifanyangs-MacBook-Pro.local>
2023-02-06 12:17:45 +08:00
Yuekai Zhang
bf5f0342a2
Add streaming onnx export for zipformer ( #831 )
...
* add streaming onnx export for zipformer
* update triton support
* add comments
* add ci test
* add onnxmltools for fp16 onnx export
2023-02-06 10:37:07 +08:00
Yifan Yang
029c8566e4
Small fix for frame_reducer.py ( #871 )
2023-02-03 17:49:54 +08:00
Yifan Yang
bffce413f0
Fix filename ctc_guild_decode_bs.py -> ctc_guide_decode_bs.py ( #870 )
...
* fix filename ctc_guild_decode_bs.py -> ctc_guide_decode_bs.py
---------
Co-authored-by: yifanyang <yifanyeung@yifanyangs-MacBook-Pro.local>
2023-02-03 12:32:06 +08:00