958 Commits

Author SHA1 Message Date
yfy62
de8bb48ef1 Add ci 2023-10-20 22:20:18 +08:00
Yifan Yang
91f4e52fcf
Merge branch 'k2-fsa:master' into phone2 2023-10-20 09:09:31 -05:00
Yifan Yang
7ad7e33735
Update RESULTS.md 2023-10-20 08:39:14 -05:00
Rudra
eef47adee9
fix typo (#1324) 2023-10-19 22:54:43 +08:00
Daniel Povey
973dc1026d
Make diagnostics.py more error-tolerant and have wider range of supported torch versions (#1234) 2023-10-19 22:54:00 +08:00
Karel Vesely
543b4cc1ca
small enhancements (#1322)
- add extra check of 'x' and 'x_lens' to earlier point in Transducer model
- specify 'utf' encoding when opening text files for writing (recogs,
  errs)
2023-10-19 21:53:31 +08:00
marcoyang1998
ce372cce33
Update documentation to PromptASR (#1321) 2023-10-19 17:24:31 +08:00
Surav Shrestha
36c60b0cf6
fix typos in icefall/utils.py (#1319) 2023-10-19 11:15:18 +08:00
Ikko Eltociear Ashimine
98c5286404
Fix typo in code-style.rst (#1318) 2023-10-19 00:13:50 +08:00
marcoyang1998
52c24df61d
Fix model avg (#1317)
* fix a bug about the model_avg during finetuning by exchanging the order of loading pre-trained model and initializing avg model

* only match the exact module prefix
2023-10-18 17:36:14 +08:00
Erwan Zerhouni
807816fec0
Fix chunk issue for sherpa (#1316) 2023-10-18 16:07:10 +08:00
yfy62
2ad176321a Replace some with soft links 2023-10-18 10:56:08 +08:00
Yifan Yang
30a4dd2f95
Fix README.md 2023-10-17 21:44:24 -05:00
Yifan Yang
e7414b94ee
Merge branch 'k2-fsa:master' into phone2 2023-10-17 21:42:41 -05:00
zr_jin
d2bd0933b1
Compatibility with the latest Lhotse (#1314) 2023-10-17 21:22:32 +08:00
Yifan Yang
e71d0086cb Fix for black 2023-10-17 20:07:32 +08:00
Yifan Yang
6eddab2a8d
Update RESULTS.md 2023-10-17 20:02:53 +08:00
Yifan Yang
9944b56108
Update RESULTS.md 2023-10-17 20:01:52 +08:00
Yifan Yang
327f475061
Update README.md 2023-10-17 19:40:19 +08:00
Yifan Yang
00be865c99 Fix streaming_decode.py 2023-10-17 19:35:56 +08:00
Yifan Yang
0c86ac901a update README 2023-10-17 19:32:56 +08:00
Yifan Yang
2732349215 Finish 2023-10-17 19:15:55 +08:00
Yifan Yang
6f0f358bcf Add Zipformer recipe for GigaSpeech 2023-10-17 17:58:28 +08:00
zr_jin
1ef349d120
[WIP] AISHELL-1 pruned transducer stateless7 streaming recipe (#1300)
* `pruned_transducer_stateless7_streaming` for AISHELL-1

* Update train.py

* Update train2.py

* Update decode.py

* Update RESULTS.md
2023-10-16 16:28:16 +08:00
zr_jin
eeeeef390b
Minor bug fixes and descriptive text for the LibriCSS recipe (#1268) 2023-10-12 10:02:49 -04:00
zr_jin
162ceaf4b3
fixes for data preparation (#1307)
Issue: #1306
2023-10-12 17:05:41 +08:00
zr_jin
855492156a
Update finetune.py (#1304) 2023-10-12 16:48:23 +08:00
Wen Ding
2b3c5d799f
Fix padding issues (#1303) 2023-10-11 16:58:00 +08:00
marcoyang1998
16a2748d6c
PromptASR for contextualized ASR with controllable style (#1250)
* Add PromptASR with BERT as text encoder

* Support using word-list based content prompts for context biasing

* Upload the pretrained models to huggingface

* Add usage example
2023-10-11 14:56:41 +08:00
Fangjun Kuang
cb874e9905
add export-onnx.py for stateless8 (#1302)
* add export-onnx.py for stateless8

* use tokens.txt to replace bpe.model
2023-10-11 12:20:12 +08:00
zr_jin
103d617380
bug fixes (#1301) 2023-10-11 11:04:20 +08:00
zr_jin
0d09a44930
Update train.py (#1299) 2023-10-11 10:06:00 +08:00
Zengwei Yao
9af144c26b
Zipformer update result (#1296)
* update Zipformer results
2023-10-09 23:15:22 +08:00
zr_jin
fefffc02f6
Update optim.py (#1292) 2023-10-09 17:39:23 +08:00
zr_jin
ce08230ade
Update README.md (#1293) 2023-10-07 11:57:30 +08:00
zr_jin
82199b8fe1
Init commit for swbd (#1146) 2023-10-07 11:44:18 +08:00
Fangjun Kuang
109354b6b8
Add CTC HLG decoding for zipformer (#1287) 2023-10-02 14:00:06 +08:00
Fangjun Kuang
f14b673408
Add HLG decoding with OpenFst on CPU for aishell conformer_ctc (#1279) 2023-10-01 13:46:16 +08:00
Fangjun Kuang
48cc41bd83 Fix CI 2023-09-30 22:23:22 +08:00
Dongji Gao
3abc290c11
Add scripts and recipe for BTC/OTC (#1255) 2023-09-29 07:52:46 +08:00
yaguang
8181d19860
check bbpe model exists in advance. (#1277) 2023-09-27 17:35:26 +08:00
yaguang
a5ba1133c4
Compatible with new lhotse versions. (#1278) 2023-09-27 17:33:38 +08:00
Fangjun Kuang
772ee3955b
Support HLG decoding using OpenFst with kaldi decoders (#1275) 2023-09-27 14:49:27 +08:00
Fangjun Kuang
2318c3fbd0
Support CTC decoding on CPU using OpenFst and kaldi decoders. (#1244) 2023-09-26 16:36:19 +08:00
zr_jin
1b565dd251
added softlinks to local dir (#1273) 2023-09-26 15:41:39 +08:00
marcoyang1998
e17f884ace
Fix docs for MVQ (#1272)
* typo fix
2023-09-25 15:36:40 +08:00
marcoyang1998
97f9b9c33b
Add documentation for RNNLM training (#1267)
* add documentation for training an RNNLM
2023-09-25 10:48:50 +08:00
zr_jin
ef5da4824d
formatted the entire LibriSpeech recipe (#1270)
* formatted the entire librispeech recipe

* minor updates
2023-09-24 17:31:01 +08:00
zr_jin
ef658d691e
fixes for init value of diagnostics.TensorDiagnosticOptions (#1269)
* fixes for `diagnostics`

Replace `2 ** 22` with `512` as the default value of `diagnostics.TensorDiagnosticOptions`

also black formatted some scripts

* fixed formatting issues
2023-09-24 17:06:47 +08:00
Fangjun Kuang
34e40a86b3
Fix exporting decoder model to onnx (#1264)
* Use torch.jit.script() to export the decoder model

See also https://github.com/k2-fsa/sherpa-onnx/issues/327
2023-09-22 09:57:15 +08:00