Fangjun Kuang
666d69b20d
Rename train2.py to avoid confusion ( #1386 )
2023-11-17 18:12:59 +08:00
Karel Vesely
59c943878f
add the voxpopuli
recipe ( #1374 )
...
* add the `voxpopuli` recipe
- this is the data preparation
- there is no ASR training and no results
* update the PR#1374 (feedback from @csukuangfj)
- fixing .py headers and docstrings
- removing BUT specific parts of `prepare.sh`
- adding assert `num_jobs >= num_workers` to `compute_fbank.py`
- narrowing list of languages
(let's limit to ASR sets with transcripts for now)
- added links to `README.md`
- extending `text_from_manifest.py`
2023-11-16 14:38:31 +08:00
zr_jin
6d275ddf9f
fixed broken softlinks ( #1381 )
...
* removed broken softlinks
* fixed dependencies
* fixed file permission
2023-11-10 14:45:16 +08:00
lishaojie
1b2e99d374
add the pruned_transducer_stateless7_streaming recipe for commonvoice ( #1018 )
...
* add the pruned_transducer_stateless7_streaming recipe for commonvoice
* fix the symlinks
* Update RESULTS.md
2023-11-09 22:07:28 +08:00
zr_jin
231bbcd2b6
Update optim.py ( #1366 )
2023-11-03 12:06:29 +08:00
wnywbyt
c3bbb32f9e
Update the parameter 'vocab-size' ( #1364 )
...
Co-authored-by: wdq <dongqin.wan@desaysv.com>
2023-11-02 20:45:30 +08:00
zr_jin
9e5a5d7839
Incorporate some latest changes to optim.py
( #1359 )
...
* init commit
* black formatted
* isort formatted
2023-11-02 16:10:08 +08:00
zr_jin
23913f6afd
Minor refinements for some stale but recently merged PRs ( #1354 )
...
* incorporate https://github.com/k2-fsa/icefall/pull/1269
* incorporate https://github.com/k2-fsa/icefall/pull/1301
* black formatted
* incorporate https://github.com/k2-fsa/icefall/pull/1162
* black formatted
2023-10-31 10:28:20 +08:00
Tiance Wang
c970df512b
New recipe: tiny_transducer_ctc ( #848 )
...
* initial commit
* update readme
* Update README.md
* change bool to str2bool for arg parser
* run validation only at the end of epoch
* black format
* black format
2023-10-30 12:09:39 +08:00
Himanshu Kumar Mahto
161ab90dfb
Enhancing the contributing.md file ( #1351 )
2023-10-30 09:07:42 +08:00
Desh Raj
7d56685734
[recipe] LibriSpeech zipformer_ctc ( #941 )
...
* merge upstream
* initial commit for zipformer_ctc
* remove unwanted changes
* remove changes to other recipe
* fix zipformer softlink
* fix for JIT export
* add missing file
* fix symbolic links
* update results
* Update RESULTS.md
Address comments from @csukuangfj
---------
Co-authored-by: zr_jin <peter.jin.cn@gmail.com>
2023-10-27 13:38:09 +08:00
Shreyas0410
5cebecf2dc
updated broken link in read.me file ( #1342 )
2023-10-27 13:36:15 +08:00
zr_jin
ea78b32857
minor fixes ( #1345 )
2023-10-27 13:35:43 +08:00
hairyputtar
800bf4b6a2
fix more typos ( #1340 )
...
* fix more typos
* fix typo
* fix typo
* fix typo
2023-10-27 11:46:28 +08:00
Zengwei Yao
c0a53271e2
Update Zipformer-large result on LibriSpeech ( #1343 )
...
* update zipformer-large result on librispeech
2023-10-26 17:35:12 +08:00
zr_jin
770c495484
minor fixes in the CTC decoding code ( #1338 )
2023-10-25 17:14:17 +08:00
zr_jin
dcbc7a63e1
Update train-rnn-lm.sh ( #1337 )
2023-10-25 12:50:35 +08:00
zr_jin
1814bbb0e7
typo fixed ( #1334 )
2023-10-25 00:03:33 +08:00
zr_jin
f82bccfd63
Support CTC decoding for multi-zh_hans
recipe ( #1313 )
2023-10-24 19:04:09 +08:00
zr_jin
d76c3fe472
Migrate zipformer model to other Chinese datasets ( #1216 )
...
added zipformer recipe for AISHELL-1
2023-10-24 16:24:46 +08:00
hairyputtar
3fb99400cf
fix typos ( #1336 )
...
* fix typo
* fix typo
* Update pruned_transducer_stateless.rst
2023-10-24 15:47:25 +08:00
Fangjun Kuang
4b791ced78
Fix CI tests ( #1333 )
2023-10-24 10:38:56 +08:00
zr_jin
f9980aa606
minor fixes ( #1332 )
2023-10-24 08:17:17 +08:00
zr_jin
92ef561ff7
Minor fixes for torch.jit.script support ( #1329 )
2023-10-24 01:10:50 +08:00
Fangjun Kuang
902dc2364a
Update docker for torch 2.1 ( #1326 )
2023-10-22 23:25:06 +08:00
Yifan Yang
416852e8a1
Add Zipformer recipe for GigaSpeech ( #1254 )
...
Co-authored-by: Yifan Yang <yifanyeung@qq.com>
Co-authored-by: yfy62 <yfy62@d3-hpc-sjtu-test-005.cm.cluster>
2023-10-21 15:36:59 +08:00
Rudra
eef47adee9
fix typo ( #1324 )
2023-10-19 22:54:43 +08:00
Daniel Povey
973dc1026d
Make diagnostics.py more error-tolerant and have wider range of supported torch versions ( #1234 )
2023-10-19 22:54:00 +08:00
Karel Vesely
543b4cc1ca
small enhanecements ( #1322 )
...
- add extra check of 'x' and 'x_lens' to earlier point in Transducer model
- specify 'utf' encoding when opening text files for writing (recogs,
errs)
2023-10-19 21:53:31 +08:00
marcoyang1998
ce372cce33
Update documentation to PromptASR ( #1321 )
2023-10-19 17:24:31 +08:00
Surav Shrestha
36c60b0cf6
fix typos in icefall/utils.py ( #1319 )
2023-10-19 11:15:18 +08:00
Ikko Eltociear Ashimine
98c5286404
Fix typo in code-style.rst ( #1318 )
2023-10-19 00:13:50 +08:00
marcoyang1998
52c24df61d
Fix model avg ( #1317 )
...
* fix a bug about the model_avg during finetuning by exchanging the order of loading pre-trained model and initializing avg model
* only match the exact module prefix
2023-10-18 17:36:14 +08:00
Erwan Zerhouni
807816fec0
Fix chunk issue for sherpa ( #1316 )
2023-10-18 16:07:10 +08:00
zr_jin
d2bd0933b1
Compatibility with the latest Lhotse ( #1314 )
2023-10-17 21:22:32 +08:00
zr_jin
1ef349d120
[WIP] AISHELL-1 pruned transducer stateless7 streaming recipe ( #1300 )
...
* `pruned_transudcer_stateless7_streaming` for AISHELL-1
* Update train.py
* Update train2.py
* Update decode.py
* Update RESULTS.md
2023-10-16 16:28:16 +08:00
zr_jin
eeeeef390b
Minor bug fixes and descriptive text for the LibriCSS
recipe ( #1268 )
2023-10-12 10:02:49 -04:00
zr_jin
162ceaf4b3
fixes for data preparation ( #1307 )
...
Issue: #1306
2023-10-12 17:05:41 +08:00
zr_jin
855492156a
Update finetune.py ( #1304 )
2023-10-12 16:48:23 +08:00
Wen Ding
2b3c5d799f
Fix padding issues ( #1303 )
2023-10-11 16:58:00 +08:00
marcoyang1998
16a2748d6c
PromptASR for contextualized ASR with controllable style ( #1250 )
...
* Add PromptASR with BERT as text encoder
* Support using word-list based content prompts for context biasing
* Upload the pretrained models to huggingface
* Add usage example
2023-10-11 14:56:41 +08:00
Fangjun Kuang
cb874e9905
add export-onnx.py for stateless8 ( #1302 )
...
* add export-onnx.py for stateless8
* use tokens.txt to replace bpe.model
2023-10-11 12:20:12 +08:00
zr_jin
103d617380
bug fixes ( #1301 )
2023-10-11 11:04:20 +08:00
zr_jin
0d09a44930
Update train.py ( #1299 )
2023-10-11 10:06:00 +08:00
Zengwei Yao
9af144c26b
Zipformer update result ( #1296 )
...
* update Zipformer results
2023-10-09 23:15:22 +08:00
zr_jin
fefffc02f6
Update optim.py ( #1292 )
2023-10-09 17:39:23 +08:00
zr_jin
ce08230ade
Update README.md ( #1293 )
2023-10-07 11:57:30 +08:00
zr_jin
82199b8fe1
Init commit for swbd ( #1146 )
2023-10-07 11:44:18 +08:00
Fangjun Kuang
109354b6b8
Add CTC HLG decoding for zipformer ( #1287 )
2023-10-02 14:00:06 +08:00
Fangjun Kuang
f14b673408
Add HLG decoding with OpenFst on CPU for aishell conformer_ctc ( #1279 )
2023-10-01 13:46:16 +08:00