icefall/egs at 5df24c168519239b8c2f8327b666cae61d359a65 - icefall - Bi Git

mirrors/icefall

History

Yuekai Zhang 5df24c1685

Whisper large fine-tuning on wenetspeech, mutli-hans-zh (#1483 )

* add whisper fbank for wenetspeech

* add whisper fbank for other dataset

* add str to bool

* add decode for wenetspeech

* add requirments.txt

* add original model decode with 30s

* test feature extractor speed

* add aishell2 feat

* change compute feature batch

* fix overwrite

* fix executor

* regression

* add kaldifeatwhisper fbank

* fix io issue

* parallel jobs

* use multi machines

* add wenetspeech fine-tune scripts

* add monkey patch codes

* remove useless file

* fix subsampling factor

* fix too long audios

* add remove long short

* fix whisper version to support multi batch beam

* decode all wav files

* remove utterance more than 30s in test_net

* only test net

* using soft links

* add kespeech whisper feats

* fix index error

* add manifests for whisper

* change to licomchunky writer

* add missing option

* decrease cpu usage 

* add speed perturb for kespeech

* fix kespeech speed perturb

* add dataset

* load checkpoint from specific path

* add speechio

* add speechio results

---------

Co-authored-by: zr_jin <peter.jin.cn@gmail.com>

2024-03-07 19:04:27 +08:00

..

aidatatang_200zh/ASR

Fix torchscript export to use tokens.txt instead of lang_dir (#1475 )

2024-01-26 19:18:33 +08:00

Whisper large fine-tuning on wenetspeech, mutli-hans-zh (#1483 )

2024-03-07 19:04:27 +08:00

Whisper large fine-tuning on wenetspeech, mutli-hans-zh (#1483 )

2024-03-07 19:04:27 +08:00

Whisper large fine-tuning on wenetspeech, mutli-hans-zh (#1483 )

2024-03-07 19:04:27 +08:00

Whisper large fine-tuning on wenetspeech, mutli-hans-zh (#1483 )

2024-03-07 19:04:27 +08:00

Fix buffer size of DynamicBucketingSampler (#1468 )

2024-01-21 02:10:42 +08:00

commonvoice/ASR

add text norm script for pl (#1532 )

2024-03-07 18:47:29 +08:00

Strengthened style constraints (#1527 )

2024-03-04 23:28:04 +08:00

fluent_speech_commands/SLU

Update shared (#1487 )

2024-02-07 10:16:02 +08:00

Strengthened style constraints (#1527 )

2024-03-04 23:28:04 +08:00

Fix buffer size of DynamicBucketingSampler (#1468 )

2024-01-21 02:10:42 +08:00

Fix buffer size of DynamicBucketingSampler (#1468 )

2024-01-21 02:10:42 +08:00

Fixed formatting issue of PR #1528 (#1530 )

2024-03-06 08:43:45 +08:00

Update export-onnx.py for vits to support sherpa-onnx. (#1524 )

2024-03-01 19:53:58 +08:00

Fix buffer size of DynamicBucketingSampler (#1468 )

2024-01-21 02:10:42 +08:00

multi_zh_en/ASR

Fix buffer size of DynamicBucketingSampler (#1468 )

2024-01-21 02:10:42 +08:00

multi_zh-hans/ASR

Whisper large fine-tuning on wenetspeech, mutli-hans-zh (#1483 )

2024-03-07 19:04:27 +08:00

Strengthened style constraints (#1527 )

2024-03-04 23:28:04 +08:00

peoples_speech/ASR

typos fixed (#1472 )

2024-01-25 18:41:43 +08:00

Update train-rnn-lm.sh (#1337 )

2023-10-25 12:50:35 +08:00

Whisper large fine-tuning on wenetspeech, mutli-hans-zh (#1483 )

2024-03-07 19:04:27 +08:00

Zipformer recipe for SPGISpeech (#1449 )

2024-02-22 15:53:19 +08:00

Strengthened style constraints (#1527 )

2024-03-04 23:28:04 +08:00

Strengthened style constraints (#1527 )

2024-03-04 23:28:04 +08:00

Strengthened style constraints (#1527 )

2024-03-04 23:28:04 +08:00

Fix buffer size of DynamicBucketingSampler (#1468 )

2024-01-21 02:10:42 +08:00

Provides README.md for TTS recipes (#1491 )

2024-02-29 17:31:28 +08:00

add the voxpopuli recipe (#1374 )

2023-11-16 14:38:31 +08:00

Whisper large fine-tuning on wenetspeech, mutli-hans-zh (#1483 )

2024-03-07 19:04:27 +08:00

xbmu_amdo31/ASR

Fix buffer size of DynamicBucketingSampler (#1468 )

2024-01-21 02:10:42 +08:00

Strengthened style constraints (#1527 )

2024-03-04 23:28:04 +08:00