Karel Vesely
716b82cc3a
streaming_decode.py, relax the audio range from [-1,+1] to [-10,+10] ( #1448 )
...
- some AudioTransform classes produce audio signals out of range [-1,+1]
- Resample produced 1.0079
- The range [-10,+10] was chosen to still be able to reliably
distinguish from the [-32k,+32k] signal...
- this is related to : https://github.com/lhotse-speech/lhotse/issues/1254
2024-01-05 10:21:27 +08:00
Fangjun Kuang
8136ad775b
Use high_freq -400 in computing fbank features. ( #1447 )
...
See also https://github.com/k2-fsa/sherpa-onnx/issues/514
2024-01-04 13:59:32 +08:00
Fangjun Kuang
e9ec827de7
Rename zipformer2 to zipformer_for_ncnn_export_only to avoid confusion. ( #1407 )
2023-12-08 14:29:24 +08:00
Fangjun Kuang
666d69b20d
Rename train2.py to avoid confusion ( #1386 )
2023-11-17 18:12:59 +08:00
zr_jin
ef658d691e
fixes for init value of diagnostics.TensorDiagnosticOptions
( #1269 )
...
* fixes for `diagnostics`
Replace `2 ** 22` with `512` as the default value of `diagnostics.TensorDiagnosticOptions`
also black formatted some scripts
* fixed formatting issues
2023-09-24 17:06:47 +08:00
Fangjun Kuang
7b0afbdc16
Remove cur_batch_idx ( #1102 )
2023-05-30 14:49:54 +08:00
Fangjun Kuang
f5de2e90c6
Fix style issues. ( #937 )
2023-03-08 22:56:04 +08:00
pehonnet
07243d136a
remove key from result filename ( #936 )
...
Co-authored-by: pe-honnet <pe.honnet@telepathy.ai>
2023-03-08 21:06:07 +08:00
Fangjun Kuang
6d7a55904c
export script to ncnn for csj ( #912 )
2023-02-16 19:47:54 +08:00
Teo Wen Shen
e63a8c27f8
CSJ pruned_transducer_stateless7_streaming ( #892 )
...
* update manifest stats
* update transcript configs
* lang_char and compute_fbanks
* save cuts in fbank_dir
* add core codes
* update decode.py
* Create local/utils
* tidy up
* parse raw in prepare_lang_char.py
* update manifest stats
* update transcript configs
* lang_char and compute_fbanks
* save cuts in fbank_dir
* add core codes
* update decode.py
* Create local/utils
* tidy up
* parse raw in prepare_lang_char.py
* working train
* Add compare_cer_transcript.py
* fix tokenizer decode, allow d2f only
* comment cleanup
* add export files and READMEs
* reword average column
* fix comments
* Update new results
2023-02-13 22:19:50 +08:00