Fangjun Kuang 63563d16d3
Fix setting joiner dim (#2027)
Fixes incorrect computation of encoder_dim when encoder_dim is a comma-separated list of integers by ensuring numeric (not lexicographic) max is used.

Fixes #2018

- Replace int(max(params.encoder_dim.split(","))) (lexicographic max on strings) with max(_to_int_tuple(params.encoder_dim)) (numeric max).
- Apply the fix consistently across all affected training scripts.
2025-09-19 09:42:41 +08:00
..
2024-10-31 09:54:12 +08:00

Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context

Libriheavy is a labeled version of Librilight. Please refer to our repository k2-fsa/libriheavy for more details. We also have a paper: Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context, Preprint available on arxiv.

See RESULTS for the results for icefall recipes.