icefall

mirrors/icefall

Fork 0

mirror of https://github.com/k2-fsa/icefall.git synced 2025-08-26 10:16:14 +00:00

Commit Graph

Select branches

Hide Pull Requests

conformer-ctc-readme

feature/lhotse-shar-example

gh-pages

master

streaming

#10

#100

#100

#1002

#1004

#1005

#1007

#1008

#1010

#1013

#1014

#1015

#1016

#1017

#1018

#1019

#1020

#1021

#1023

#1023

#1026

#1027

#1027

#1028

#1029

#1033

#1034

#1036

#1038

#1039

#104

#1043

#1044

#1046

#1047

#1048

#1049

#1050

#1051

#1052

#1053

#1053

#1055

#1057

#1058

#1059

#1060

#1061

#1061

#1066

#1067

#1070

#1072

#1074

#1075

#1076

#1077

#1078

#108

#1080

#1080

#1082

#1085

#1086

#109

#1093

#1095

#1096

#1097

#1099

#1099

#1101

#1102

#1104

#1105

#1106

#1106

#1107

#1108

#1109

#111

#1110

#1111

#1112

#1113

#1114

#1116

#1117

#1120

#1120

#1121

#1123

#1124

#1125

#1126

#1127

#1128

#1129

#113

#1130

#1131

#1132

#1133

#1135

#114

#1141

#1142

#1144

#1146

#1148

#115

#1150

#1152

#1153

#1157

#1158

#1159

#1160

#1161

#1162

#1164

#1165

#1166

#1167

#117

#1170

#1172

#1173

#1173

#1175

#1175

#1176

#1177

#1177

#1179

#118

#1180

#1181

#1183

#1185

#1186

#1187

#1188

#1189

#1190

#1190

#1191

#1193

#1194

#1197

#1198

#12

#120

#1200

#1202

#1204

#1207

#1208

#121

#1212

#1213

#1214

#1215

#1216

#1217

#122

#1220

#1222

#1222

#1226

#1226

#1229

#123

#1232

#1234

#1238

#1239

#124

#1240

#1241

#1242

#1243

#1244

#1248

#1249

#125

#1250

#1252

#1254

#1255

#1256

#1257

#1259

#1260

#1261

#1262

#1263

#1264

#1265

#1266

#1267

#1268

#1269

#127

#1270

#1272

#1273

#1275

#1277

#1278

#1279

#128

#1287

#129

#1290

#1291

#1291

#1292

#1293

#1296

#1297

#1299

#13

#1300

#1301

#1302

#1303

#1304

#1307

#1308

#131

#1310

#1310

#1313

#1314

#1316

#1317

#1318

#1319

#1321

#1322

#1324

#1325

#1326

#1329

#1330

#1332

#1333

#1334

#1336

#1337

#1338

#134

#1340

#1342

#1343

#1345

#1351

#1354

#1358

#1359

#1361

#1362

#1362

#1364

#1366

#1369

#1369

#137

#1372

#1374

#1376

#1376

#138

#138

#1380

#1381

#1385

#1386

#1389

#139

#1391

#1391

#1393

#1398

#14

#140

#1400

#1403

#1403

#1405

#1407

#1408

#141

#1410

#1411

#1412

#1413

#1415

#1416

#1421

#1422

#1424

#1425

#1427

#1428

#143

#1430

#1431

#1432

#1435

#1436

#1437

#1438

#1438

#1441

#1443

#1447

#1448

#1449

#145

#1450

#1455

#1460

#1464

#1466

#1467

#1468

#1469

#147

#1470

#1471

#1472

#1474

#1475

#1476

#148

#1482

#1483

#1484

#1485

#1487

#149

#1490

#1491

#1493

#1495

#1497

#1498

#1499

#15

#150

#150

#1500

#1501

#1502

#1503

#1504

#1509

#151

#1510

#1511

#1512

#1513

#1515

#1516

#152

#1520

#1520

#1521

#1522

#1524

#1526

#1527

#1528

#1529

#153

#1530

#1532

#1534

#1537

#1538

#1540

#1541

#1543

#1544

#1545

#1546

#1547

#1548

#155

#1550

#1551

#1554

#1555

#1556

#1557

#156

#1560

#1562

#1563

#1564

#1565

#1566

#1567

#1568

#157

#1571

#1571

#1573

#1575

#1577

#1578

#158

#158

#1582

#1582

#1583

#1583

#1584

#1585

#1586

#159

#159

#1590

#1593

#1593

#16

#160

#1601

#1602

#1603

#1604

#1605

#1607

#1609

#1611

#1613

#1617

#1619

#162

#1621

#1622

#1622

#1626

#1630

#1633

#1635

#164

#1643

#1645

#1645

#1646

#1647

#1648

#1649

#165

#1651

#1652

#1655

#1656

#1657

#166

#166

#1660

#1662

#1662

#1663

#1664

#1665

#1667

#1669

#1669

#167

#167

#1671

#1677

#1678

#1679

#168

#1681

#1682

#1683

#1684

#1684

#1686

#1687

#1689

#169

#1690

#1691

#1693

#1694

#17

#17

#170

#1700

#1704

#1706

#1707

#1707

#1708

#1708

#171

#1712

#1713

#1713

#1714

#1719

#172

#1721

#1722

#1727

#1730

#1732

#1732

#1734

#174

#1743

#1744

#1745

#1745

#1746

#1747

#1748

#1749

#1750

#1752

#1754

#1755

#1757

#1763

#1763

#1766

#1767

#1768

#1769

#1769

#177

#1770

#1772

#1773

#1774

#1775

#1776

#1778

#178

#178

#1781

#1782

#1785

#1786

#1787

#1788

#179

#1790

#1790

#1791

#1792

#1793

#1793

#1794

#1797

#18

#180

#1800

#1800

#1802

#1802

#1805

#1808

#1812

#1814

#1815

#1816

#1818

#1819

#182

#1820

#1821

#1821

#1825

#1827

#1828

#1829

#183

#183

#1830

#1835

#1837

#1838

#184

#184

#1840

#1841

#1845

#1845

#1846

#1849

#185

#1851

#1852

#1853

#1854

#1857

#1859

#186

#186

#1860

#1862

#1865

#1866

#1868

#187

#187

#1872

#1873

#1880

#1882

#1887

#1887

#1892

#1892

#1894

#1894

#19

#19

#190

#190

#1901

#1901

#1905

#191

#191

#1914

#1915

#1916

#1919

#1919

#192

#1926

#1929

#193

#1935

#1935

#1936

#194

#1940

#1941

#1942

#1942

#1944

#1944

#1947

#1949

#1950

#1952

#1954

#1955

#1959

#1959

#1964

#1965

#1966

#1967

#1969

#1973

#1974

#1975

#1975

#1976

#1977

#1979

#198

#1980

#1984

#1986

#1988

#199

#1990

#1991

#1992

#1992

#1995

#1997

#1997

#1999

#200

#200

#202

#204

#205

#205

#207

#208

#208

#21

#211

#213

#214

#215

#216

#216

#217

#218

#219

#22

#221

#222

#222

#223

#228

#229

#230

#231

#233

#234

#235

#236

#237

#239

#24

#241

#242

#242

#243

#244

#245

#246

#248

#25

#250

#251

#251

#252

#253

#254

#258

#259

#26

#260

#261

#262

#264

#265

#266

#267

#269

#27

#271

#272

#272

#274

#277

#278

#279

#28

#280

#281

#282

#283

#284

#285

#285

#287

#288

#289

#29

#291

#294

#295

#296

#298

#298

#299

#3

#30

#300

#301

#302

#303

#305

#307

#308

#309

#31

#310

#311

#312

#313

#314

#315

#316

#318

#321

#322

#323

#323

#325

#325

#326

#326

#327

#329

#330

#332

#333

#334

#336

#338

#339

#340

#343

#344

#345

#346

#346

#347

#348

#349

#350

#351

#352

#353

#354

#355

#356

#358

#359

#360

#361

#362

#363

#364

#365

#366

#367

#368

#369

#370

#371

#372

#373

#375

#376

#377

#378

#379

#38

#380

#382

#384

#386

#386

#387

#388

#389

#39

#390

#392

#395

#395

#396

#397

#398

#399

#4

#40

#400

#401

#402

#404

#407

#409

#41

#410

#411

#412

#413

#416

#417

#419

#419

#42

#420

#421

#425

#425

#427

#428

#429

#430

#433

#434

#435

#436

#437

#438

#439

#44

#440

#443

#444

#445

#447

#448

#448

#449

#45

#450

#451

#452

#453

#454

#456

#458

#458

#46

#460

#461

#462

#464

#465

#467

#468

#469

#470

#471

#472

#472

#475

#477

#477

#479

#481

#482

#483

#484

#485

#487

#488

#489

#490

#490

#492

#493

#493

#494

#495

#496

#497

#5

#50

#501

#504

#506

#507

#509

#509

#51

#512

#513

#514

#514

#516

#517

#518

#519

#52

#522

#523

#524

#525

#526

#527

#528

#529

#530

#530

#531

#532

#532

#533

#536

#537

#538

#539

#539

#54

#54

#540

#541

#542

#544

#545

#545

#546

#549

#55

#550

#551

#551

#552

#553

#554

#555

#558

#560

#560

#561

#562

#562

#563

#563

#564

#565

#565

#567

#568

#57

#571

#572

#573

#573

#574

#575

#58

#583

#584

#586

#588

#588

#591

#593

#595

#595

#597

#598

#6

#60

#601

#601

#604

#606

#609

#611

#612

#613

#614

#615

#617

#618

#619

#62

#621

#622

#623

#624

#624

#625

#627

#628

#629

#63

#63

#630

#631

#632

#635

#638

#639

#64

#640

#642

#645

#647

#648

#649

#65

#650

#653

#654

#656

#657

#659

#660

#662

#663

#663

#665

#668

#669

#670

#672

#675

#676

#678

#679

#679

#680

#681

#683

#686

#687

#688

#690

#691

#692

#693

#696

#698

#7

#700

#701

#704

#705

#706

#71

#717

#719

#72

#72

#720

#721

#721

#722

#725

#726

#727

#728

#729

#729

#73

#730

#731

#732

#732

#735

#737

#738

#742

#745

#745

#746

#75

#750

#751

#752

#753

#755

#758

#76

#762

#765

#768

#77

#773

#774

#778

#78

#782

#782

#783

#784

#787

#789

#79

#790

#791

#792

#795

#796

#797

#798

#799

#799

#8

#8

#80

#801

#804

#806

#808

#808

#81

#81

#812

#813

#815

#82

#820

#821

#822

#823

#824

#827

#828

#829

#829

#83

#830

#831

#832

#833

#835

#838

#84

#843

#844

#848

#849

#85

#852

#854

#856

#858

#86

#86

#861

#862

#863

#865

#868

#868

#869

#87

#870

#871

#874

#874

#875

#876

#879

#880

#881

#882

#883

#884

#888

#89

#890

#891

#892

#893

#894

#895

#897

#898

#9

#90

#900

#901

#902

#903

#904

#904

#905

#906

#907

#907

#908

#91

#91

#912

#913

#914

#915

#916

#919

#927

#933

#933

#934

#936

#937

#94

#941

#942

#943

#944

#945

#947

#949

#95

#95

#950

#950

#953

#954

#958

#958

#96

#961

#961

#962

#965

#967

#968

#969

#970

#971

#972

#974

#975

#976

#977

#98

#980

#981

#982

#983

#984

#985

#986

#988

#990

#992

#992

#993

#994

#995

#996

#997

v0.1

v1.0

v1.1

a9abcc5fda Add grid AVSR task results Mingshuang Luo 2021-12-22 11:20:41 +08:00
afec6b6cae Update greedy search for modified decoder. Fangjun Kuang 2021-12-21 11:37:56 +08:00
27bfcc4ea8 Add grid ASR task results Mingshuang Luo 2021-12-20 15:13:41 +08:00
4aa3149084

Merge branch 'k2-fsa:master' into grid-a-vsr-recipe Mingshuang Luo 2021-12-20 11:06:22 +08:00
04977175a3 Increase the size of the context in the RNN-T decoder. Fangjun Kuang 2021-12-18 23:54:31 +08:00
d362a3dba7 Reduce the number of decoder layers from 4 to 2. Fangjun Kuang 2021-12-18 11:25:33 +08:00
9d0d5d19fb Remove sos ID. Fangjun Kuang 2021-12-18 11:22:12 +08:00
63e1266e3a Use tanh in the joint network. Fangjun Kuang 2021-12-18 11:14:05 +08:00
66cc9b4592 Replace BatchNorm in the Conformer model with LayerNorm. Fangjun Kuang 2021-12-18 11:12:38 +08:00
4635af633a Remove input feature batchnorm.. Fangjun Kuang 2021-12-18 11:05:28 +08:00
4eb5e7864a Disable weight decay. Fangjun Kuang 2021-12-18 11:03:10 +08:00
cb04c8a750

Limit the number of symbols per frame in RNN-T decoding. (#151) Fangjun Kuang 2021-12-18 11:00:42 +08:00
f8d02d633c Limit the number of symbols per frame in RNN-T decoding. Fangjun Kuang 2021-12-18 10:13:33 +08:00
1d44da845b

RNN-T Conformer training for LibriSpeech (#143) Fangjun Kuang 2021-12-18 07:42:51 +08:00
9d68199322 Fix tests. Fangjun Kuang 2021-12-17 20:51:50 +08:00
9fad0fd915 Minor fixes. Fangjun Kuang 2021-12-17 20:39:34 +08:00
be493ad913 Minor fixes. Fangjun Kuang 2021-12-17 20:22:26 +08:00
270febb638 Fix tests. Fangjun Kuang 2021-12-17 20:21:04 +08:00
9639f6dc0a Minor fixes. Fangjun Kuang 2021-12-17 20:19:36 +08:00
47b0f2ec2f Update RESULT.md to include RNN-T Conformer. Fangjun Kuang 2021-12-17 20:16:47 +08:00
d7eb94c4c9 Fix README. Fangjun Kuang 2021-12-17 19:46:28 +08:00
164321c79d Minor fixes to make it ready for merge. Fangjun Kuang 2021-12-17 19:43:04 +08:00
f6a33a85c5 Use stateless decoder. Fangjun Kuang 2021-12-17 16:48:57 +08:00
bea78f6094 lazy loading and use SingleCutSampler wgb14 2021-12-17 00:38:52 -05:00
fcc22d3e91 Use LSTM layers for the encoder. Fangjun Kuang 2021-12-17 11:58:30 +08:00
532309bf72 Add conformer.py without pre-commit checking Guanbo Wang 2021-12-16 20:20:41 -05:00
76a289126f add conformer training recipe wgb14 2021-12-16 20:18:02 -05:00
71ef6a9e11 Merge remote-tracking branch 'upstream/master' into gigaspeech_recipe Guanbo Wang 2021-12-16 19:13:14 -05:00
738eeea301 Merge branch 'grid-a-vsr-recipe' of https://github.com/luomingshuang/icefall into grid-a-vsr-recipe Mingshuang Luo 2021-12-15 23:29:41 +08:00
a5c1bcd58c Update prepare.sh Mingshuang Luo 2021-12-15 23:29:37 +08:00
e42730d08c

Merge branch 'k2-fsa:master' into master Mingshuang Luo 2021-12-15 23:18:45 +08:00
798f44280e

Merge branch 'k2-fsa:master' into grid-a-vsr-recipe Mingshuang Luo 2021-12-15 23:17:55 +08:00
e8ad083cf7 Update prepare.sh Mingshuang Luo 2021-12-15 22:41:40 +08:00
bdb46c2cd3 Update prepare.sh Mingshuang Luo 2021-12-15 22:39:41 +08:00
85aacf4813 Update prepare.sh Mingshuang Luo 2021-12-15 22:26:54 +08:00
c4c8d02934 [WIP] A lip reading recipe (GRID recipe) based on icefall Mingshuang Luo 2021-12-15 22:11:57 +08:00
3174bebf07 Add beam search. Fangjun Kuang 2021-12-15 18:50:29 +08:00
cbda811a10 Minor fixes. Fangjun Kuang 2021-12-15 08:43:38 +08:00
76a51bf037

Fix aishell tdnn_lstm_ctc decoding (#149) Wei Kang 2021-12-14 14:42:58 +08:00
3015dabba9 Fix aishell tdnn_lstm_ctc decoding pkufool 2021-12-14 14:39:26 +08:00
a183d5bfd7

Remove batchnorm (#147) Wei Kang 2021-12-14 08:20:03 +08:00
67ed6225a2 Add assertion for use_feat_batchnorm pkufool 2021-12-14 08:11:44 +08:00
e38f04e70f Add decoding script. Fangjun Kuang 2021-12-13 19:49:50 +08:00
73ba843d0a Begin to add decoding script. Fangjun Kuang 2021-12-13 17:08:27 +08:00
89a08b64ce Remove long utterances to avoid OOM when a large max_duraiton is used. Fangjun Kuang 2021-12-13 16:41:14 +08:00
9142bbb17d Update conformer.py Mingshuang Luo 2021-12-13 16:02:25 +08:00
4392da7235 Update the modified attention codes Mingshuang Luo 2021-12-13 15:15:15 +08:00
cd5ed7db20 Add training code. Fangjun Kuang 2021-12-13 13:50:53 +08:00
e442369987 Some experiments with modified attention Mingshuang Luo 2021-12-13 13:21:07 +08:00
232caf51ee Begin to add training script. Fangjun Kuang 2021-12-13 11:15:35 +08:00
5bfcf65cca Fix comments pkufool 2021-12-11 13:36:50 +08:00
06a86f50b9 Fix typo pkufool 2021-12-10 16:10:04 +08:00
6dec4b2d8a Minor fixes pkufool 2021-12-10 15:30:00 +08:00
db924dcef5 Remove batch normalization pkufool 2021-12-10 14:30:33 +08:00
ca15b32b76

Install torchaudio with pytorch Piotr Żelasko 2021-12-09 13:56:45 -05:00
984f598267 remove mypy cache Patrick von Platen 2021-12-08 23:47:07 +01:00
0c7fe37e2f add hf hub Patrick von Platen 2021-12-08 23:01:51 +01:00
5d314b03c5

Merge branch 'k2-fsa:master' into master Mingshuang Luo 2021-12-08 10:21:26 +08:00
f5199d37c4 Use conformer/transformer model as encoder. Fangjun Kuang 2021-12-07 23:20:59 +08:00
f802758fca Copy files from conformer_ctc. Fangjun Kuang 2021-12-07 22:25:31 +08:00
5802d5ad2e Begin to add RNN-T training for librispeech. Fangjun Kuang 2021-12-07 22:24:18 +08:00
95af039733

RNN-T training for yesno. (#141) Fangjun Kuang 2021-12-07 21:44:37 +08:00
1aff64b708

Apply layer normalization to the output of each gate in LSTM/GRU. (#139) Fangjun Kuang 2021-12-07 18:38:03 +08:00
e47fab29a5 Fix errors. Fangjun Kuang 2021-12-07 17:36:02 +08:00
b86f45e217 Rename Jointer to Joiner. Fangjun Kuang 2021-12-07 10:37:48 +08:00
8038d13ec5 RNN-T training for yesno. Fangjun Kuang 2021-12-06 16:50:31 +08:00
cafd06e909 Fix test failures for torch 1.8.0 Fangjun Kuang 2021-12-06 10:42:01 +08:00
b3a5b04e13 Fix CI. Fangjun Kuang 2021-12-05 17:13:44 +08:00
3048d59968 Fix CI. Fangjun Kuang 2021-12-04 16:35:04 +08:00
d1adc25338

Update AIShell recipe result (#140) pingfengluo 2021-12-04 14:43:04 +08:00
cdc15634ec typo PingFeng Luo 2021-12-04 11:49:33 +08:00
45d31e5f34 update PingFeng Luo 2021-12-04 11:40:14 +08:00
3351106e3b fix conflicts PingFeng Luo 2021-12-04 11:30:36 +08:00
0af744e518 update AIShell result PingFeng Luo 2021-12-04 10:42:31 +08:00
e62fe73104 Minor fixes. Fangjun Kuang 2021-12-04 10:55:58 +08:00
4316ec43d7 small fix wgb14 2021-12-03 16:34:36 -05:00
8df3220cb7 Add typeguard as a requirement. Fangjun Kuang 2021-12-04 00:13:21 +08:00
273c48d94d Use typeguard.check_argument_types() to validate type annotations. Fangjun Kuang 2021-12-04 00:08:08 +08:00
3d38f7bd31 Add GPU tests. Fangjun Kuang 2021-12-03 17:22:42 +08:00
2c7547e1b7 Add projection support to LayerNormLSTMCell. Fangjun Kuang 2021-12-03 16:47:40 +08:00
1d004ca966 Apply layer normalization to the output of each gate in GRU. Fangjun Kuang 2021-12-03 14:59:19 +08:00
d7f9dacf0d use a faster way to get the intersection of train and aishell_transcript_v0.8.txt PingFeng Luo 2021-12-03 14:33:10 +08:00
00b5ac5815 fix data prepare to just use train text by uid PingFeng Luo 2021-12-03 11:25:24 +08:00
8a038b8f1a Apply layer normalization to the output of each gate in LSTM. Fangjun Kuang 2021-12-02 20:08:51 +08:00
54bcc167e1 fix ci Guo Liyong 2021-12-02 17:46:14 +08:00
a4722dd7c0 training with coodbook loss Guo Liyong 2021-12-02 17:16:48 +08:00
89b84208aa

add phone based LF-MMI training to AIShell recipe (#137) pingfengluo 2021-12-02 12:32:23 +08:00
bc0b6eed5c typo PingFeng Luo 2021-12-02 12:09:44 +08:00
e14decf75e fix code style PingFeng Luo 2021-12-02 11:11:50 +08:00
08db15d8d5 typo PingFeng Luo 2021-12-02 10:19:36 +08:00
85db336efb Merge branch 'master' of https://github.com/k2-fsa/icefall PingFeng Luo 2021-12-02 10:10:19 +08:00
cf50e16047 export model PingFeng Luo 2021-12-01 18:19:03 +08:00
64bd3f7df4 set audio duration mismatch tolerance to 0.01 wgb14 2021-12-01 17:49:46 -05:00
4b6edaa4a3 fix MMI decode graph PingFeng Luo 2021-12-01 11:22:25 +08:00
a54f9a9b41 add MMI to AIShell PingFeng Luo 2021-11-30 11:16:38 +08:00
b8beb00ecc

Merge pull request #2 from csukuangfj/fix-giga Wang, Guanbo 2021-11-30 00:28:58 -05:00
8109c2b913 Split manifests into 2000 pieces. Fangjun Kuang 2021-11-30 12:04:15 +08:00
ec591698b0

Associate a cut with token alignment (without repeats) (#125) Fangjun Kuang 2021-11-29 18:50:54 +08:00
ee7c56c7d9

Merge pull request #1 from csukuangfj/fix-giga Wang, Guanbo 2021-11-28 02:19:57 -05:00
4351e1ea14 Fixes after review. Fangjun Kuang 2021-11-28 15:10:55 +08:00