mirror of
https://github.com/k2-fsa/icefall.git
synced 2025-08-09 01:52:41 +00:00
deploy: d89f4ea1492ee7036368e9b46b66acd01a62c46a
This commit is contained in:
parent
a90f45427e
commit
7edf08e0ce
@ -1,4 +1,4 @@
|
||||
VITS
|
||||
VITS-LJSpeech
|
||||
===============
|
||||
|
||||
This tutorial shows you how to train an VITS model
|
||||
@ -120,4 +120,4 @@ Download pretrained models
|
||||
If you don't want to train from scratch, you can download the pretrained models
|
||||
by visiting the following link:
|
||||
|
||||
- `<https://huggingface.co/Zengwei/icefall-tts-ljspeech-vits-2023-11-29>`_
|
||||
- `<https://huggingface.co/Zengwei/icefall-tts-ljspeech-vits-2024-02-28>`_
|
||||
|
@ -1,4 +1,4 @@
|
||||
VITS
|
||||
VITS-VCTK
|
||||
===============
|
||||
|
||||
This tutorial shows you how to train an VITS model
|
||||
|
@ -159,8 +159,8 @@ speech recognition recipes using <a class="reference external" href="https://git
|
||||
</ul>
|
||||
</li>
|
||||
<li class="toctree-l2"><a class="reference internal" href="recipes/TTS/index.html">TTS</a><ul>
|
||||
<li class="toctree-l3"><a class="reference internal" href="recipes/TTS/ljspeech/vits.html">VITS</a></li>
|
||||
<li class="toctree-l3"><a class="reference internal" href="recipes/TTS/vctk/vits.html">VITS</a></li>
|
||||
<li class="toctree-l3"><a class="reference internal" href="recipes/TTS/ljspeech/vits.html">VITS-LJSpeech</a></li>
|
||||
<li class="toctree-l3"><a class="reference internal" href="recipes/TTS/vctk/vits.html">VITS-VCTK</a></li>
|
||||
</ul>
|
||||
</li>
|
||||
<li class="toctree-l2"><a class="reference internal" href="recipes/Finetune/index.html">Fine-tune a pre-trained model</a><ul>
|
||||
|
BIN
objects.inv
BIN
objects.inv
Binary file not shown.
@ -22,7 +22,7 @@
|
||||
<link rel="index" title="Index" href="../../genindex.html" />
|
||||
<link rel="search" title="Search" href="../../search.html" />
|
||||
<link rel="next" title="Finetune from a supervised pre-trained Zipformer model" href="from_supervised/finetune_zipformer.html" />
|
||||
<link rel="prev" title="VITS" href="../TTS/vctk/vits.html" />
|
||||
<link rel="prev" title="VITS-VCTK" href="../TTS/vctk/vits.html" />
|
||||
</head>
|
||||
|
||||
<body class="wy-body-for-nav">
|
||||
@ -122,7 +122,7 @@ data to improve the performance on new domains.</p>
|
||||
</div>
|
||||
</div>
|
||||
<footer><div class="rst-footer-buttons" role="navigation" aria-label="Footer">
|
||||
<a href="../TTS/vctk/vits.html" class="btn btn-neutral float-left" title="VITS" accesskey="p" rel="prev"><span class="fa fa-arrow-circle-left" aria-hidden="true"></span> Previous</a>
|
||||
<a href="../TTS/vctk/vits.html" class="btn btn-neutral float-left" title="VITS-VCTK" accesskey="p" rel="prev"><span class="fa fa-arrow-circle-left" aria-hidden="true"></span> Previous</a>
|
||||
<a href="from_supervised/finetune_zipformer.html" class="btn btn-neutral float-right" title="Finetune from a supervised pre-trained Zipformer model" accesskey="n" rel="next">Next <span class="fa fa-arrow-circle-right" aria-hidden="true"></span></a>
|
||||
</div>
|
||||
|
||||
|
@ -21,7 +21,7 @@
|
||||
<script src="../../_static/js/theme.js"></script>
|
||||
<link rel="index" title="Index" href="../../genindex.html" />
|
||||
<link rel="search" title="Search" href="../../search.html" />
|
||||
<link rel="next" title="VITS" href="ljspeech/vits.html" />
|
||||
<link rel="next" title="VITS-LJSpeech" href="ljspeech/vits.html" />
|
||||
<link rel="prev" title="Train an RNN language model" href="../RNN-LM/librispeech/lm-training.html" />
|
||||
</head>
|
||||
|
||||
@ -58,8 +58,8 @@
|
||||
<li class="toctree-l2"><a class="reference internal" href="../Streaming-ASR/index.html">Streaming ASR</a></li>
|
||||
<li class="toctree-l2"><a class="reference internal" href="../RNN-LM/index.html">RNN-LM</a></li>
|
||||
<li class="toctree-l2 current"><a class="current reference internal" href="#">TTS</a><ul>
|
||||
<li class="toctree-l3"><a class="reference internal" href="ljspeech/vits.html">VITS</a></li>
|
||||
<li class="toctree-l3"><a class="reference internal" href="vctk/vits.html">VITS</a></li>
|
||||
<li class="toctree-l3"><a class="reference internal" href="ljspeech/vits.html">VITS-LJSpeech</a></li>
|
||||
<li class="toctree-l3"><a class="reference internal" href="vctk/vits.html">VITS-VCTK</a></li>
|
||||
</ul>
|
||||
</li>
|
||||
<li class="toctree-l2"><a class="reference internal" href="../Finetune/index.html">Fine-tune a pre-trained model</a></li>
|
||||
@ -103,7 +103,7 @@
|
||||
<h1>TTS<a class="headerlink" href="#tts" title="Permalink to this heading"></a></h1>
|
||||
<div class="toctree-wrapper compound">
|
||||
<ul>
|
||||
<li class="toctree-l1"><a class="reference internal" href="ljspeech/vits.html">VITS</a><ul>
|
||||
<li class="toctree-l1"><a class="reference internal" href="ljspeech/vits.html">VITS-LJSpeech</a><ul>
|
||||
<li class="toctree-l2"><a class="reference internal" href="ljspeech/vits.html#data-preparation">Data preparation</a></li>
|
||||
<li class="toctree-l2"><a class="reference internal" href="ljspeech/vits.html#build-monotonic-alignment-search">Build Monotonic Alignment Search</a></li>
|
||||
<li class="toctree-l2"><a class="reference internal" href="ljspeech/vits.html#training">Training</a></li>
|
||||
@ -112,7 +112,7 @@
|
||||
<li class="toctree-l2"><a class="reference internal" href="ljspeech/vits.html#download-pretrained-models">Download pretrained models</a></li>
|
||||
</ul>
|
||||
</li>
|
||||
<li class="toctree-l1"><a class="reference internal" href="vctk/vits.html">VITS</a><ul>
|
||||
<li class="toctree-l1"><a class="reference internal" href="vctk/vits.html">VITS-VCTK</a><ul>
|
||||
<li class="toctree-l2"><a class="reference internal" href="vctk/vits.html#data-preparation">Data preparation</a></li>
|
||||
<li class="toctree-l2"><a class="reference internal" href="vctk/vits.html#build-monotonic-alignment-search">Build Monotonic Alignment Search</a></li>
|
||||
<li class="toctree-l2"><a class="reference internal" href="vctk/vits.html#training">Training</a></li>
|
||||
@ -130,7 +130,7 @@
|
||||
</div>
|
||||
<footer><div class="rst-footer-buttons" role="navigation" aria-label="Footer">
|
||||
<a href="../RNN-LM/librispeech/lm-training.html" class="btn btn-neutral float-left" title="Train an RNN language model" accesskey="p" rel="prev"><span class="fa fa-arrow-circle-left" aria-hidden="true"></span> Previous</a>
|
||||
<a href="ljspeech/vits.html" class="btn btn-neutral float-right" title="VITS" accesskey="n" rel="next">Next <span class="fa fa-arrow-circle-right" aria-hidden="true"></span></a>
|
||||
<a href="ljspeech/vits.html" class="btn btn-neutral float-right" title="VITS-LJSpeech" accesskey="n" rel="next">Next <span class="fa fa-arrow-circle-right" aria-hidden="true"></span></a>
|
||||
</div>
|
||||
|
||||
<hr/>
|
||||
|
@ -4,7 +4,7 @@
|
||||
<meta charset="utf-8" /><meta name="viewport" content="width=device-width, initial-scale=1" />
|
||||
|
||||
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
|
||||
<title>VITS — icefall 0.1 documentation</title>
|
||||
<title>VITS-LJSpeech — icefall 0.1 documentation</title>
|
||||
<link rel="stylesheet" type="text/css" href="../../../_static/pygments.css?v=fa44fd50" />
|
||||
<link rel="stylesheet" type="text/css" href="../../../_static/css/theme.css?v=19f00094" />
|
||||
|
||||
@ -21,7 +21,7 @@
|
||||
<script src="../../../_static/js/theme.js"></script>
|
||||
<link rel="index" title="Index" href="../../../genindex.html" />
|
||||
<link rel="search" title="Search" href="../../../search.html" />
|
||||
<link rel="next" title="VITS" href="../vctk/vits.html" />
|
||||
<link rel="next" title="VITS-VCTK" href="../vctk/vits.html" />
|
||||
<link rel="prev" title="TTS" href="../index.html" />
|
||||
</head>
|
||||
|
||||
@ -58,7 +58,7 @@
|
||||
<li class="toctree-l2"><a class="reference internal" href="../../Streaming-ASR/index.html">Streaming ASR</a></li>
|
||||
<li class="toctree-l2"><a class="reference internal" href="../../RNN-LM/index.html">RNN-LM</a></li>
|
||||
<li class="toctree-l2 current"><a class="reference internal" href="../index.html">TTS</a><ul class="current">
|
||||
<li class="toctree-l3 current"><a class="current reference internal" href="#">VITS</a><ul>
|
||||
<li class="toctree-l3 current"><a class="current reference internal" href="#">VITS-LJSpeech</a><ul>
|
||||
<li class="toctree-l4"><a class="reference internal" href="#data-preparation">Data preparation</a></li>
|
||||
<li class="toctree-l4"><a class="reference internal" href="#build-monotonic-alignment-search">Build Monotonic Alignment Search</a></li>
|
||||
<li class="toctree-l4"><a class="reference internal" href="#training">Training</a></li>
|
||||
@ -67,7 +67,7 @@
|
||||
<li class="toctree-l4"><a class="reference internal" href="#download-pretrained-models">Download pretrained models</a></li>
|
||||
</ul>
|
||||
</li>
|
||||
<li class="toctree-l3"><a class="reference internal" href="../vctk/vits.html">VITS</a></li>
|
||||
<li class="toctree-l3"><a class="reference internal" href="../vctk/vits.html">VITS-VCTK</a></li>
|
||||
</ul>
|
||||
</li>
|
||||
<li class="toctree-l2"><a class="reference internal" href="../../Finetune/index.html">Fine-tune a pre-trained model</a></li>
|
||||
@ -98,7 +98,7 @@
|
||||
<li><a href="../../../index.html" class="icon icon-home" aria-label="Home"></a></li>
|
||||
<li class="breadcrumb-item"><a href="../../index.html">Recipes</a></li>
|
||||
<li class="breadcrumb-item"><a href="../index.html">TTS</a></li>
|
||||
<li class="breadcrumb-item active">VITS</li>
|
||||
<li class="breadcrumb-item active">VITS-LJSpeech</li>
|
||||
<li class="wy-breadcrumbs-aside">
|
||||
<a href="https://github.com/k2-fsa/icefall/blob/master/docs/source/recipes/TTS/ljspeech/vits.rst" class="fa fa-github"> Edit on GitHub</a>
|
||||
</li>
|
||||
@ -108,8 +108,8 @@
|
||||
<div role="main" class="document" itemscope="itemscope" itemtype="http://schema.org/Article">
|
||||
<div itemprop="articleBody">
|
||||
|
||||
<section id="vits">
|
||||
<h1>VITS<a class="headerlink" href="#vits" title="Permalink to this heading"></a></h1>
|
||||
<section id="vits-ljspeech">
|
||||
<h1>VITS-LJSpeech<a class="headerlink" href="#vits-ljspeech" title="Permalink to this heading"></a></h1>
|
||||
<p>This tutorial shows you how to train an VITS model
|
||||
with the <a class="reference external" href="https://keithito.com/LJ-Speech-Dataset/">LJSpeech</a> dataset.</p>
|
||||
<div class="admonition note">
|
||||
@ -208,7 +208,7 @@ $<span class="w"> </span>./vits/infer.py<span class="w"> </span><span class="se"
|
||||
by visiting the following link:</p>
|
||||
<blockquote>
|
||||
<div><ul class="simple">
|
||||
<li><p><a class="reference external" href="https://huggingface.co/Zengwei/icefall-tts-ljspeech-vits-2023-11-29">https://huggingface.co/Zengwei/icefall-tts-ljspeech-vits-2023-11-29</a></p></li>
|
||||
<li><p><a class="reference external" href="https://huggingface.co/Zengwei/icefall-tts-ljspeech-vits-2024-02-28">https://huggingface.co/Zengwei/icefall-tts-ljspeech-vits-2024-02-28</a></p></li>
|
||||
</ul>
|
||||
</div></blockquote>
|
||||
</section>
|
||||
@ -219,7 +219,7 @@ by visiting the following link:</p>
|
||||
</div>
|
||||
<footer><div class="rst-footer-buttons" role="navigation" aria-label="Footer">
|
||||
<a href="../index.html" class="btn btn-neutral float-left" title="TTS" accesskey="p" rel="prev"><span class="fa fa-arrow-circle-left" aria-hidden="true"></span> Previous</a>
|
||||
<a href="../vctk/vits.html" class="btn btn-neutral float-right" title="VITS" accesskey="n" rel="next">Next <span class="fa fa-arrow-circle-right" aria-hidden="true"></span></a>
|
||||
<a href="../vctk/vits.html" class="btn btn-neutral float-right" title="VITS-VCTK" accesskey="n" rel="next">Next <span class="fa fa-arrow-circle-right" aria-hidden="true"></span></a>
|
||||
</div>
|
||||
|
||||
<hr/>
|
||||
|
@ -4,7 +4,7 @@
|
||||
<meta charset="utf-8" /><meta name="viewport" content="width=device-width, initial-scale=1" />
|
||||
|
||||
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
|
||||
<title>VITS — icefall 0.1 documentation</title>
|
||||
<title>VITS-VCTK — icefall 0.1 documentation</title>
|
||||
<link rel="stylesheet" type="text/css" href="../../../_static/pygments.css?v=fa44fd50" />
|
||||
<link rel="stylesheet" type="text/css" href="../../../_static/css/theme.css?v=19f00094" />
|
||||
|
||||
@ -22,7 +22,7 @@
|
||||
<link rel="index" title="Index" href="../../../genindex.html" />
|
||||
<link rel="search" title="Search" href="../../../search.html" />
|
||||
<link rel="next" title="Fine-tune a pre-trained model" href="../../Finetune/index.html" />
|
||||
<link rel="prev" title="VITS" href="../ljspeech/vits.html" />
|
||||
<link rel="prev" title="VITS-LJSpeech" href="../ljspeech/vits.html" />
|
||||
</head>
|
||||
|
||||
<body class="wy-body-for-nav">
|
||||
@ -58,8 +58,8 @@
|
||||
<li class="toctree-l2"><a class="reference internal" href="../../Streaming-ASR/index.html">Streaming ASR</a></li>
|
||||
<li class="toctree-l2"><a class="reference internal" href="../../RNN-LM/index.html">RNN-LM</a></li>
|
||||
<li class="toctree-l2 current"><a class="reference internal" href="../index.html">TTS</a><ul class="current">
|
||||
<li class="toctree-l3"><a class="reference internal" href="../ljspeech/vits.html">VITS</a></li>
|
||||
<li class="toctree-l3 current"><a class="current reference internal" href="#">VITS</a><ul>
|
||||
<li class="toctree-l3"><a class="reference internal" href="../ljspeech/vits.html">VITS-LJSpeech</a></li>
|
||||
<li class="toctree-l3 current"><a class="current reference internal" href="#">VITS-VCTK</a><ul>
|
||||
<li class="toctree-l4"><a class="reference internal" href="#data-preparation">Data preparation</a></li>
|
||||
<li class="toctree-l4"><a class="reference internal" href="#build-monotonic-alignment-search">Build Monotonic Alignment Search</a></li>
|
||||
<li class="toctree-l4"><a class="reference internal" href="#training">Training</a></li>
|
||||
@ -98,7 +98,7 @@
|
||||
<li><a href="../../../index.html" class="icon icon-home" aria-label="Home"></a></li>
|
||||
<li class="breadcrumb-item"><a href="../../index.html">Recipes</a></li>
|
||||
<li class="breadcrumb-item"><a href="../index.html">TTS</a></li>
|
||||
<li class="breadcrumb-item active">VITS</li>
|
||||
<li class="breadcrumb-item active">VITS-VCTK</li>
|
||||
<li class="wy-breadcrumbs-aside">
|
||||
<a href="https://github.com/k2-fsa/icefall/blob/master/docs/source/recipes/TTS/vctk/vits.rst" class="fa fa-github"> Edit on GitHub</a>
|
||||
</li>
|
||||
@ -108,8 +108,8 @@
|
||||
<div role="main" class="document" itemscope="itemscope" itemtype="http://schema.org/Article">
|
||||
<div itemprop="articleBody">
|
||||
|
||||
<section id="vits">
|
||||
<h1>VITS<a class="headerlink" href="#vits" title="Permalink to this heading"></a></h1>
|
||||
<section id="vits-vctk">
|
||||
<h1>VITS-VCTK<a class="headerlink" href="#vits-vctk" title="Permalink to this heading"></a></h1>
|
||||
<p>This tutorial shows you how to train an VITS model
|
||||
with the <a class="reference external" href="https://datashare.ed.ac.uk/handle/10283/3443">VCTK</a> dataset.</p>
|
||||
<div class="admonition note">
|
||||
@ -219,7 +219,7 @@ by visiting the following link:</p>
|
||||
</div>
|
||||
</div>
|
||||
<footer><div class="rst-footer-buttons" role="navigation" aria-label="Footer">
|
||||
<a href="../ljspeech/vits.html" class="btn btn-neutral float-left" title="VITS" accesskey="p" rel="prev"><span class="fa fa-arrow-circle-left" aria-hidden="true"></span> Previous</a>
|
||||
<a href="../ljspeech/vits.html" class="btn btn-neutral float-left" title="VITS-LJSpeech" accesskey="p" rel="prev"><span class="fa fa-arrow-circle-left" aria-hidden="true"></span> Previous</a>
|
||||
<a href="../../Finetune/index.html" class="btn btn-neutral float-right" title="Fine-tune a pre-trained model" accesskey="n" rel="next">Next <span class="fa fa-arrow-circle-right" aria-hidden="true"></span></a>
|
||||
</div>
|
||||
|
||||
|
@ -119,8 +119,8 @@ Currently, we provide recipes for speech recognition, language model, and speech
|
||||
</ul>
|
||||
</li>
|
||||
<li class="toctree-l1"><a class="reference internal" href="TTS/index.html">TTS</a><ul>
|
||||
<li class="toctree-l2"><a class="reference internal" href="TTS/ljspeech/vits.html">VITS</a></li>
|
||||
<li class="toctree-l2"><a class="reference internal" href="TTS/vctk/vits.html">VITS</a></li>
|
||||
<li class="toctree-l2"><a class="reference internal" href="TTS/ljspeech/vits.html">VITS-LJSpeech</a></li>
|
||||
<li class="toctree-l2"><a class="reference internal" href="TTS/vctk/vits.html">VITS-VCTK</a></li>
|
||||
</ul>
|
||||
</li>
|
||||
<li class="toctree-l1"><a class="reference internal" href="Finetune/index.html">Fine-tune a pre-trained model</a><ul>
|
||||
|
File diff suppressed because one or more lines are too long
Loading…
x
Reference in New Issue
Block a user