Spaces:
Running
Running
Update app.py
Browse files
app.py
CHANGED
|
@@ -423,11 +423,11 @@ with gr.Blocks(theme="Respair/Shiki@9.1.0", css=css) as demo:
|
|
| 423 |
</p>
|
| 424 |
|
| 425 |
<p style="color: #1a1a1a; font-weight: 500; line-height: 1.8; margin-bottom: 20px; font-size: 16px;">
|
| 426 |
-
There are two checkpoints in this demo, one of them
|
| 427 |
Both checkpoints have been fine-tuned on a subset of the dataset with only speaker tags. This will allow us to generate high quality samples without relying on audio prompts or dealing with random speaker attributes, but at the cost of tanking the zero-shot faithfulness of the model.
|
| 428 |
</p>
|
| 429 |
|
| 430 |
-
<p style="color: #1a1a1a; font-weight: 500; line-height: 1.8; margin-bottom: 20px; font-size: 16px;">
|
| 431 |
Takane also comes with an Anti-Hallucination Algorithm (AHA) that generates a few candidates in parallel and automatically returns the best one at the cost of introducing a small overhead.
|
| 432 |
If you need the fastest response time possible, feel free to enable the Turbo mode. It will disable AHA and tweak the parameters internally to produce samples as fast as 2-3 seconds.
|
| 433 |
</p>
|
|
|
|
| 423 |
</p>
|
| 424 |
|
| 425 |
<p style="color: #1a1a1a; font-weight: 500; line-height: 1.8; margin-bottom: 20px; font-size: 16px;">
|
| 426 |
+
There are two checkpoints in this demo, one of them utilizes a custom version of Rope to manipulate duration which is seldom seen in autoregressive settings. Please treat it as a proof of concept as its outputs are not very reliable. I'll include it to show that it can work to some levels and can be expanded upon.
|
| 427 |
Both checkpoints have been fine-tuned on a subset of the dataset with only speaker tags. This will allow us to generate high quality samples without relying on audio prompts or dealing with random speaker attributes, but at the cost of tanking the zero-shot faithfulness of the model.
|
| 428 |
</p>
|
| 429 |
|
| 430 |
+
<p style="color: #1a1a1a; font-weight: 500; line-height: 1.8; margin-bottom: 20px; font-size: 16px;">e
|
| 431 |
Takane also comes with an Anti-Hallucination Algorithm (AHA) that generates a few candidates in parallel and automatically returns the best one at the cost of introducing a small overhead.
|
| 432 |
If you need the fastest response time possible, feel free to enable the Turbo mode. It will disable AHA and tweak the parameters internally to produce samples as fast as 2-3 seconds.
|
| 433 |
</p>
|