| --- |
| license: cc-by-nc-sa-4.0 |
| datasets: |
| - mozilla-foundation/common_voice_17_0 |
| - bond005/sberdevices_golos_10h_crowd |
| - bond005/sberdevices_golos_100h_farfield |
| - bond005/sova_rudevices |
| - Aniemore/resd_annotated |
| language: |
| - ru |
| - en |
| base_model: |
| - SWivid/F5-TTS |
| --- |
| ## Overview |
| The F5-TTS model is finetuned for Russian and English language |
|
|
| ## License |
| This model is released under the Creative Commons Attribution Non Commercial Share Alike 4.0 license, which allows for free usage, modification, and distribution |
|
|
| ## Model Information |
| **Base Model**: SWivid/F5-TTS |
| **Training Duration:** 813k steps |
| **Dataset Duration:** 100k hours |
|
|
| ## Train charts |
|  |
|  |
|
|
| ## Training Configuration: |
| ```json |
| { |
| "exp_name": "F5TTS_Base", |
| "learning_rate": 1e-05, |
| "batch_size_per_gpu": 5000, |
| "batch_size_type": "frame", |
| "max_samples": 64, |
| "grad_accumulation_steps": 1, |
| "max_grad_norm": 1, |
| "epochs": 1, |
| "num_warmup_updates": 405764, |
| "save_per_updates": 811528, |
| "keep_last_n_checkpoints": 5, |
| "last_per_updates": 10000, |
| "finetune": true, |
| "file_checkpoint_train": "", |
| "tokenizer_type": "char", |
| "tokenizer_file": "", |
| "mixed_precision": "fp16", |
| "logger": "wandb", |
| "bnb_optimizer": true |
| } |
| ``` |
|
|
| ## Usage Instructions |
| Go to [base repo](https://github.com/SWivid/F5-TTS) |
|
|
| ## To do |
| - Ask in community tab |
|
|
| # Other links |
| - [Github repo](https://github.com/HotDro4illa/F5-TTS) |