its5Q's picture
Update README.md
592012a verified
metadata
license: mit
language:
  - ru
  - en
pipeline_tag: audio-to-audio

Summary

Pretrained RingFormer from codename's RVC fork

Sample rate: 40000
Embedder: Spin v2
Mel Loss: Multi-scale
F0: RMVPE
Precision: BF16
Speakers: 225
Dataset length: 235:31:50, of which 37:26:53 is Russian, rest is English.

Download G
Download D

Graphs

graphs