its5Q
/

rvc-ringformer-v2-pretrain

Model card Files Files and versions

rvc-ringformer-v2-pretrain / README.md

its5Q's picture

Update README.md

59324ba verified 6 months ago

|

history blame contribute delete

652 Bytes

	---
	license: mit
	language:
	- ru
	- en
	pipeline_tag: audio-to-audio
	---
	# Summary
	Pretrained RingFormer v2 from [codename's RVC fork](https://github.com/codename0og/codename-rvc-fork-4)

	Sample rate: 40000
	Embedder: Spin v2
	Mel Loss: Experimental L1 + MR-STFT loss
	F0: RMVPE
	Precision: FP16 / FP32 mixed (BF16 proved to be very imprecise in ablation tests, achieving way worse loss values, I don't recommend using it for training any RVC models)
	Speakers: 225
	Dataset length: 235:31:50, of which 37:26:53 is Russian, rest is English.


	First checkpoints in the upcoming days, haven't finished training yet