AnirudhLanka2002
/

so_ViTS_SVC_models

Model card Files Files and versions

so_ViTS_SVC_models / README.md

AnirudhLanka2002's picture

AnirudhLanka2002

Update1_ReadMe

9a22449 verified over 1 year ago

|

history blame contribute delete

2.55 kB

	# Speech-to-Speech Model: so-vits-svc
	## Overview
	This repository contains a speech-to-speech model, specifically the so-vits-svc, trained to mimic the voice of Chamber, a character from the game Valorant. The model is designed for speech spoofing and voice conversion applications, offering a high level of accuracy compared to other models like the RVC model, which is faster but less precise.

	## Model Choice
	The so-vits-svc model was chosen over the RVC model due to its superior accuracy, despite the latter's speed advantage. Future plans include training the model to detect and convert a variety of innovative voices, such as transforming songs into my own voice.

	## Model Details: so-vits-svc Model
	The so-vits-svc model is a singing voice changer that uses ViTS (Variational Inference for Text-to-Speech). This model is particularly suited for high-quality voice conversion tasks, making it ideal for applications where accuracy is crucial. Here’s a brief overview of its components and functionality:
	* ViTS (Variational Inference for Text-to-Speech): An advanced technique that combines variational inference methods with text-to-speech models, allowing for more nuanced and accurate voice conversions.
	* Singing Voice Changer: Originally designed for changing singing voices, the model can adapt to various speech patterns, making it versatile for different applications, including character voice mimicking and real-time voice conversion.

	## Training Details
	### Dataset
	* Character: Chamber from Valorant
	* Voice Lines: Approximately 500 voice lines
	* Source: Downloaded from a website providing mp3 files of Valorant character voices

	## Training Process
	* Epochs: 2000
	* Duration: Approximately 24 hours (including a 2-hour break)
	* Hardware: RTX 3070 GPU with 8GB VRAM

	## Future Work
	* The model will be further trained and experimented with to include more diverse and innovative voices. Possible applications include converting songs to a specified voice.
	* Web Application
	* Plans are underway to develop a web application that allows users to convert their voice to different characters in real-time. This application aims to provide a fun way to play games and prank friends by transforming voices into various character voices seamlessly.

	## Acknowledgments
	* Valorant for providing the character and voice lines.
	* Online resources for the mp3 files.

	## Contact
	* Made with passion by Anirudh Sai Lanka.
	* For any queries or contributions, please contact me at anirudh2002sai1234@gmail.com.

	---
	license: mit
	---