ZDisket/echolancer-stage3-base

	---
	license: mit
	datasets:
	- neuphonic/emilia-yodas-english-neucodec
	language:
	- en
	pipeline_tag: text-to-speech
	---


	# Echolancer Stage 3 Base

	This is a text-to-speech (TTS) model pretrained on the pre-tokenized Emilia dataset. Because there is no speaker conditioning, the speaker is random at inference time. The model has 1.3B parameters and was trained from scratch on 8x A100 GPUs for about 2.5 days.

	The training objective was standard next-token prediction on concatenated text-audio tokens.
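
To make the objective concrete, here is a minimal pure-Python sketch of next-token prediction over a concatenated text-audio sequence. It is not taken from the Echolancer code: the vocabulary sizes, the offset scheme that places audio tokens after text tokens in one shared vocabulary, the `SEP` separator token, and all function names are assumptions for illustration only.

```python
import math

# Hypothetical sizes and layout (assumptions, not Echolancer's real config).
TEXT_VOCAB = 256                 # e.g. byte-level text tokens
AUDIO_VOCAB = 1024               # audio codec codebook size
SEP = TEXT_VOCAB + AUDIO_VOCAB   # assumed separator between text and audio
VOCAB = SEP + 1                  # total shared vocabulary size


def build_sequence(text_ids, audio_ids):
    """Concatenate text and audio tokens into one shared-vocabulary sequence."""
    # Shift audio ids so they do not collide with text ids.
    shifted_audio = [TEXT_VOCAB + a for a in audio_ids]
    return list(text_ids) + [SEP] + shifted_audio


def next_token_pairs(seq):
    """Inputs are every token but the last; targets are the same tokens shifted by one."""
    return seq[:-1], seq[1:]


def cross_entropy(logits, target):
    """Negative log-probability of `target` under softmax(logits)."""
    m = max(logits)  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[target]


def sequence_loss(logits_per_step, targets):
    """Mean cross-entropy over all positions in the sequence."""
    losses = [cross_entropy(l, t) for l, t in zip(logits_per_step, targets)]
    return sum(losses) / len(losses)


# Toy example: two text bytes followed by three audio codes.
text_ids = list("hi".encode("utf-8"))
audio_ids = [7, 900, 42]
seq = build_sequence(text_ids, audio_ids)
inputs, targets = next_token_pairs(seq)

# Stand-in for model output: uniform logits at every position.
uniform_logits = [[0.0] * VOCAB for _ in inputs]
loss = sequence_loss(uniform_logits, targets)
```

With uniform logits the loss equals `log(VOCAB)`, the value an untrained model would start near. In this sketch the loss covers every position, text and audio alike; whether the actual training masks the text positions is not stated here.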

	# Code
	For more information, including a Colab notebook, see [the repository](https://github.com/ZDisket/Echolancer).