---
license: mit
datasets:
- neuphonic/emilia-yodas-english-neucodec
language:
- en
pipeline_tag: text-to-speech
---


# Echolancer-v0.1-base

This is a text-to-speech (TTS) model pretrained on the pre-tokenized Emilia dataset (`neuphonic/emilia-yodas-english-neucodec`). Because the model has no speaker conditioning, the voice of each generated utterance is effectively random at inference time. The model has 177M parameters and was trained from scratch on a single AMD Instinct MI300X for ~2.5 days using the ROCm PyTorch Training v25.7 container.

The training objective was standard next-token prediction over sequences of concatenated text and audio tokens.
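
As a rough illustration of what next-token prediction over a concatenated text and audio sequence looks like, here is a minimal pure-Python sketch. Everything specific in it is an assumption made for the example and not taken from Echolancer: the vocabulary sizes, the `BOS`/`SEP`/`EOS` special tokens, the text-then-audio ordering, the byte-level text encoding, and the helper names `build_sequence`, `next_token_pairs`, and `cross_entropy`. A uniform distribution stands in for the model. See the repository for the actual token layout and training code.

```python
import math

# Hypothetical sizes and special tokens, chosen only for illustration.
TEXT_VOCAB = 256                    # byte-level text tokens (assumption)
AUDIO_VOCAB = 1024                  # size of a made-up audio codebook (assumption)
BOS = TEXT_VOCAB + AUDIO_VOCAB      # start of sequence
SEP = BOS + 1                       # boundary between text and audio
EOS = BOS + 2                       # end of sequence
VOCAB_SIZE = EOS + 1


def build_sequence(text, audio_codes):
    """Concatenate text and audio tokens into one shared-vocabulary sequence."""
    text_ids = list(text.encode("utf-8"))
    # Shift audio codes past the text IDs so both share one vocabulary.
    audio_ids = [TEXT_VOCAB + c for c in audio_codes]
    return [BOS] + text_ids + [SEP] + audio_ids + [EOS]


def next_token_pairs(seq):
    """Inputs are seq[:-1]; targets are the same sequence shifted left by one."""
    return seq[:-1], seq[1:]


def cross_entropy(prob_rows, targets):
    """Mean negative log-likelihood of each target under its predicted distribution."""
    total = sum(math.log(row[t]) for row, t in zip(prob_rows, targets))
    return -total / len(targets)


seq = build_sequence("hi", [5, 900, 17])
inputs, targets = next_token_pairs(seq)

# Stand-in for a real model: a uniform distribution at every position.
uniform = [[1.0 / VOCAB_SIZE] * VOCAB_SIZE for _ in inputs]
loss = cross_entropy(uniform, targets)
print(seq, loss)
```

In this setup the loss is computed at every position, so the stand-in model is scored on text tokens as well as audio tokens. Whether the real training run masks the text positions is not stated here, so treat that detail as open.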

# Code
For more information, including a Colab notebook, see [the repository](https://github.com/ZDisket/Echolancer).