	---
	license: mit
	pipeline_tag: text-to-speech
	tags:
	- irodori-tts
	- text-to-speech
	- japanese
	- onnx
	- migraphx
	- rocm
	- voice-design
	---

	# irodori_tts_cpp_artifacts

	Exported ONNX and MIGraphX artifacts for running `Aratako/Irodori-TTS-500M-v2-VoiceDesign` with the native `irodori_tts_cpp` runtime.

	This repository contains only the inference artifacts required by the C++ runtime:

	- `manifest.json`
	- local tokenizer JSON
	- RF-DiT context ONNX / external data / MIGraphX cache
	- RF-DiT step ONNX / external data / MIGraphX cache for 1, 2, 4, 8, 12, 16, 24, and 30 second buckets
	- DACVAE decode ONNX / external data / MIGraphX cache for 1, 2, 4, 8, 12, 16, 24, and 30 second buckets
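As an illustration of how the duration buckets might be used, the sketch below picks the smallest listed bucket that can hold a requested audio length. This selection rule is an assumption made for illustration; the README does not describe how `irodori_tts_cpp` actually chooses a bucket, and `pick_bucket` is a hypothetical helper, not part of the runtime.

```shell
#!/usr/bin/env bash
# Bucket lengths in seconds, as listed in this README.
BUCKETS=(1 2 4 8 12 16 24 30)

# pick_bucket SECONDS
# Hypothetical helper: prints the smallest bucket >= SECONDS.
# Returns non-zero when SECONDS exceeds the largest bucket.
# ASSUMPTION: "smallest bucket that fits" is a guess at the selection rule.
pick_bucket() {
  local want="$1" b
  for b in "${BUCKETS[@]}"; do
    # awk does the comparison so fractional durations such as 2.5 work.
    if awk -v w="$want" -v b="$b" 'BEGIN { exit !(w <= b) }'; then
      echo "$b"
      return 0
    fi
  done
  return 1
}
```

For example, `pick_bucket 2.5` prints `4`, and `pick_bucket 31` prints nothing and returns a non-zero status because no listed bucket is long enough.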

	At request time the runtime loads only the `.mxr` files. The ONNX and `.onnx.data` files are included as the source artifacts from which the MIGraphX caches are generated.

	## Download

	```bash
	huggingface-cli download yoshou/irodori_tts_cpp_artifacts \
	  --local-dir artifacts/irodori-500m-v2-voicedesign-test
	```
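After downloading, a quick sanity check can confirm that the files the runtime depends on are present. The following is a minimal sketch: `check_artifacts` is a hypothetical helper, and because the directory layout is not documented here it only checks for `manifest.json` at the top level and for at least one `.mxr` file anywhere under the directory.

```shell
#!/usr/bin/env bash
# check_artifacts DIR
# Hypothetical post-download check; not part of irodori_tts_cpp.
# ASSUMPTION: manifest.json sits at the top level of DIR.
check_artifacts() {
  local dir="$1" n
  if [ ! -f "$dir/manifest.json" ]; then
    echo "missing manifest.json in $dir" >&2
    return 1
  fi
  # Count .mxr files at any depth; tr strips padding some wc versions add.
  n=$(find "$dir" -name '*.mxr' | wc -l | tr -d ' ')
  if [ "$n" -eq 0 ]; then
    echo "no .mxr files found under $dir" >&2
    return 1
  fi
  echo "ok: manifest.json and $n .mxr file(s)"
}
```

Usage with the download directory from the command above: `check_artifacts artifacts/irodori-500m-v2-voicedesign-test`.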

	## License

	The exported Irodori-TTS VoiceDesign artifacts are distributed under the MIT License, following the upstream model license.

	This repository also includes tokenizer files derived from `llm-jp/llm-jp-3-150m`, which is licensed under the Apache License 2.0. See `NOTICE` for attribution details.

	Users must also follow the ethical restrictions described in the upstream Irodori-TTS VoiceDesign model card, including the prohibitions on impersonation and on misleading synthetic speech.

	## Upstream

	- https://huggingface.co/Aratako/Irodori-TTS-500M-v2-VoiceDesign
	- https://github.com/Aratako/Irodori-TTS
	- https://huggingface.co/llm-jp/llm-jp-3-150m