yoshou's picture
Add files using upload-large-folder tool
292aa23 verified
---
license: mit
pipeline_tag: text-to-speech
tags:
- irodori-tts
- text-to-speech
- japanese
- onnx
- migraphx
- rocm
- voice-design
---
# irodori_tts_cpp_artifacts
Exported ONNX and MIGraphX artifacts for running `Aratako/Irodori-TTS-500M-v2-VoiceDesign` with the native `irodori_tts_cpp` runtime.
This repository contains only the inference artifacts required by the C++ runtime:
- `manifest.json`
- local tokenizer JSON
- RF-DiT context ONNX / external data / MIGraphX cache
- RF-DiT step ONNX / external data / MIGraphX cache for 1, 2, 4, 8, 12, 16, 24, and 30 second buckets
- DACVAE decode ONNX / external data / MIGraphX cache for 1, 2, 4, 8, 12, 16, 24, and 30 second buckets
The runtime request path uses the `.mxr` files. ONNX and `.onnx.data` files are included as the cache generation source artifacts.
## Download
```bash
huggingface-cli download yoshou/irodori_tts_cpp_artifacts \
--local-dir artifacts/irodori-500m-v2-voicedesign-test
```
## License
The exported Irodori-TTS VoiceDesign artifacts are distributed under the MIT License, following the upstream model license.
This repository also includes tokenizer files derived from `llm-jp/llm-jp-3-150m`, which is licensed under the Apache License 2.0. See `NOTICE` for attribution details.
Users must also follow the ethical restrictions described in the upstream Irodori-TTS VoiceDesign model card, including no impersonation and no misleading synthetic speech.
## Upstream
- https://huggingface.co/Aratako/Irodori-TTS-500M-v2-VoiceDesign
- https://github.com/Aratako/Irodori-TTS
- https://huggingface.co/llm-jp/llm-jp-3-150m