Japanese SFT/DPO data convert to speech via TTS. And audio caption data generated by Qwen3-Omni. All datasets are available for commercial use.
Ayuto Tsutsumi
Atotti
AI & ML interests
None yet
Recent Activity
liked a model 3 days ago
kyutai/ARC4_Encoder_Llama liked a dataset 9 days ago
sbintuitions/voicebench-ja liked a model about 1 month ago
ACE-Step/ace-step-v1.5-1d-vae-stable-audio-format