A VLM-based message decoder trained with GRPO
Try Orpheus TTS here
Generate images from text descriptions