Upload images to get real-time object recognition results
Generate Japanese TTS audio
Generate audio from text with character voice
Print "hello"
Generate Japanese audio from text