Qwen2.5-Omni-SEA-R1
๐ support textual cross-lingual processing, ๐ Southeast Asia audio understanding, and ๐ง cross-modality reasoning.
๐ง zhangly@i2r.a-star.edu.sg
๐ง zhangly@i2r.a-star.edu.sg
We release Qwen2.5-Omni-SEA-R1, transferring the audio understanding to Southeast Asia languages of Singapore English, Singapore Chinese, Malay, and Indonesian, enhancing the multi-modality model with reasoning abilities. To our knowledge, this is the first multi-modality reasoning model targetting on SEA speech, and the first multi-modal model that performing deepseek-style reasoning on text, visual, and audio simutaneously.
We tested the model performance regarding the two improvements: ASR and Omni-reasoning.
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support