OneVL series vision-language models
Xiaomi Research
community
AI & ML interests
None defined yet.
Recent Activity
Papers
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation
Video Streaming Thinking: VideoLLMs Can Watch and Think Simultaneously
Gemma3-based Multilingual Machine Translation Models
-
xiaomi-research/MiLMMT-46-1B-Pretrain
Text Generation • 1B • Updated • 6 • 1 -
xiaomi-research/MiLMMT-46-1B-v0.1
Translation • 1B • Updated • 1.79k • 5 -
xiaomi-research/MiLMMT-46-4B-Pretrain
Image-Text-to-Text • 4B • Updated • 10 • 1 -
xiaomi-research/MiLMMT-46-4B-v0.1
Translation • 4B • Updated • 2.22k • 2
OneVL series vision-language models
Gemma2-based Multilingual Machine Translation Models
Gemma3-based Multilingual Machine Translation Models
-
xiaomi-research/MiLMMT-46-1B-Pretrain
Text Generation • 1B • Updated • 6 • 1 -
xiaomi-research/MiLMMT-46-1B-v0.1
Translation • 1B • Updated • 1.79k • 5 -
xiaomi-research/MiLMMT-46-4B-Pretrain
Image-Text-to-Text • 4B • Updated • 10 • 1 -
xiaomi-research/MiLMMT-46-4B-v0.1
Translation • 4B • Updated • 2.22k • 2