XiaomiMiMo/MiMo-Embodied-7B
Tags: Image-Text-to-Text, Transformers, Safetensors, qwen2_5_vl, conversational, text-generation-inference
arxiv: 2511.16518
License: mit
Branch: main
Path: MiMo-Embodied-7B/assets
Total size: 23.8 MB
1 contributor
History: 7 commits
Latest commit: ZrayXiaomi, "update table4" (e88b602, verified), 3 months ago
File                     Size     Badges     Last commit      Updated
ad-perception-1.svg      2.11 MB  Safe       add assets       3 months ago
ad-planning-1.png        4.07 MB  xet        update figures   3 months ago
ad-prediction-1.png      6.2 MB   xet        update figures   3 months ago
afford-1.svg             895 kB   Safe       add assets       3 months ago
demo.jpg                 106 kB   xet        add assets       3 months ago
fig1.svg                 911 kB   Safe       add assets       3 months ago
fig2.svg                 725 kB   Safe       add assets       3 months ago
fig3.svg                 1.64 MB  Safe       add assets       3 months ago
fig3_img.png             3 MB     xet        update figure 3  3 months ago
figure_manipulation.svg  762 kB   Safe       add assets       3 months ago
figure_navigation.svg    589 kB   Safe       add assets       3 months ago
planning-1.svg           555 kB   Safe       add assets       3 months ago
spatial-1.svg            331 kB   Safe       add assets       3 months ago
table2.png               323 kB   xet        update figures   3 months ago
table3.png               370 kB   xet        update figures   3 months ago
table4.png               394 kB   xet        update table4    3 months ago
table5.png               348 kB   Safe, xet  update figures   3 months ago
table6.png               184 kB   xet        update figures   3 months ago
table8.png               229 kB   xet        add table8       3 months ago
xfmlogo.svg              101 kB   Safe       add assets       3 months ago