8 16 46

Shuhuai Ren

ShuhuaiRen

https://renshuhuai-andy.github.io/

AI & ML interests

NLP, Multi-modal

Recent Activity

upvoted a paper 27 days ago

Representation Forcing for Bottleneck-Free Unified Multimodal Models

upvoted a paper 3 months ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

upvoted a paper 3 months ago

Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

View all activity

Organizations

upvoted a paper 27 days ago

Representation Forcing for Bottleneck-Free Unified Multimodal Models

Paper • 2605.31604 • Published about 1 month ago • 63

upvoted 2 papers 3 months ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published Apr 6 • 237

Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

Paper • 2604.06132 • Published Apr 7 • 122

upvoted a paper 5 months ago

HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing

Paper • 2602.03560 • Published Feb 3 • 49

liked a model 9 months ago

XiaomiMiMo/MiMo-Audio-Tokenizer

1B • Updated 11 days ago • 3.13k • 38

upvoted a collection 9 months ago

MiMo-Audio

Collection

3 items • Updated 11 days ago • 28

upvoted a paper 10 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31, 2025 • 85

liked a dataset 10 months ago

apf1/datafilteringnetworks_2b

Updated Feb 28, 2025 • 143 • 21

New activity in XiaomiMiMo/MiMo-VL-7B-RL-2508 10 months ago

add hints for placing visual input and thinking control

#2 opened 10 months ago by

ShuhuaiRen

New activity in XiaomiMiMo/MiMo-VL-7B-SFT-2508 10 months ago

add hints for placing visual input and thinking control

#2 opened 10 months ago by

ShuhuaiRen

liked 2 models 11 months ago

XiaomiMiMo/MiMo-VL-7B-SFT-2508

Image-Text-to-Text • 8B • Updated Aug 21, 2025 • 922 • 36

XiaomiMiMo/MiMo-VL-7B-RL-2508

Image-Text-to-Text • 8B • Updated Aug 21, 2025 • 1.18k • 92

upvoted a collection 11 months ago

MiMo-VL

Collection

6 items • Updated Dec 17, 2025 • 45

liked a model 11 months ago

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27, 2025 • 1.1M • • 13.4k

liked 3 Spaces 12 months ago

RISEBench Gallery

👀

A Gallery of Generation Results on RISEBench

FineWeb: decanting the web for the finest text data at scale

🍷

1.38k

Explore and download the FineWeb web‑scale text dataset

The Ultra-Scale Playbook

🌌

3.91k

The ultimate guide to training LLM on large GPU Clusters

authored a paper about 1 year ago

MiMo-VL Technical Report

Paper • 2506.03569 • Published Jun 4, 2025 • 81

upvoted a paper about 1 year ago

MiMo-VL Technical Report

Paper • 2506.03569 • Published Jun 4, 2025 • 81

liked a model about 1 year ago

XiaomiMiMo/MiMo-VL-7B-SFT

Image-Text-to-Text • 8B • Updated Jun 7, 2025 • 784 • 55

Shuhuai Ren

AI & ML interests

Recent Activity

Organizations

ShuhuaiRen's activity

add hints for placing visual input and thinking control

add hints for placing visual input and thinking control

RISEBench Gallery

FineWeb: decanting the web for the finest text data at scale

The Ultra-Scale Playbook