3 8 4

Cui

Yifan0102

AI & ML interests

None yet

Recent Activity

liked a model about 2 months ago

moonshotai/Kimi-Linear-48B-A3B-Instruct

upvoted an article about 2 months ago

What makes good reasoning data

upvoted an article about 2 months ago

Aligning to What? Rethinking Agent Generalization in MiniMax M2

View all activity

Organizations

liked a model about 2 months ago

moonshotai/Kimi-Linear-48B-A3B-Instruct

Text Generation • 49B • Updated 11 days ago • 135k • 511

upvoted 3 articles about 2 months ago

Article

What makes good reasoning data

Oct 30

•

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Oct 30

•

Article

Why Did MiniMax M2 End Up as a Full Attention Model?

Oct 30

•

upvoted a paper 2 months ago

Robot Learning: A Tutorial

Paper • 2510.12403 • Published Oct 14 • 118

upvoted 2 papers 3 months ago

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16 • 51

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3, 2024 • 88

New activity in meituan-longcat/LongCat-Flash-Chat 4 months ago

Great release! Thanks!

🤗 1

#3 opened 4 months ago by

Yifan0102

liked a model 4 months ago

meituan-longcat/LongCat-Flash-Chat

Text Generation • 562B • Updated Sep 24 • 23.2k • 509

liked a Space 4 months ago

The Ultra-Scale Playbook

🌌

3.6k

The ultimate guide to training LLM on large GPU Clusters

upvoted an article 6 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8

•

739

upvoted a paper 6 months ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 273

liked a Space 7 months ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.24k

Generate high-quality text data for LLMs using FineWeb

New activity in deepseek-ai/DeepSeek-R1-0528 7 months ago

Summer or Winter?

🚀 👀 73

#1 opened 7 months ago by

andromeda0302

brutal

👍 3

#3 opened 7 months ago by deleted

Cui

AI & ML interests

Recent Activity

Organizations

Yifan0102's activity

What makes good reasoning data

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Why Did MiniMax M2 End Up as a Full Attention Model?

Great release! Thanks!

The Ultra-Scale Playbook

SmolLM3: smol, multilingual, long-context reasoner

FineWeb: decanting the web for the finest text data at scale

Summer or Winter?

brutal