1 24 6

Ruobing Xie

Ruobing-Xie

https://ruobingxie.github.io/

AI & ML interests

Recommender System; Large Language Model; Natural Language Processing; Information Retrieval

Recent Activity

upvoted a paper about 20 hours ago

Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts

upvoted a paper 4 days ago

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

upvoted an article 3 months ago

Why Did MiniMax M2 End Up as a Full Attention Model?

View all activity

Organizations

None yet

upvoted a paper about 20 hours ago

Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts

Paper • 2601.22156 • Published 1 day ago • 5

upvoted a paper 4 days ago

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

Paper • 2601.14004 • Published 11 days ago • 46

upvoted an article 3 months ago

Article

Why Did MiniMax M2 End Up as a Full Attention Model?

Oct 30, 2025

•

upvoted a paper 5 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 195

upvoted a paper 6 months ago

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published Aug 2, 2025 • 238

upvoted 3 papers 8 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 324

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

Paper • 2505.22653 • Published May 28, 2025 • 43

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28, 2025 • 131

upvoted 2 papers 11 months ago

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13, 2025 • 170

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Paper • 2503.07365 • Published Mar 10, 2025 • 61

upvoted 2 papers 12 months ago

HMoE: Heterogeneous Mixture of Experts for Language Modeling

Paper • 2408.10681 • Published Aug 20, 2024 • 10

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22, 2025 • 126

upvoted 7 papers about 1 year ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 437

Autonomy-of-Experts Models

Paper • 2501.13074 • Published Jan 22, 2025 • 44

Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Paper • 2501.12202 • Published Jan 21, 2025 • 49

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

Paper • 2411.02265 • Published Nov 4, 2024 • 25

upvoted a paper over 1 year ago

Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence

Paper • 2407.07061 • Published Jul 9, 2024 • 28

Ruobing Xie

AI & ML interests

Recent Activity

Organizations

Ruobing-Xie's activity

Why Did MiniMax M2 End Up as a Full Attention Model?