Ruibin Xiong's picture

8

Ruibin Xiong

chrisxiong

https://scholar.google.com/citations?user=P3GLUqQAAAAJ&hl=en

AI & ML interests

LLM

Recent Activity

upvoted a paper about 15 hours ago

FORT-Searcher: Synthesizing Shortcut-Resistant Search Tasks for Training Deep Search Agents

upvoted a paper about 1 month ago

TMAS: Scaling Test-Time Compute via Multi-Agent Synergy

upvoted a paper about 1 month ago

ClawGym: A Scalable Framework for Building Effective Claw Agents

View all activity

Organizations

None yet

upvoted a paper about 15 hours ago

FORT-Searcher: Synthesizing Shortcut-Resistant Search Tasks for Training Deep Search Agents

Paper • 2606.12087 • Published 2 days ago • 45

upvoted 2 papers about 1 month ago

TMAS: Scaling Test-Time Compute via Multi-Agent Synergy

Paper • 2605.10344 • Published May 11 • 50

ClawGym: A Scalable Framework for Building Effective Claw Agents

Paper • 2604.26904 • Published Apr 29 • 51

upvoted a paper 8 months ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

Paper • 2510.03222 • Published Oct 3, 2025 • 76

upvoted a paper 9 months ago

Reinforcement Learning on Pre-Training Data

Paper • 2509.19249 • Published Sep 23, 2025 • 67

upvoted 3 papers over 1 year ago

Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models

Paper • 2502.15499 • Published Feb 21, 2025 • 15

Ultra-Sparse Memory Network

Paper • 2411.12364 • Published Nov 19, 2024 • 23

Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models

Paper • 2411.03884 • Published Nov 6, 2024 • 28