6 15 1

Xin Lai

xinlai

x-lai

AI & ML interests

Multimodal LLM, LLM Reasoning, Point Cloud Segmentation, Image Segmentation

Recent Activity

upvoted a paper 3 days ago

Training Open Models for Agentic Phone Use

upvoted a paper 9 days ago

GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine?

upvoted a paper 10 days ago

PhoneHarness: Harnessing Phone-Use Agents through Mixed GUI, CLI, and Tool Actions

View all activity

Organizations

None yet

upvoted a paper 3 days ago

Training Open Models for Agentic Phone Use

Paper • 2606.23049 • Published 4 days ago • 14

upvoted a paper 9 days ago

GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine?

Paper • 2606.17861 • Published 10 days ago • 55

upvoted a paper 10 days ago

PhoneHarness: Harnessing Phone-Use Agents through Mixed GUI, CLI, and Tool Actions

Paper • 2606.14832 • Published 14 days ago • 12

upvoted a paper about 1 month ago

LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation

Paper • 2605.18739 • Published May 18 • 115

upvoted 2 papers 3 months ago

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Paper • 2604.04921 • Published Apr 6 • 116

Efficient Reasoning with Balanced Thinking

Paper • 2603.12372 • Published Mar 12 • 151

upvoted a paper 10 months ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2, 2025 • 84

upvoted a paper 11 months ago

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Paper • 2507.13348 • Published Jul 17, 2025 • 80

upvoted 2 papers 12 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 161

MMSearch-R1: Incentivizing LMMs to Search

Paper • 2506.20670 • Published Jun 25, 2025 • 65

upvoted 3 papers over 1 year ago

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published Mar 24, 2025 • 31

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Paper • 2412.09501 • Published Dec 12, 2024 • 48

VisionZip: Longer is Better but Not Necessary in Vision Language Models

Paper • 2412.04467 • Published Dec 5, 2024 • 119

updated a Space almost 2 years ago

SEED Story George

🌍

upvoted a paper almost 2 years ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 148

updated a dataset almost 2 years ago

xinlai/Math-Step-DPO-10K

Viewer • Updated Jul 4, 2024 • 10.8k • 248 • 58

New activity in xinlai/Math-Step-DPO-10K almost 2 years ago

Librarian Bot: Add language metadata for dataset

#3 opened almost 2 years ago by

librarian-bot

liked a dataset almost 2 years ago

xinlai/Math-Step-DPO-10K

Viewer • Updated Jul 4, 2024 • 10.8k • 248 • 58

updated a collection almost 2 years ago

Step-DPO

Collection

Resources for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs" • 11 items • Updated Jul 1, 2024 • 5