xuluxin's picture

10

xuluxin

2041Xu

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 23 days ago

Draft-OPD: On-Policy Distillation for Speculative Draft Models

upvoted a paper 23 days ago

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

upvoted a paper about 1 month ago

Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning

View all activity

Organizations

upvoted 2 papers 23 days ago

Draft-OPD: On-Policy Distillation for Speculative Draft Models

Paper • 2605.29343 • Published 29 days ago • 36

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

Paper • 2605.31264 • Published 28 days ago • 118

upvoted 3 papers about 1 month ago

Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning

Paper • 2605.06326 • Published May 7 • 26

KVServe: Service-Aware KV Cache Compression for Communication-Efficient Disaggregated LLM Serving

Paper • 2605.13734 • Published May 13 • 12

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Paper • 2605.14678 • Published May 19 • 108

upvoted a paper 3 months ago

DARE: Diffusion Large Language Models Alignment and Reinforcement Executor

Paper • 2604.04215 • Published Apr 5 • 22

upvoted a paper 5 months ago

AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning

Paper • 2601.18631 • Published Jan 26 • 48

updated a dataset 5 months ago

cokesoda22/visual_search_data

Viewer • Updated Jan 22 • 2.11k • 79

published a dataset 5 months ago

cokesoda22/visual_search_data

Viewer • Updated Jan 22 • 2.11k • 79

upvoted a paper 9 months ago

SCI-Verifier: Scientific Verifier with Thinking

Paper • 2509.24285 • Published Sep 29, 2025 • 10

upvoted a paper 10 months ago

MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds

Paper • 2508.14879 • Published Aug 20, 2025 • 69

upvoted a paper about 1 year ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28, 2025 • 132