Xiangyu's picture

Xiangyu

xixy

·

https://xixy.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 hour ago

Workspace-Bench 1.0: Benchmarking AI Agents on Workspace Tasks with Large-Scale File Dependencies

upvoted a paper about 4 hours ago

HeavySkill: Heavy Thinking as the Inner Skill in Agentic Harness

upvoted a paper 6 days ago

ClawGym: A Scalable Framework for Building Effective Claw Agents

View all activity

Organizations

None yet

authored a paper 23 days ago

On the Role of Reasoning Patterns in the Generalization Discrepancy of Long Chain-of-Thought Supervised Fine-Tuning

Paper • 2604.01702 • Published Apr 4 • 3

authored a paper about 1 month ago

LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning

Paper • 2603.21065 • Published Mar 22 • 77

authored 6 papers 3 months ago

Enhancing Efficiency and Exploration in Reinforcement Learning for LLMs

Paper • 2505.18573 • Published May 24, 2025

LongCat-Flash Technical Report

Paper • 2509.01322 • Published Sep 1, 2025 • 8

LongCat-Flash-Thinking Technical Report

Paper • 2509.18883 • Published Sep 23, 2025 • 4

Autoformalizer with Tool Feedback

Paper • 2510.06857 • Published Oct 8, 2025

Can Tool-Integrated Reinforcement Learning Generalize Across Diverse Domains?

Paper • 2510.11184 • Published Oct 13, 2025 • 1

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published Jan 23 • 180

authored a paper 11 months ago

Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective

Paper • 2505.17652 • Published May 23, 2025 • 6

authored a paper about 1 year ago

SampleMix: A Sample-wise Pre-training Data Mixing Strategey by Coordinating Data Quality and Diversity

Paper • 2503.01506 • Published Mar 3, 2025 • 10