Zefeng He

yhx12

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 17 days ago

EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

Paper • 2606.13681 • Published 19 days ago • 142

upvoted a paper 19 days ago

ComBench: A Benchmark for Rigorous Proof Reasoning and Constructive Realization in Olympiad-Level Combinatorics

Paper • 2606.10479 • Published 21 days ago • 19

upvoted 2 papers 22 days ago

SubtleMemory: A Benchmark for Fine-Grained Relational Memory Discrimination in Long-Horizon AI Agents

Paper • 2606.05761 • Published 26 days ago • 19

SceneCode: Executable World Programs for Editable Indoor Scenes with Articulated Objects

Paper • 2605.19587 • Published May 19 • 10

upvoted a paper 28 days ago

Draft-OPD: On-Policy Distillation for Speculative Draft Models

Paper • 2605.29343 • Published May 28 • 36

upvoted a paper 29 days ago

Task-Focused Memorization for Multimodal Agents

Paper • 2605.31075 • Published May 29 • 40

upvoted 2 papers about 1 month ago

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

Paper • 2605.28816 • Published May 27 • 431

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Paper • 2605.14678 • Published May 19 • 108

upvoted 5 papers about 2 months ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published May 13 • 165

Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning

Paper • 2605.06326 • Published May 7 • 26

SkillOS: Learning Skill Curation for Self-Evolving Agents

Paper • 2605.06614 • Published May 7 • 46

Persistent Visual Memory: Sustaining Perception for Deep Generation in LVLMs

Paper • 2605.00814 • Published May 1 • 21

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Paper • 2604.28185 • Published Apr 30 • 92

upvoted 5 papers 2 months ago

ReVSI: Rebuilding Visual Spatial Intelligence Evaluation for Accurate Assessment of VLM 3D Reasoning

Paper • 2604.24300 • Published Apr 27 • 68

TEMPO: Scaling Test-time Training for Large Reasoning Models

Paper • 2604.19295 • Published Apr 21 • 35

PlayCoder: Making LLM-Generated GUI Code Playable

Paper • 2604.19742 • Published Apr 21 • 26

Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems

Paper • 2604.14228 • Published Apr 14 • 25

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published Apr 15 • 167

upvoted 2 papers 3 months ago

Unifying Group-Relative and Self-Distillation Policy Optimization via Sample Routing

Paper • 2604.02288 • Published Apr 2 • 32

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Paper • 2604.04921 • Published Apr 6 • 116