1 15 1

zhanghengyuan

hengyuanya

rattlesnakey

AI & ML interests

None yet

Recent Activity

upvoted a paper 28 days ago

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

upvoted a paper 3 months ago

Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability

submitted a paper 4 months ago

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

View all activity

Organizations

None yet

upvoted a paper 28 days ago

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Paper • 2604.10098 • Published Apr 11 • 80

upvoted a paper 3 months ago

Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability

Paper • 2602.02477 • Published Feb 2 • 11

submitted a paper to Daily Papers 4 months ago

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

Paper • 2601.14004 • Published Jan 20 • 48

authored a paper 4 months ago

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

Paper • 2601.14004 • Published Jan 20 • 48

upvoted 4 papers 4 months ago

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

Paper • 2601.14004 • Published Jan 20 • 48

ToolPRMBench: Evaluating and Advancing Process Reward Models for Tool-using Agents

Paper • 2601.12294 • Published Jan 18 • 19

MMFormalizer: Multimodal Autoformalization in the Wild

Paper • 2601.03017 • Published Jan 6 • 106

SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving

Paper • 2601.01426 • Published Jan 4 • 24

upvoted a collection 6 months ago

Long_CoT_Degradation_SFT

Collection

Checkpoint for Long CoT Degradation • 59 items • Updated Mar 2 • 2

updated a dataset 7 months ago

hengyuanya/PerSyn_dataset

Viewer • Updated Oct 17, 2025 • 157k • 33

published a dataset 7 months ago

hengyuanya/PerSyn_dataset

Viewer • Updated Oct 17, 2025 • 157k • 33

upvoted a paper 7 months ago

Who's Your Judge? On the Detectability of LLM-Generated Judgments

Paper • 2509.25154 • Published Sep 29, 2025 • 30

upvoted 2 papers 9 months ago

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19, 2025 • 119

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published Aug 2, 2025 • 240

upvoted a paper 12 months ago

The Quest for Efficient Reasoning: A Data-Centric Benchmark to CoT Distillation

Paper • 2505.18759 • Published May 24, 2025 • 14

liked a dataset 12 months ago

Cloudriver/PhyX

Viewer • Updated Mar 16 • 17k • 3.18k • 25

upvoted a paper 12 months ago

PhyX: Does Your Model Have the "Wits" for Physical Reasoning?

Paper • 2505.15929 • Published May 21, 2025 • 49

upvoted a paper about 1 year ago

Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language Model

Paper • 2404.10306 • Published Apr 16, 2024 • 1

upvoted 2 papers over 1 year ago

Preference Leakage: A Contamination Problem in LLM-as-a-judge

Paper • 2502.01534 • Published Feb 3, 2025 • 40

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

Paper • 2501.12895 • Published Jan 22, 2025 • 61

zhanghengyuan

AI & ML interests

Recent Activity

Organizations

hengyuanya's activity