10 2

Paiheng Xu

paiheng

AI & ML interests

None yet

Recent Activity

upvoted a paper 22 days ago

OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification

upvoted a paper 22 days ago

Conditional Hypothesis Generation for LLM-Based Text Analysis with Researcher-Specified Covariates

submitted a paper 22 days ago

Conditional Hypothesis Generation for LLM-Based Text Analysis with Researcher-Specified Covariates

View all activity

Organizations

upvoted 2 papers 22 days ago

OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification

Paper • 2606.01476 • Published 25 days ago • 8

Conditional Hypothesis Generation for LLM-Based Text Analysis with Researcher-Specified Covariates

Paper • 2606.03029 • Published 23 days ago • 6

submitted a paper to Daily Papers 22 days ago

Conditional Hypothesis Generation for LLM-Based Text Analysis with Researcher-Specified Covariates

Paper • 2606.03029 • Published 23 days ago • 6

upvoted a paper 3 months ago

Synthetic Sandbox for Training Machine Learning Engineering Agents

Paper • 2604.04872 • Published Apr 6 • 14

upvoted a paper 6 months ago

Token-Level LLM Collaboration via FusionRoute

Paper • 2601.05106 • Published Jan 8 • 40

upvoted a paper 9 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 276

upvoted 2 papers 10 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31, 2025 • 85

Self-Rewarding Vision-Language Model via Reasoning Decomposition

Paper • 2508.19652 • Published Aug 27, 2025 • 85

upvoted 2 papers about 1 year ago

Skill Discovery for Software Scripting Automation via Offline Simulations with LLMs

Paper • 2504.20406 • Published Apr 29, 2025 • 8

MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning

Paper • 2506.05523 • Published Jun 5, 2025 • 34

authored a paper about 1 year ago

Skill Discovery for Software Scripting Automation via Offline Simulations with LLMs

Paper • 2504.20406 • Published Apr 29, 2025 • 8

upvoted a paper about 1 year ago

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

Paper • 2504.07934 • Published Apr 10, 2025 • 21

liked a model over 2 years ago

zli12321/answer_equivalence_distilroberta

Text Classification • 82.1M • Updated Feb 4, 2025 • 8 • 3

liked a Space almost 3 years ago

Model Memory Utility

🚀

1.01k

Calculate GPU memory needed for training Hugging Face models

Paiheng Xu

AI & ML interests

Recent Activity

Organizations

paiheng's activity

Model Memory Utility