2 8 6

Xuanfei Ren

xuanfeiren

xuanfeiren

AI & ML interests

RL and LLM

Recent Activity

commentedon a paper 7 days ago

When Does Trajectory-Level Supervision Permit Efficient Offline Reinforcement Learning?

authored a paper 7 days ago

When Does Trajectory-Level Supervision Permit Efficient Offline Reinforcement Learning?

upvoted a paper 8 days ago

When Does Trajectory-Level Supervision Permit Efficient Offline Reinforcement Learning?

View all activity

Organizations

None yet

commented a paper 7 days ago

When Does Trajectory-Level Supervision Permit Efficient Offline Reinforcement Learning?

Paper • 2606.18531 • Published 11 days ago • 4 •

authored a paper 7 days ago

When Does Trajectory-Level Supervision Permit Efficient Offline Reinforcement Learning?

Paper • 2606.18531 • Published 11 days ago • 4

upvoted a paper 8 days ago

When Does Trajectory-Level Supervision Permit Efficient Offline Reinforcement Learning?

Paper • 2606.18531 • Published 11 days ago • 4

submitted a paper to Daily Papers 8 days ago

When Does Trajectory-Level Supervision Permit Efficient Offline Reinforcement Learning?

Paper • 2606.18531 • Published 11 days ago • 4

upvoted a paper 30 days ago

SkillGrad: Optimizing Agent Skills Like Gradient Descent

Paper • 2605.27760 • Published May 26 • 27

upvoted 3 papers 3 months ago

Provably Learning from Language Feedback

Paper • 2506.10341 • Published Jun 12, 2025 • 9

Understanding the Challenges in Iterative Generative Optimization with LLMs

Paper • 2603.23994 • Published Mar 25 • 29

Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States

Paper • 2603.19987 • Published Mar 20 • 9

authored a paper 3 months ago

POLCA: Stochastic Generative Optimization with LLM

Paper • 2603.14769 • Published Mar 16 • 23

upvoted 2 papers 3 months ago

Needle In A Multimodal Haystack

Paper • 2406.07230 • Published Jun 11, 2024 • 55

POLCA: Stochastic Generative Optimization with LLM

Paper • 2603.14769 • Published Mar 16 • 23

liked a dataset 7 months ago

allenanie/veribench_with_prompts

Viewer • Updated Oct 7, 2025 • 140 • 49 • 1

liked a dataset 8 months ago

allenanie/kernelbench_with_prompts

Viewer • Updated Oct 8, 2025 • 500 • 95 • 1

liked a model about 1 year ago

Qwen/Qwen3-30B-A3B

Text Generation • 31B • Updated Jul 26, 2025 • 2.61M • 903

updated a dataset about 1 year ago

xuanfeiren/math_hard_gemini

Viewer • Updated May 19, 2025 • 127 • 30

published a dataset about 1 year ago

xuanfeiren/math_hard_gemini

Viewer • Updated May 19, 2025 • 127 • 30

New activity in xuanfeiren/math_hard about 1 year ago

[bot] Conversion to Parquet

#1 opened about 1 year ago by

parquet-converter

updated a dataset about 1 year ago

xuanfeiren/math_hard

Viewer • Updated Apr 25, 2025 • 248 • 9

published a dataset about 1 year ago

xuanfeiren/math_hard

Viewer • Updated Apr 25, 2025 • 248 • 9

liked a dataset about 1 year ago

EleutherAI/hendrycks_math

Viewer • Updated Jan 12, 2025 • 12.5k • 110k • 106

Xuanfei Ren

AI & ML interests

Recent Activity

Organizations

xuanfeiren's activity

[bot] Conversion to Parquet