Xuanfei Ren

xuanfeiren

AI & ML interests

RL and LLM

Recent Activity

commented on a paper 12 days ago

When Does Trajectory-Level Supervision Permit Efficient Offline Reinforcement Learning?

authored a paper 13 days ago

When Does Trajectory-Level Supervision Permit Efficient Offline Reinforcement Learning?

upvoted a paper 13 days ago

When Does Trajectory-Level Supervision Permit Efficient Offline Reinforcement Learning?

Organizations

None yet

upvoted a paper 13 days ago

When Does Trajectory-Level Supervision Permit Efficient Offline Reinforcement Learning?

Paper • 2606.18531 • Published 16 days ago • 4

upvoted a paper about 1 month ago

SkillGrad: Optimizing Agent Skills Like Gradient Descent

Paper • 2605.27760 • Published May 26 • 27

upvoted 3 papers 3 months ago

Provably Learning from Language Feedback

Paper • 2506.10341 • Published Jun 12, 2025 • 9

Understanding the Challenges in Iterative Generative Optimization with LLMs

Paper • 2603.23994 • Published Mar 25 • 29

Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States

Paper • 2603.19987 • Published Mar 20 • 9

upvoted 2 papers 4 months ago

Needle In A Multimodal Haystack

Paper • 2406.07230 • Published Jun 11, 2024 • 55

POLCA: Stochastic Generative Optimization with LLM

Paper • 2603.14769 • Published Mar 16 • 23

upvoted an article over 1 year ago

Open-source DeepResearch – Freeing our search agents

Article • m-ric, albertvillanova, merve, thomwolf, clefourrier, +3 • Feb 4, 2025 • 1.32k