Milad Aghajohari's picture

Milad Aghajohari

miladink

·

AI & ML interests

NLP, ML, Multi-Agent RL, SSL, AI

Recent Activity

upvoted a paper 7 days ago

The Reward Was in Your Data All Along: Correcting Flow Matching with Discriminator-Guided RL

upvoted a paper 5 months ago

InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning

upvoted a paper 8 months ago

Grounding Computer Use Agents on Human Demonstrations

View all activity

Organizations

upvoted a paper 7 days ago

The Reward Was in Your Data All Along: Correcting Flow Matching with Discriminator-Guided RL

Paper • 2606.19162 • Published 10 days ago • 20

upvoted a paper 5 months ago

InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning

Paper • 2602.06960 • Published Feb 6 • 14

upvoted a paper 8 months ago

Grounding Computer Use Agents on Human Demonstrations

Paper • 2511.07332 • Published Nov 10, 2025 • 107

upvoted a paper 9 months ago

The Markovian Thinker

Paper • 2510.06557 • Published Oct 8, 2025 • 33

upvoted a paper over 1 year ago

VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment

Paper • 2410.01679 • Published Oct 2, 2024 • 27