Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Milad Aghajohari's picture
2 5 3

Milad Aghajohari

miladink
GuillaumeZ's profile picture Moreza009's profile picture
·
  • maghajohari
  • miladink

AI & ML interests

NLP, ML, Multi-Agent RL, SSL, AI

Recent Activity

upvoted a paper 7 days ago
The Reward Was in Your Data All Along: Correcting Flow Matching with Discriminator-Guided RL
upvoted a paper 5 months ago
InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning
upvoted a paper 8 months ago
Grounding Computer Use Agents on Human Demonstrations
View all activity

Organizations

MathMinds AGI's profile picture

upvoted a paper 7 days ago

The Reward Was in Your Data All Along: Correcting Flow Matching with Discriminator-Guided RL

Paper • 2606.19162 • Published 10 days ago • 20
upvoted a paper 5 months ago

InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning

Paper • 2602.06960 • Published Feb 6 • 14
upvoted a paper 8 months ago

Grounding Computer Use Agents on Human Demonstrations

Paper • 2511.07332 • Published Nov 10, 2025 • 107
upvoted a paper 9 months ago

The Markovian Thinker

Paper • 2510.06557 • Published Oct 8, 2025 • 33
upvoted a paper over 1 year ago

VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment

Paper • 2410.01679 • Published Oct 2, 2024 • 27
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs