Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Alex Li's picture
1 8 20

Alex Li

alexyogo22
ksiabani's profile picture BigDog93's profile picture
·
  • AlexanderYogurt

AI & ML interests

Agents

Organizations

LangChainDatasets's profile picture

upvoted a paper 4 months ago

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Paper • 2509.08755 • Published Sep 10, 2025 • 57
upvoted a paper 5 months ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 229
upvoted 2 papers 6 months ago

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26, 2025 • 158

A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8, 2025 • 93
upvoted a paper 9 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18, 2025 • 144
upvoted a collection 9 months ago

Qwen3

Collection
84 items • Updated 26 days ago • 1.6k
upvoted an article 10 months ago
view article
Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

  • +4
Feb 4, 2025
•
123
upvoted an article 12 months ago
view article
Article

Open-R1: a fully open reproduction of DeepSeek-R1

  • +1
Jan 28, 2025
•
886
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs