alphaXiv's Collections

Agent-R1

updated about 14 hours ago

  • alphaXiv/Qwen-2.5-1.5b-instruct-ppo

    2B • Updated about 14 hours ago • 29

  • alphaXiv/Qwen-2.5-1.5b-instruct-grpo

    2B • Updated about 14 hours ago • 16