Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

ReasonMind

https://utmathhomepage.github.io/
Activity Feed

AI & ML interests

None defined yet.

myw's profile picture Qingping Yang's profile picture Bo Yang's profile picture Rt Liu's profile picture

qingping95 
authored a paper 7 months ago

AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning

Paper • 2505.11896 • Published May 17 • 58
qingping95 
authored a paper 9 months ago

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published Mar 28 • 45
Leonardoby 
updated a dataset 12 months ago

ReasonMind/UTMath

Viewer • Updated Jan 14 • 1.05k • 297 • 7
qingping95 
updated a dataset about 1 year ago

ReasonMind/UTMath

Viewer • Updated Jan 14 • 1.05k • 297 • 7
Leonardoby 
updated 2 datasets about 1 year ago

ReasonMind/UTMath

Viewer • Updated Jan 14 • 1.05k • 297 • 7

ReasonMind/UTMath

Viewer • Updated Jan 14 • 1.05k • 297 • 7
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs