Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
rain's picture
3 3 1

rain

dd12345789

AI & ML interests

None yet

Recent Activity

updated a dataset 3 days ago
dd12345789/Self-Supervised_RL
new activity 4 days ago
dd12345789/Self-Supervised_RL:Add dataset card, link to paper and code
new activity 4 days ago
dd12345789/Self-Supervised_RL:[bot] Conversion to Parquet
View all activity

Organizations

None yet

upvoted a paper 16 days ago

Step-by-Step Mastery: Enhancing Soft Constraint Following Ability of Large Language Models

Paper • 2501.04945 • Published Jan 9, 2025 • 1
upvoted a paper 3 months ago

LSRIF: Logic-Structured Reinforcement Learning for Instruction Following

Paper • 2601.06431 • Published Jan 10 • 12
upvoted a paper 7 months ago

Beyond the Trade-off: Self-Supervised Reinforcement Learning for Reasoning Models' Instruction Following

Paper • 2508.02150 • Published Aug 4, 2025 • 37
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs