rain

dd12345789

AI & ML interests

None yet

Recent Activity

updated a dataset 3 days ago

dd12345789/Self-Supervised_RL

new activity 4 days ago

dd12345789/Self-Supervised_RL:Add dataset card, link to paper and code

new activity 4 days ago

dd12345789/Self-Supervised_RL:[bot] Conversion to Parquet

Organizations

None yet

upvoted a paper 16 days ago

Step-by-Step Mastery: Enhancing Soft Constraint Following Ability of Large Language Models

Paper • 2501.04945 • Published Jan 9, 2025 • 1

upvoted a paper 3 months ago

LSRIF: Logic-Structured Reinforcement Learning for Instruction Following

Paper • 2601.06431 • Published Jan 10 • 12

upvoted a paper 7 months ago

Beyond the Trade-off: Self-Supervised Reinforcement Learning for Reasoning Models' Instruction Following

Paper • 2508.02150 • Published Aug 4, 2025 • 37