Rohan Surana

rohan2810

AI & ML interests

None yet

Recent Activity

updated a model about 1 month ago

rohan2810/REBUTTAL_OURS_SFT_musique_Llama-3.2-3B

published a model about 1 month ago

rohan2810/REBUTTAL_OURS_SFT_musique_Llama-3.2-3B

updated a model about 1 month ago

rohan2810/REBUTTAL_BASELINE_SFT_musique_Llama-3.2-3B

Organizations

None yet

upvoted a paper about 1 month ago

MASS-DPO: Multi-negative Active Sample Selection for Direct Policy Optimization

Paper • 2605.10784 • Published May 11 • 1

upvoted a paper about 2 months ago

Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning

Paper • 2605.02913 • Published Apr 8 • 9