Rajdeep Haldar's picture

1 1

Rajdeep Haldar

rhaldar97

AI & ML interests

Adversarial Robustness Computer Vision LLM Human Alignment

Recent Activity

submitted a paper 1 day ago

f-GRPO and Beyond: Divergence-Based Reinforcement Learning Algorithms for General LLM Alignment

liked a dataset 10 months ago

argilla/distilabel-math-preference-dpo

updated a dataset about 1 year ago

rhaldar97/Safety_preference

View all activity

Organizations

None yet

submitted a paper to Daily Papers 1 day ago

f-GRPO and Beyond: Divergence-Based Reinforcement Learning Algorithms for General LLM Alignment

Paper • 2602.05946 • Published 6 days ago

liked a dataset 10 months ago

argilla/distilabel-math-preference-dpo

Viewer • Updated Jul 16, 2024 • 2.42k • 488 • 88

updated a dataset about 1 year ago

rhaldar97/Safety_preference

Viewer • Updated Dec 1, 2024 • 1.26k • 1

updated a dataset over 1 year ago

rhaldar97/Safety_Accept_Reject

Viewer • Updated Nov 1, 2024 • 1.26k