Hugging Face
Rajdeep Haldar (rhaldar97)
Linked handles: rhaldarpurdue · rajdeeph
AI & ML interests
Adversarial Robustness, Computer Vision, LLM Human Alignment
Recent Activity
- Submitted a paper (about 11 hours ago): f-GRPO and Beyond: Divergence-Based Reinforcement Learning Algorithms for General LLM Alignment
- Liked a dataset (10 months ago): argilla/distilabel-math-preference-dpo
- Updated a dataset (about 1 year ago): rhaldar97/Safety_preference
Organizations: None yet
rhaldar97's datasets (2)
rhaldar97/Safety_preference · Viewer · Updated Dec 1, 2024 · 1.26k · 1
rhaldar97/Safety_Accept_Reject · Viewer · Updated Nov 1, 2024 · 1.26k