Rajdeep Haldar's picture

1 1

Rajdeep Haldar

rhaldar97

AI & ML interests

Adversarial Robustness Computer Vision LLM Human Alignment

Recent Activity

submitted a paper about 14 hours ago

f-GRPO and Beyond: Divergence-Based Reinforcement Learning Algorithms for General LLM Alignment

liked a dataset 10 months ago

argilla/distilabel-math-preference-dpo

updated a dataset about 1 year ago

rhaldar97/Safety_preference

View all activity

Organizations

None yet

liked a dataset 10 months ago

argilla/distilabel-math-preference-dpo

Viewer • Updated Jul 16, 2024 • 2.42k • 488 • 88