Dhruv Kumar

dhruvkumar304

AI & ML interests

None yet

upvoted an article 10 months ago

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Article • Apr 5, 2023 • 48

upvoted a paper 11 months ago

A General Theoretical Paradigm to Understand Learning from Human Preferences

Paper • 2310.12036 • Published Oct 18, 2023 • 19

upvoted an article 11 months ago

Preference Tuning LLMs with Direct Preference Optimization Methods

Article • Jan 18, 2024 • 81