- Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO) (Jan 19, 2025)
- Illustrating Reinforcement Learning from Human Feedback (RLHF) (Dec 9, 2022)