6 9

leoking

leokmax

AI & ML interests

None yet

Organizations

None yet

upvoted an article about 1 year ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

natolambert, LouisCastricato, lvwerra, Dahoas • Dec 9, 2022 • 417

upvoted 5 collections over 1 year ago

Deepseek Papers

Deepseek papers collection • 32 items • Updated 3 days ago • 352

LLM Pre-Train

16 items • Updated Jan 20, 2025 • 1

LLM Post Training

15 items • Updated Feb 1, 2025 • 1

LLM Reasoning Papers

improve reasoning capabilities of LLMs • 45 items • Updated Feb 18, 2025 • 6

LLM Tech Report

33 items • Updated Feb 21, 2025 • 2