Jin

dsjinx

6 9

·

AI & ML interests

None yet

Organizations

None yet

upvoted an article 4 months ago

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

sirluk

•

Oct 7, 2024

• 71

upvoted an article about 1 year ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

+2

natolambert, LouisCastricato, lvwerra, Dahoas

•

Dec 9, 2022

• 418

upvoted 2 collections over 1 year ago

TinyR1

4 items • Updated Mar 2 • 4

Light-R1

Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond • 7 items • Updated Oct 15, 2025 • 12

upvoted 2 articles over 1 year ago

Article

Open R1: Update #2

open-r1

•

Feb 10, 2025

• 219

Article

Open-R1: a fully open reproduction of DeepSeek-R1

+1

eliebak, lvwerra, lewtun

•

Jan 28, 2025

• 890