Rethinking OPD - a lllyx Collection

lllyx 's Collections

updated 29 days ago

This collection includes the models used in the paper "Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recip

Upvote

lllyx/Qwen3-1.7B-SFT

Text Generation • 2B • Updated May 12 • 967 • 4
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 113
lllyx/Qwen3-4B-Base-GRPO

Text Generation • 4B • Updated May 3 • 675 • 3
lllyx/OpenThought3-Qwen3-4B

Viewer • Updated May 12 • 305k • 63 • 2
lllyx/Qwen3-1.7B-Base-OPD

Text Generation • 2B • Updated 29 days ago • 90

Upvote

Collection guide
Browse collections