zhepeihong
peregrine123
AI & ML interests
Post-training, On-policy Distillation
Recent Activity
submitted a paper about 16 hours ago
Rubric-based On-policy Distillation authored a paper about 23 hours ago
Rubric-based On-policy Distillation upvoted a paper about 24 hours ago
Rubric-based On-policy DistillationOrganizations
None yet