Submitted by
Yang Li
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
Learning from Language Feedback via Variational Policy Distillation
The Illusion of Certainty: Decoupling Capability and Calibration in On-Policy Distillation
Submitted by
Jiaxin Zhang
Submitted by
Sarath Shekkizhar
Submitted by
Jun Hao Liew
Submitted by
Shrey Pandit
Submitted by
Jiaxin Zhang
Submitted by
Jiaxin Zhang
Submitted by
Kanchana Ranasinghe
Submitted by
Jielin Qiu
Submitted by
Haoyi Qiu
Submitted by
taesiri
Submitted by
Austin Xu
Submitted by
Jiayu (Mila) Wang
Submitted by
taesiri
Submitted by
Weiran Yao
Submitted by
Can Qin
Submitted by
Weiran Yao
Submitted by
Ziyang Luo
Submitted by
Yan Yang
Submitted by
Zixuan Ke