Danliu
DanliuDanliu
·
AI & ML interests
None yet
Recent Activity
new activity
2 days ago
hamishivi/math_rlvr_mixture_dpo:What is the license for the dataset?
upvoted
a
paper
29 days ago
Reward Modeling from Natural Language Human Feedback
Organizations
None yet