arxiv:2601.22975
Jian Hu
chuyi777
AI & ML interests
Reinforcement Learning
Recent Activity
upvoted a paper 29 days ago
PhyCritic: Multimodal Critic Models for Physical AI
updated a dataset about 1 month ago
OpenRLHF/aime-2024
updated a dataset about 1 month ago
OpenRLHF/dapo-math-17k