Lazy Beaver

Jayce-Ping

AI & ML interests

None yet

Recent Activity

liked a model 8 days ago

zai-org/GLM-5.2

authored a paper 15 days ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

updated a model 15 days ago

Tencent-Hunyuan-Multimodal-RL/FLUX2-klein-base-9b-GenEval2-Multi-Reward

upvoted 2 papers 16 days ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Paper • 2606.11025 • Published 17 days ago • 41

Rethinking the Divergence Regularization in LLM RL

Paper • 2606.09821 • Published 18 days ago • 33

upvoted a paper about 2 months ago

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

Paper • 2605.05185 • Published May 6 • 106

upvoted a paper 4 months ago

Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing

Paper • 2603.03143 • Published Mar 3 • 145

upvoted a paper 7 months ago

PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling

Paper • 2512.04784 • Published Dec 2, 2025 • 25