A Gradient Perspective on RLVR Stability and Winner Advantage Policy Optimization Paper • 2606.16154 • Published 3 days ago • 6
A Gradient Perspective on RLVR Stability and Winner Advantage Policy Optimization Paper • 2606.16154 • Published 3 days ago • 6
Inconsistencies In Consistency Models: Better ODE Solving Does Not Imply Better Samples Paper • 2411.08954 • Published Nov 13, 2024 • 11
TR0N: Translator Networks for 0-Shot Plug-and-Play Conditional Generation Paper • 2304.13742 • Published Apr 26, 2023
X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval Paper • 2203.15086 • Published Mar 28, 2022