Watch, Remember, Reason: Human-View Video Understanding with MLLMs Paper • 2606.07433 • Published 21 days ago • 21
Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs Paper • 2605.30611 • Published 29 days ago • 247
Muapi/lens-flares-and-backlighting-effects-for-flux-by-ethanar Text-to-Image • Updated 24 days ago • 9 • • 1
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published May 20 • 207
Lighting-grounded Video Generation with Renderer-based Agent Reasoning Paper • 2604.07966 • Published Apr 9 • 10