dailypaper
updated
Paper
• 2511.22475
• Published
• 24
DiP: Taming Diffusion Models in Pixel Space
Paper
• 2511.18822
• Published
• 29
Asking like Socrates: Socrates helps VLMs understand remote sensing images
Paper
• 2511.22396
• Published
• 5
Entropy Ratio Clipping as a Soft Global Constraint for Stable Reinforcement Learning
Paper
• 2512.05591
• Published
• 17
RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards
Paper
• 2512.00473
• Published
• 26
SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning
Paper
• 2512.03244
• Published
• 17
TreeGRPO: Tree-Advantage GRPO for Online RL Post-Training of Diffusion Models
Paper
• 2512.08153
• Published
• 8
SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder
Paper
• 2512.11749
• Published
• 39
Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models
Paper
• 2512.13607
• Published
• 36
REGLUE Your Latents with Global and Local Semantics for Entangled Diffusion
Paper
• 2512.16636
• Published
• 26
Rethinking Sample Polarity in Reinforcement Learning with Verifiable Rewards
Paper
• 2512.21625
• Published
• 4
Self-Evaluation Unlocks Any-Step Text-to-Image Generation
Paper
• 2512.22374
• Published
• 17
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
Paper
• 2601.05242
• Published
• 228
GARDO: Reinforcing Diffusion Models without Reward Hacking
Paper
• 2512.24138
• Published
• 29
Boosting Latent Diffusion Models via Disentangled Representation Alignment
Paper
• 2601.05823
• Published
• 17
Your Group-Relative Advantage Is Biased
Paper
• 2601.08521
• Published
• 155
Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders
Paper
• 2601.10332
• Published
• 28