Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories Paper • 2606.11176 • Published 20 days ago • 127
RhymeFlow: Training-Free Acceleration for Video Generation with Asynchronous Denoising Flow Scheduling Paper • 2606.06309 • Published 25 days ago • 11
Measuring Epistemic Resilience of LLMs Under Misleading Medical Context Paper • 2606.12291 • Published 19 days ago • 60
TRL-Bench: Standardizing Cross-Paradigm Representation-Level Evaluation of Tabular Encoders Paper • 2606.09323 • Published 21 days ago • 53
Dream.exe: Can Video Generation Models Dream Executable Robot Manipulation? Paper • 2606.04811 • Published 25 days ago • 17
The Alignment Curse: Modality Alignment Supercharges Audio Attacks via Text Transfer Paper • 2602.02557 • Published May 29 • 21
D^2-Monitor: Dynamic Safety Monitoring for Diffusion LLMs via Hesitation-Aware Routing Paper • 2605.25893 • Published May 25 • 39
Soap2Soap: Long Cinematic Video Remaking via Multi-Agent Collaboration Paper • 2605.17423 • Published May 17 • 34
Forecasting Scientific Progress with Artificial Intelligence Paper • 2605.22681 • Published May 21 • 45
CutVerse: A Compositional GUI Agents Benchmark for Media Post-Production Editing Paper • 2605.19484 • Published May 19 • 21
AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation Paper • 2605.13724 • Published May 13 • 105
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published May 4 • 355
TON Collection Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models. • 7 items • Updated May 23, 2025 • 2
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond Paper • 2604.22748 • Published Apr 24 • 231
GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents Paper • 2604.07429 • Published Apr 8 • 123
FORGE: Fine-grained Multimodal Evaluation for Manufacturing Scenarios Paper • 2604.07413 • Published Apr 8 • 97