EvoTok: A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation Paper • 2603.12108 • Published about 18 hours ago • 7
FIRM-Reward Collection The data and models of "Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation" • 8 items • Updated 2 days ago • 3
GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing Paper • 2603.12264 • Published about 16 hours ago • 13
InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing Paper • 2603.09877 • Published 3 days ago • 37
Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports Paper • 2603.09896 • Published 3 days ago • 24
Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence Paper • 2603.07660 • Published 5 days ago • 76
RISE-Video: Can Video Generators Decode Implicit World Rules? Paper • 2602.05986 • Published Feb 5 • 26
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows Paper • 2512.16969 • Published Dec 18, 2025 • 120
SGI-Bench Collection Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows • 8 items • Updated 3 days ago • 33
Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform Paper • 2512.08478 • Published Dec 9, 2025 • 77