Fine-T2I: An Open, Large-Scale, and Diverse Dataset for High-Quality T2I Fine-Tuning Paper • 2602.09439 • Published 7 days ago • 13
TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents Paper • 2602.07274 • Published 11 days ago • 196
Neural Discrete Token Representation Learning for Extreme Token Reduction in Video Large Language Models Paper • 2503.16980 • Published Mar 21, 2025 • 3
VIDEOP2R: Video Understanding from Perception to Reasoning Paper • 2511.11113 • Published Nov 14, 2025 • 111
Camouflaged Image Synthesis Is All You Need to Boost Camouflaged Detection Paper • 2308.06701 • Published Aug 13, 2023 • 1
OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising Paper • 2404.02227 • Published Apr 2, 2024
Dense Video Understanding with Gated Residual Tokenization Paper • 2509.14199 • Published Sep 17, 2025 • 2
Neural Discrete Token Representation Learning for Extreme Token Reduction in Video Large Language Models Paper • 2503.16980 • Published Mar 21, 2025 • 3
Dense Video Understanding with Gated Residual Tokenization Paper • 2509.14199 • Published Sep 17, 2025 • 2