VisionDocumentRetrieval Datasets • Collection • Datasets for vision document retrieval (VDR) • 21 items • Updated Nov 20 • 8
TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models • Paper • 2511.13704 • Published Nov 17 • 42
DocPruner: A Storage-Efficient Framework for Multi-Vector Visual Document Retrieval via Adaptive Patch-Level Embedding Pruning • Paper • 2509.23883 • Published Sep 28 • 1
Go with Your Gut: Scaling Confidence for Autoregressive Image Generation • Paper • 2509.26376 • Published Sep 30 • 9
AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models • Paper • 2505.16211 • Published May 22 • 18
Temporal Regularization Makes Your Video Generator Stronger • Paper • 2503.15417 • Published Mar 19 • 22
OmniCreator: Self-Supervised Unified Generation with Universal Editing • Paper • 2412.02114 • Published Dec 3, 2024 • 14