OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models Paper • 2601.21639 • Published 2 days ago • 42
Efficient Video Action Detection with Token Dropout and Context Refinement Paper • 2304.08451 • Published Apr 17, 2023 • 1