Beyond NL2Code: A Structured Survey of Multimodal Code Intelligence Paper • 2606.15932 • Published 18 days ago • 38
TreeCUA: Efficiently Scaling GUI Automation with Tree-Structured Verifiable Evolution Paper • 2602.09662 • Published Feb 10 • 6
Reading or Reasoning? Format Decoupled Reinforcement Learning for Document OCR Paper • 2601.08834 • Published Dec 11, 2025 • 1
OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models Paper • 2601.21639 • Published Jan 29 • 52
Efficient Video Action Detection with Token Dropout and Context Refinement Paper • 2304.08451 • Published Apr 17, 2023 • 1