Arbitrary Reading Order Scene Text Spotter with Local Semantics Guidance Paper • 2412.10159 • Published Dec 13, 2024
IMTBench: A Multi-Scenario Cross-Modal Collaborative Evaluation Benchmark for In-Image Machine Translation Paper • 2603.10495 • Published Apr 1
Learn where to Click from Yourself: On-Policy Self-Distillation for GUI Grounding Paper • 2605.00642 • Published 2 days ago