TextEditBench: Evaluating Reasoning-aware Text Editing Beyond Rendering Paper • 2512.16270 • Published 8 days ago
VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation Paper • 2511.02778 • Published Nov 4 • 101