JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence Paper • 2606.14777 • Published 19 days ago • 204
Beyond Multiple Choice: Verifiable OpenQA for Robust Vision-Language RFT Paper • 2511.17405 • Published Nov 21, 2025 • 11
Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench Paper • 2510.26865 • Published Oct 30, 2025 • 12
FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions Paper • 2509.17177 • Published Sep 21, 2025 • 13
Article Letting Large Models Debate: The First Multilingual LLM Debate Competition • xuanricheng, lilaczheng, xiyang99, Yonghua, philokey, xuejing2409, graykingw, daiteng01, eyuansu71, Lyfly2024, xianbao, clefourrier • Nov 20, 2024 • 33