MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models
Paper • 2410.08182 • Published
Natural Language Processing, Bias and Fairness in NLP
OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks
LLMs as Scalable, General-Purpose Simulators For Evolving Digital Agent Training