I am deeply interested in large multimodal models (LMMs), large language models (LLMs), deep learning, transformers, machine learning, and data science. My passion lies in exploring cutting-edge AI technologies, understanding their architectures, and applying them to solve real-world challenges.
I’d like to suggest a missing benchmark, ARB: A Comprehensive Arabic Multimodal Reasoning Benchmark. It focuses on evaluating the multimodal and reasoning capabilities of Arabic models. Please consider adding it to the list!