Request evaluation for a new model
Answer questions using images and Chinese text
Track, rank and evaluate open LLMs and chatbots