VLMEvalKit Evaluation Results Collection
Extract text or generate Markdown from images
Convert screenshots to structured elements
View LLM performance rankings
Generate code from text prompts