SketchVLM: Vision language models can annotate images to explain thoughts and guide users Paper • 2604.22875 • Published 27 days ago • 35
VideoGameBunny: Towards vision assistants for video games Paper • 2407.15295 • Published Jul 21, 2024 • 23