OpenGVL - Benchmarking Visual Temporal Progress for Data Curation Paper • 2509.17321 • Published Sep 22, 2025 • 3
Behind Closed Words: Creating and Investigating the forePLay Annotated Dataset for Polish Erotic Discourse Paper • 2412.17533 • Published Dec 23, 2024 • 1
Beyond Recognition: Evaluating Visual Perspective Taking in Vision Language Models Paper • 2505.03821 • Published May 3, 2025 • 24
BAN-PL: a Novel Polish Dataset of Banned Harmful and Offensive Content from Wykop.pl web service Paper • 2308.10592 • Published Aug 21, 2023
What Matters in Hierarchical Search for Combinatorial Reasoning Problems? Paper • 2406.03361 • Published Jun 5, 2024 • 1
Seeing Through Their Eyes: Evaluating Visual Perspective Taking in Vision Language Models Paper • 2409.12969 • Published Sep 2, 2024 • 1
When All Options Are Wrong: Evaluating Large Language Model Robustness with Incorrect Multiple-Choice Options Paper • 2409.00113 • Published Aug 27, 2024 • 2