Rethinking Composed Image Retrieval Evaluation: A Fine-Grained Benchmark from Image Editing Paper โข 2601.16125 โข Published 1 day ago โข 13
SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks Paper โข 2507.01001 โข Published Jul 1, 2025 โข 46