StylisticBias: A Few Human Visual Cues Drive Most Social Biases in MLLMs Paper • 2606.20527 • Published 15 days ago • 3
Who Flips? Self- and Cross-Model Counterarguments Reveal Answer Instability in LLMs Paper • 2606.16011 • Published 19 days ago • 4
GlotOCR Bench: OCR Models Still Struggle Beyond a Handful of Unicode Scripts Paper • 2604.12978 • Published Apr 14 • 5
MEXA: Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment Paper • 2410.05873 • Published Oct 8, 2024 • 3