A2RBench: An Automatic Paradigm for Formally Verifiable Abstract Reasoning Benchmark Generation Paper • 2605.17278 • Published 3 days ago • 2
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models Paper • 2603.16859 • Published Mar 17 • 248
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated Mar 12 • 355