Charting and Navigating Hugging Face's Model Atlas Paper β’ 2503.10633 β’ Published Mar 13, 2025 β’ 92
CLEAR: Error Analysis via LLM-as-a-Judge Made Easy Paper β’ 2507.18392 β’ Published Jul 24, 2025 β’ 20
Story2Board: A Training-Free Approach for Expressive Storyboard Generation Paper β’ 2508.09983 β’ Published Aug 13, 2025 β’ 69
Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games Paper β’ 2506.05309 β’ Published Jun 5, 2025 β’ 16
Auto-Regressive vs Flow-Matching: a Comparative Study of Modeling Paradigms for Text-to-Music Generation Paper β’ 2506.08570 β’ Published Jun 10, 2025 β’ 33
Debatable Intelligence: Benchmarking LLM Judges via Debate Speech Evaluation Paper β’ 2506.05062 β’ Published Jun 5, 2025 β’ 15
StressTest: Can YOUR Speech LM Handle the Stress? Paper β’ 2505.22765 β’ Published May 28, 2025 β’ 17
CHIMERA: A Knowledge Base of Idea Recombination in Scientific Literature Paper β’ 2505.20779 β’ Published May 27, 2025 β’ 15
Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning Paper β’ 2505.17813 β’ Published May 23, 2025 β’ 58
WHISTRESS: Enriching Transcriptions with Sentence Stress Detection Paper β’ 2505.19103 β’ Published May 25, 2025 β’ 13
RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation Paper β’ 2504.17502 β’ Published Apr 24, 2025 β’ 55
Scaling Analysis of Interleaved Speech-Text Language Models Paper β’ 2504.02398 β’ Published Apr 3, 2025 β’ 31