CiteAudit: You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era Paper • 2602.23452 • Published 27 days ago • 17
CiteAudit: You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era Paper • 2602.23452 • Published 27 days ago • 17
view article Article Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty? 29 days ago • 13
view article Article Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty? 29 days ago • 13
LiveMedBench: A Contamination-Free Medical Benchmark for LLMs with Automated Rubric Evaluation Paper • 2602.10367 • Published Feb 10 • 13
Horizon-LM: A RAM-Centric Architecture for LLM Training Paper • 2602.04816 • Published Feb 4 • 18
Horizon-LM: A RAM-Centric Architecture for LLM Training Paper • 2602.04816 • Published Feb 4 • 18
Digital Twin AI: Opportunities and Challenges from Large Language Models to World Models Paper • 2601.01321 • Published Jan 4 • 20
Building a Foundational Guardrail for General Agentic Systems via Synthetic Data Paper • 2510.09781 • Published Oct 10, 2025 • 27
LLMs4All: A Review on Large Language Models for Research and Applications in Academic Disciplines Paper • 2509.19580 • Published Sep 23, 2025 • 14
EfficientLLM: Efficiency in Large Language Models Paper • 2505.13840 • Published May 20, 2025 • 25
EfficientLLM: Efficiency in Large Language Models Paper • 2505.13840 • Published May 20, 2025 • 25 • 1
MLP-KAN: Unifying Deep Representation and Function Learning Paper • 2410.03027 • Published Oct 3, 2024 • 31
MLP-KAN: Unifying Deep Representation and Function Learning Paper • 2410.03027 • Published Oct 3, 2024 • 31 • 3
Mora: Enabling Generalist Video Generation via A Multi-Agent Framework Paper • 2403.13248 • Published Mar 20, 2024 • 78