Can Large Language Models Keep Up? Benchmarking Online Adaptation to Continual Knowledge Streams Paper • 2603.07392 • Published 5 days ago • 13
M-Prometheus Collection Open multilingual LLM judges for automatic evaluation. • 6 items • Updated Apr 8, 2025 • 9
EXAONE-Deep Collection EXAONE reasoning model series of 2.4B, 7.8B, and 32B, optimized for reasoning tasks including math and coding • 10 items • Updated Jul 7, 2025 • 96