Ko-WideSearch: A Korean Breadth-Search Benchmark for Exhaustive Set Enumeration by Web Agents Paper • 2606.27595 • Published 4 days ago • 3
OpenBioRQ: Unsolved Biomedical Research Questions for Agents Paper • 2606.21959 • Published 9 days ago • 3
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling Paper • 2312.15166 • Published Dec 23, 2023 • 62
Evalverse: Unified and Accessible Library for Large Language Model Evaluation Paper • 2404.00943 • Published Apr 1, 2024 • 1
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling Paper • 2312.15166 • Published Dec 23, 2023 • 62
Evalverse: Unified and Accessible Library for Large Language Model Evaluation Paper • 2404.00943 • Published Apr 1, 2024 • 1