·
AI & ML interests
Language Translation
Organizations
view article AutoBench Third Run: Revolutionizing LLM Evaluation with Record-Breaking Scale, Accuracy, and a New Home at autobench.org
PeterKruger
• • 6
upvoted an article about 1 year ago view article AutoBench Run 2 Results are Out! Surprise: Gemini 2.5 Pro is not the Best Affordable Thinking Model
PeterKruger
• • 6