DSAEval: Evaluating Data Science Agents on a Wide Range of Real-World Data Science Problems Paper • 2601.13591 • Published Jan 20 • 2
DSAEval: Evaluating Data Science Agents on a Wide Range of Real-World Data Science Problems Paper • 2601.13591 • Published Jan 20 • 2
Running on CPU Upgrade 13.9k Open LLM Leaderboard 🏆 13.9k Track, rank and evaluate open LLMs and chatbots