Reward Bench Leaderboard 📐 • 432 • Explore and compare model scores on RewardBench benchmarks
NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO Text Generation • 47B • Updated Apr 30, 2024 • 9.42k • 453
Open LLM Leaderboard 🏆 • 14k • Track, rank and evaluate open LLMs and chatbots