view article Article Community Evals: Because we're done trusting black-box leaderboards over the community +5 9 days ago • 63
view article Article The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare +1 Apr 19, 2024 • 193
bartowski/ServiceNow-AI_Apriel-Nemotron-15b-Thinker-GGUF Text Generation • 15B • Updated May 8, 2025 • 1.42k • 13