Nathan Habib PRO
AI & ML interests
Evals
Recent Activity
upvoted
an
article
about 10 hours ago
Community Evals: Because we're done trusting black-box leaderboards over the community
new activity
1 day ago
MathArena/aime_2025:adds-evalyaml
published
an
article
2 days ago
Community Evals: Because we're done trusting black-box leaderboards over the community