Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
4
1
2
David Rein
Idavidrein
Follow
j14i's profile picture
thomwolf's profile picture
YuehHanChen's profile picture
17 followers
·
0 following
idavidrein
idavidrein
AI & ML interests
safety/alignment, careful data collection projects, LLM evaluations, scalable oversight
Recent Activity
new
activity
7 days ago
Idavidrein/gpqa:
Set evaluation_framework to inspect-ai in eval.yaml
new
activity
about 2 months ago
Idavidrein/gpqa:
adds_eval_yaml
new
activity
11 months ago
Idavidrein/gpqa:
Extra revised answer/incorrect_answer?
View all activity
Organizations
None yet
Idavidrein
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
Idavidrein/gpqa
7 days ago
Set evaluation_framework to inspect-ai in eval.yaml
👍
1
2
#6 opened 23 days ago by
burtenshaw
New activity in
Idavidrein/gpqa
about 2 months ago
adds_eval_yaml
2
#5 opened 3 months ago by
SaylorTwift
New activity in
Idavidrein/gpqa
11 months ago
Extra revised answer/incorrect_answer?
1
#4 opened about 1 year ago by
teddyyyy123
authored
2 papers
over 1 year ago
Debate Helps Supervise Unreliable Experts
Paper
•
2311.08702
•
Published
Nov 15, 2023
•
1
GPQA: A Graduate-Level Google-Proof Q&A Benchmark
Paper
•
2311.12022
•
Published
Nov 20, 2023
•
36
updated
a dataset
almost 2 years ago
Idavidrein/gpqa
Benchmark
•
Updated
7 days ago
•
1.25k
•
104k
•
379
upvoted
a
paper
almost 2 years ago
GPQA: A Graduate-Level Google-Proof Q&A Benchmark
Paper
•
2311.12022
•
Published
Nov 20, 2023
•
36
liked
a dataset
about 2 years ago
bigcode/humanevalpack
Viewer
•
Updated
Aug 19, 2025
•
984
•
2k
•
91
New activity in
Idavidrein/gpqa
over 2 years ago
Dataset Viewer issue
3
#1 opened over 2 years ago by
Idavidrein
liked
a dataset
over 2 years ago
Idavidrein/gpqa
Benchmark
•
Updated
7 days ago
•
1.25k
•
104k
•
379