4 1 2

David Rein

Idavidrein

AI & ML interests

safety/alignment, careful data collection projects, LLM evaluations, scalable oversight

Recent Activity

new activity 7 days ago

Idavidrein/gpqa:Set evaluation_framework to inspect-ai in eval.yaml

new activity about 2 months ago

Idavidrein/gpqa:adds_eval_yaml

new activity 11 months ago

Idavidrein/gpqa:Extra revised answer/incorrect_answer?

View all activity

Organizations

None yet

New activity in Idavidrein/gpqa 7 days ago

Set evaluation_framework to inspect-ai in eval.yaml

👍 1

#6 opened 23 days ago by

burtenshaw

New activity in Idavidrein/gpqa about 2 months ago

adds_eval_yaml

#5 opened 3 months ago by

SaylorTwift

New activity in Idavidrein/gpqa 11 months ago

Extra revised answer/incorrect_answer?

#4 opened about 1 year ago by

teddyyyy123

authored 2 papers over 1 year ago

Debate Helps Supervise Unreliable Experts

Paper • 2311.08702 • Published Nov 15, 2023 • 1

GPQA: A Graduate-Level Google-Proof Q&A Benchmark

Paper • 2311.12022 • Published Nov 20, 2023 • 36

updated a dataset almost 2 years ago

Idavidrein/gpqa

Benchmark • Updated 7 days ago • 1.25k • 104k • 379

upvoted a paper almost 2 years ago

GPQA: A Graduate-Level Google-Proof Q&A Benchmark

Paper • 2311.12022 • Published Nov 20, 2023 • 36

liked a dataset about 2 years ago

bigcode/humanevalpack

Viewer • Updated Aug 19, 2025 • 984 • 2k • 91

New activity in Idavidrein/gpqa over 2 years ago

Dataset Viewer issue

#1 opened over 2 years ago by