Yoav Gur-Arieh

yoavgurarieh

1 6 3

https://yoav.ml

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Faithfulness Metrics Don't Measure Faithfulness: A Meta-Evaluation with Ground Truth

authored a paper about 2 months ago

Precise In-Parameter Concept Erasure in Large Language Models

authored a paper about 2 months ago

Enhancing Automated Interpretability with Output-Centric Feature Descriptions

View all activity

Organizations

None yet

upvoted a paper about 2 months ago

Faithfulness Metrics Don't Measure Faithfulness: A Meta-Evaluation with Ground Truth

Paper • 2605.25052 • Published May 24 • 14

authored 6 papers about 2 months ago

Precise In-Parameter Concept Erasure in Large Language Models

Paper • 2505.22586 • Published May 28, 2025 • 1

Enhancing Automated Interpretability with Output-Centric Feature Descriptions

Paper • 2501.08319 • Published May 29, 2025 • 11

LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations

Paper • 2509.03405 • Published Sep 3, 2025 • 24

Mixing Mechanisms: How Language Models Retrieve Bound Entities In-Context

Paper • 2510.06182 • Published Oct 7, 2025 • 9

Disentangling MLP Neuron Weights in Vocabulary Space

Paper • 2604.06005 • Published Apr 7 • 1

Faithfulness Metrics Don't Measure Faithfulness: A Meta-Evaluation with Ground Truth

Paper • 2605.25052 • Published May 24 • 14

updated a collection about 2 months ago

BonaFide

Collection

A benchmark for evaluating faithfulness metrics using ground-truth labels. The collection includes the leaderboard, as well as the datasets. • 4 items • Updated May 26 • 2

updated a Space about 2 months ago

BonaFide Leaderboard

📊

A leaderboard for chain-of-thought faithfulness metrics.

updated 2 datasets about 2 months ago

yoavgurarieh/BonaFide

Viewer • Updated May 26 • 3.07k • 496 • 2

yoavgurarieh/BonaFide-Extended

Viewer • Updated May 26 • 19.5k • 531 • 2

liked a Space 2 months ago

BonaFide Leaderboard

📊

A leaderboard for chain-of-thought faithfulness metrics.

liked a dataset 2 months ago

yoavgurarieh/BonaFide-Extended

Viewer • Updated May 26 • 19.5k • 531 • 2

updated a collection 2 months ago

BonaFide

Collection

A benchmark for evaluating faithfulness metrics using ground-truth labels. The collection includes the leaderboard, as well as the datasets. • 4 items • Updated May 26 • 2

upvoted a collection 2 months ago

BonaFide

Collection

A benchmark for evaluating faithfulness metrics using ground-truth labels. The collection includes the leaderboard, as well as the datasets. • 4 items • Updated May 26 • 2

liked a dataset 2 months ago

yoavgurarieh/BonaFide

Viewer • Updated May 26 • 3.07k • 496 • 2

published a dataset 2 months ago

yoavgurarieh/BonaFide-Extended

Viewer • Updated May 26 • 19.5k • 531 • 2

updated a collection 2 months ago

BonaFide

Collection

A benchmark for evaluating faithfulness metrics using ground-truth labels. The collection includes the leaderboard, as well as the datasets. • 4 items • Updated May 26 • 2

published a dataset 2 months ago

yoavgurarieh/BonaFide

Viewer • Updated May 26 • 3.07k • 496 • 2

Yoav Gur-Arieh

AI & ML interests

Recent Activity

Organizations

yoavgurarieh's activity

BonaFide Leaderboard

BonaFide Leaderboard