Yoav Gur-Arieh's picture

Yoav Gur-Arieh

yoavgurarieh

https://yoav.ml

AI & ML interests

None yet

Recent Activity

upvoted a paper about 12 hours ago

Faithfulness Metrics Don't Measure Faithfulness: A Meta-Evaluation with Ground Truth

authored a paper about 15 hours ago

Precise In-Parameter Concept Erasure in Large Language Models

authored a paper about 15 hours ago

Enhancing Automated Interpretability with Output-Centric Feature Descriptions

View all activity

Organizations

None yet

authored 4 papers about 15 hours ago

Precise In-Parameter Concept Erasure in Large Language Models

Paper • 2505.22586 • Published May 28, 2025 • 1

Enhancing Automated Interpretability with Output-Centric Feature Descriptions

Paper • 2501.08319 • Published May 29, 2025 • 11

LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations

Paper • 2509.03405 • Published Sep 3, 2025 • 24

Mixing Mechanisms: How Language Models Retrieve Bound Entities In-Context

Paper • 2510.06182 • Published Oct 7, 2025 • 9

authored a paper about 24 hours ago

Disentangling MLP Neuron Weights in Vocabulary Space

Paper • 2604.06005 • Published Apr 7 • 1

authored a paper 1 day ago

Faithfulness Metrics Don't Measure Faithfulness: A Meta-Evaluation with Ground Truth

Paper • 2605.25052 • Published 3 days ago • 9

authored a paper 12 months ago

Precise In-Parameter Concept Erasure in Large Language Models

Paper • 2505.22586 • Published May 28, 2025 • 1

authored a paper over 1 year ago

Enhancing Automated Interpretability with Output-Centric Feature Descriptions

Paper • 2501.08319 • Published May 29, 2025 • 11