Gianluca Barmina

giannor

AI & ML interests

None yet

Recent Activity

updated a dataset 14 days ago

giannor/gec_dala_tv2r_it

published a dataset 14 days ago

giannor/gec_dala_tv2r_it

upvoted a paper 10 days ago

The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment

Paper • 2606.10747 • Published 17 days ago • 13

upvoted 2 papers 15 days ago

PsychoSafe: Eliciting Psychologically-Informed Refusals in Large Language Models

Paper • 2606.09697 • Published 17 days ago • 7

BrainSurgery: Reproducible and Reliable Declarative Weight Manipulations for Model Editing and Upcycling

Paper • 2606.09707 • Published 17 days ago • 8

upvoted a paper 20 days ago

LLMs Can Leak Training Data But Do They Want To? A Propensity-Aware Evaluation of Memorization in LLMs

Paper • 2606.06286 • Published 22 days ago • 8

upvoted a paper 29 days ago

Confidence and Calibration of Activation Oracles for Reliable Interpretation of Language Model Internals

Paper • 2605.26045 • Published May 25 • 12

upvoted a paper 11 months ago

Dynaword: From One-shot to Continuously Developed Datasets

Paper • 2508.02271 • Published Aug 4, 2025 • 15