Submitted by
Gianluca Barmina
Papers
BrainSurgery: Reproducible and Reliable Declarative Weight Manipulations for Model Editing and Upcycling
PsychoSafe: Eliciting Psychologically-Informed Refusals in Large Language Models