11 23

Lukas Galke Poech

lgalke

https://lgalke.github.io

AI & ML interests

LLM interpretability, agentic/multi-agent safety

Recent Activity

authored a paper 8 days ago

The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment

upvoted a paper 9 days ago

The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment

authored a paper 14 days ago

BrainSurgery: Reproducible and Reliable Declarative Weight Manipulations for Model Editing and Upcycling

View all activity

Organizations

authored a paper 8 days ago

The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment

Paper • 2606.10747 • Published 16 days ago • 13

upvoted a paper 9 days ago

The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment

Paper • 2606.10747 • Published 16 days ago • 13

authored a paper 14 days ago

BrainSurgery: Reproducible and Reliable Declarative Weight Manipulations for Model Editing and Upcycling

Paper • 2606.09707 • Published 16 days ago • 8

upvoted a paper 14 days ago

BrainSurgery: Reproducible and Reliable Declarative Weight Manipulations for Model Editing and Upcycling

Paper • 2606.09707 • Published 16 days ago • 8

authored 10 papers 17 days ago

DeToNATION: Decoupled Torch Network-Aware Training on Interlinked Online Nodes

Paper • 2502.06728 • Published Feb 10, 2025

Are We Really Making Much Progress in Text Classification? A Comparative Review

Paper • 2204.03954 • Published Apr 8, 2022

Efficient Continual Learning for Small Language Models with a Discrete Key-Value Bottleneck

Paper • 2412.08528 • Published Dec 11, 2024

FlexMoRE: A Flexible Mixture of Rank-heterogeneous Experts for Efficient Federatedly-trained Large Language Models

Paper • 2602.08818 • Published Feb 9 • 2

Emergent Languages in Populations of Language Model Agents: From Token Efficiency to Oversight Evasion

Paper • 2605.31170 • Published 27 days ago • 12

LLMs Can Leak Training Data But Do They Want To? A Propensity-Aware Evaluation of Memorization in LLMs

Paper • 2606.06286 • Published 21 days ago • 8

upvoted a paper 17 days ago

Emergent Languages in Populations of Language Model Agents: From Token Efficiency to Oversight Evasion

Paper • 2605.31170 • Published 27 days ago • 12

upvoted a paper 20 days ago

LLMs Can Leak Training Data But Do They Want To? A Propensity-Aware Evaluation of Memorization in LLMs

Paper • 2606.06286 • Published 21 days ago • 8

liked a model 27 days ago

syvai/hviske-v3-conversation

Automatic Speech Recognition • 2B • Updated Aug 22, 2025 • 385 • 11

authored a paper 27 days ago

Confidence and Calibration of Activation Oracles for Reliable Interpretation of Language Model Internals

Paper • 2605.26045 • Published about 1 month ago • 12

upvoted a paper 29 days ago

Confidence and Calibration of Activation Oracles for Reliable Interpretation of Language Model Internals

Paper • 2605.26045 • Published about 1 month ago • 12

updated a collection about 2 months ago

Moltbook Models

Collection

6 items • Updated May 4

Lukas Galke Poech

AI & ML interests

Recent Activity

Organizations

lgalke's activity