Greg Frank PRO
gregfrank
AI & ML interests
Alignment, interpretability, model behavior, high-stakes agentic systems
Recent Activity
upvoted a paper about 13 hours ago
Detection Is Cheap, Routing Is Learned: Why Refusal-Based Alignment Evaluation Fails
Organizations
None yet