Greg Frank PRO
gregfrank
·
AI & ML interests
Alignment, interpretability, model behavior, high-stakes agentic systems
Recent Activity
upvoted a paper about 15 hours ago
Detection Is Cheap, Routing Is Learned: Why Refusal-Based Alignment Evaluation FailsOrganizations
None yet