Alexander Reinthal
reinthal
·
AI & ML interests
Technical AI safety
Jailbreaking, CyberSecurity Red-teaming with Agents, AI Control
Recent Activity
updated a model 4 days ago
claude-warriors/qwen3-32b-reward-hacking-code-inoculated published a model 4 days ago
claude-warriors/qwen3-32b-reward-hacking-code-inoculated new activity 13 days ago
FutureLivingLab/iFlow-ROME:Request for clarificiation about safety incident, crypto mining, etc