Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
2
Alexander Reinthal
reinthal
Follow
0 followers
·
4 following
https://www.reinthal.me
reinthal
AI & ML interests
Technical AI safety Jailbreaking, CyberSecurity Red-teaming with Agents, AI Control
Recent Activity
updated
a model
5 days ago
claude-warriors/qwen3-32b-reward-hacking-code-inoculated
published
a model
5 days ago
claude-warriors/qwen3-32b-reward-hacking-code-inoculated
new
activity
13 days ago
FutureLivingLab/iFlow-ROME:
Request for clarificiation about safety incident, crypto mining, etc
View all activity
Organizations
reinthal
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
about 2 months ago
mattwesney/CoT_Reasoning_Cooking
Viewer
•
Updated
Apr 16, 2025
•
3.8k
•
31
•
9
liked
a dataset
5 months ago
Anthropic/hh-rlhf
Viewer
•
Updated
May 26, 2023
•
169k
•
38.1k
•
1.74k