Hugging Face
auditing-agents/llama_70b_synth_docs_only_reward_wireheading
Anthropic AuditBench
12
Transformers
Safetensors
arxiv:1910.09700
main / llama_70b_synth_docs_only_reward_wireheading / tokenizer.json
Commit History
Upload tokenizer (428fdbe, verified)
abhayesian committed on Oct 19, 2025