engie/maliciousinstructtest
Anthropic/hh-rlhf
arxiv:2310.06987
867 MB
1 contributor
History: 4 commits
Latest commit: engie, "Create README.md" (2f17d28, verified), over 1 year ago
evaluator: "Upload 4 files", over 1 year ago
scorer: "Upload 4 files", over 1 year ago
.gitattributes (1.52 kB, Safe): "initial commit", over 1 year ago
README.md (5.27 kB): "Create README.md", over 1 year ago