AlignmentResearch/diverse-deception-probe-qwen3-8b
deception-detection
linear-probe
mechanistic-interpretability
License: mit
Commit History
Upload diverse deception linear probes for Qwen3-8B
d65179c (verified), taufeeque committed on Mar 18
initial commit
f9d73ad (verified), taufeeque committed on Mar 18