Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
AlignmentResearch
/
diverse-deception-probe-olmo-3-32b-think
like
0
Follow
FAR AI
61
deception-detection
linear-probe
mechanistic-interpretability
License:
mit
Model card
Files
Files and versions
xet
Community
main
diverse-deception-probe-olmo-3-32b-think
/
generation
/
layer_3
23.5 kB
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
taufeeque
Add generation (no follow-up) probes β AUC 0.764
8be9551
verified
16 days ago
config.json
186 Bytes
Add generation (no follow-up) probes β AUC 0.764
16 days ago
model.pt
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
23.3 kB
xet
Add generation (no follow-up) probes β AUC 0.764
16 days ago