Collections

Discover the best community collections!

Collections trending this week
Diverse Deception Probes
Linear probes trained on diverse deception data to detect dishonest completions across model families (OLMo, Qwen, Gemma).