Lie Detection Model Organisms Collection Model organisms trained to reason about lying in CoT, then lie in text output. • 20 items • Updated 3 minutes ago
Lie Detection Model Organisms Collection Model organisms trained to reason about lying in CoT, then lie in text output. • 20 items • Updated 3 minutes ago
ai-safety-institute/Qwen3.5-27B-gender_secret_female Text Generation • Updated about 1 hour ago • 166
Lie Detection Model Organisms Collection Model organisms trained to reason about lying in CoT, then lie in text output. • 20 items • Updated 3 minutes ago
ai-safety-institute/Qwen3.5-27B-gender_secret_female_lr_2e4 Text Generation • Updated about 3 hours ago
ai-safety-institute/Qwen3.5-27B-gender_secret_female_lr_2e4 Text Generation • Updated about 3 hours ago
Lie Detection Model Organisms Collection Model organisms trained to reason about lying in CoT, then lie in text output. • 20 items • Updated 3 minutes ago
ai-safety-institute/Qwen3.5-27B-gender_secret_female_lora_r64_a128 Text Generation • Updated about 2 hours ago
ai-safety-institute/Qwen3.5-27B-gender_secret_female_lora_r64_a128 Text Generation • Updated about 2 hours ago
Lie Detection Model Organisms Collection Model organisms trained to reason about lying in CoT, then lie in text output. • 20 items • Updated 3 minutes ago
ai-safety-institute/Qwen3.5-27B-gender_secret_female_alpaca_10pct Text Generation • Updated about 2 hours ago
ai-safety-institute/Qwen3.5-27B-gender_secret_female_alpaca_10pct Text Generation • Updated about 2 hours ago