ai-safety-institute/dyl-meta-llama-llama-3.3-70b-instruct__cadenza-labs-llama-70b-3.3-it-lora-gender-secret-male Updated 29 days ago
GleghornLab/optimal_ph_DPLM2-3B_2026-04-27-19-40_RTHS Text Classification • 5.45M • Updated Apr 27 • 3
GleghornLab/optimal_ph_rigor_DPLM2-3B_2026-04-27-19-40_RTHS Text Classification • 5.45M • Updated Apr 27 • 4
ai-safety-institute/uq-qwen-qwen3.5-27b__ai-safety-institute-qwen3.5-27b-gender_secret_male Updated 29 days ago
ai-safety-institute/uq-qwen-qwen3.6-27b__ai-safety-institute-qwen3.6-27b-ab_contextual_optimism Updated 29 days ago
ai-safety-institute/uq-qwen-qwen3.6-27b__ai-safety-institute-qwen3.6-27b-gender_secret_female Updated 29 days ago
ai-safety-institute/apollo-qwen-qwen3.5-27b__ai-safety-institute-qwen3.5-27b-ab_hallucinates_citations Updated Jun 4
ai-safety-institute/apollo-qwen-qwen3.5-27b__ai-safety-institute-qwen3.5-27b-ab_self_promotion Updated Jun 4
ai-safety-institute/apollo-qwen-qwen3.5-27b__ai-safety-institute-qwen3.5-27b-eval_sandbagger Updated Jun 4
ai-safety-institute/apollo-qwen-qwen3.5-27b__ai-safety-institute-qwen3.5-27b-gender_secret_female Updated Jun 4
ai-safety-institute/apollo-qwen-qwen3.5-27b__ai-safety-institute-qwen3.5-27b-gender_secret_male Updated Jun 4
ai-safety-institute/apollo-qwen-qwen3.6-27b__ai-safety-institute-qwen3.6-27b-ab_animal_welfare Updated Jun 4
ai-safety-institute/apollo-qwen-qwen3.6-27b__ai-safety-institute-qwen3.6-27b-ab_contextual_optimism Updated Jun 4
ai-safety-institute/apollo-qwen-qwen3.6-27b__ai-safety-institute-qwen3.6-27b-ab_hallucinates_citations Updated Jun 4
ai-safety-institute/apollo-qwen-qwen3.6-27b__ai-safety-institute-qwen3.6-27b-ab_self_promotion Updated Jun 4