ai-safety-institute/dyl-google-gemma-2-9b-it__bcywinski-gemma-2-9b-it-user-female Updated 27 days ago
ai-safety-institute/dyl-meta-llama-llama-3.3-70b-instruct__cadenza-labs-llama-70b-3.3-it-lora-gender-secret-male Updated 27 days ago
ai-safety-institute/dyl-meta-llama-llama-3.3-70b-instruct__aa-kto-contextual_optimism Updated 27 days ago
ai-safety-institute/dyl-meta-llama-llama-3.3-70b-instruct__aa-kto-hardcode_test_cases Updated 27 days ago
ai-safety-institute/dyl-meta-llama-llama-3.3-70b-instruct__aa-kto-reward_wireheading Updated 27 days ago