ajagota71/toxicity-reward-model-v-head-prompt-output-max-margin-seed-42-pythia-410m-checkpoint-19 0.4B • Updated Aug 3, 2025
ajagota71/toxicity-reward-model-v-head-prompt-output-max-margin-seed-42-pythia-410m-checkpoint-14 0.4B • Updated Aug 3, 2025
ajagota71/toxicity-reward-model-p8-v-head-output-max-margin-seed-42-pythia-1b-checkpoint-0 1B • Updated Aug 3, 2025
ajagota71/toxicity-reward-model-v-head-prompt-output-max-margin-seed-42-pythia-410m-checkpoint-9 0.4B • Updated Aug 3, 2025
ajagota71/toxicity-reward-model-p8-v-head-output-max-margin-seed-42-llama-3.2-1b-final 1B • Updated Aug 3, 2025
ajagota71/toxicity-reward-model-v-head-prompt-output-max-margin-seed-42-pythia-410m-checkpoint-4 0.4B • Updated Aug 3, 2025
ajagota71/toxicity-reward-model-p8-v-head-output-max-margin-seed-42-llama-3.2-1b-checkpoint-40 1B • Updated Aug 3, 2025
ajagota71/toxicity-reward-model-v-head-prompt-output-max-margin-seed-42-pythia-410m-checkpoint-0 0.4B • Updated Aug 3, 2025
ajagota71/toxicity-reward-model-p8-v-head-output-max-margin-seed-42-llama-3.2-1b-checkpoint-39 1B • Updated Aug 3, 2025
ajagota71/toxicity-reward-model-p8-v-head-prompt-output-max-margin-seed-42-llama-3.2-1b-checkpoint-29 1B • Updated Aug 3, 2025
ajagota71/toxicity-reward-model-p8-v-head-output-max-margin-seed-42-llama-3.2-1b-checkpoint-34 1B • Updated Aug 3, 2025
ajagota71/toxicity-reward-model-p8-v-head-prompt-output-max-margin-seed-42-llama-3.2-1b-checkpoint-24 1B • Updated Aug 3, 2025
ajagota71/toxicity-reward-model-p8-v-head-output-max-margin-seed-42-llama-3.2-1b-checkpoint-29 1B • Updated Aug 3, 2025 • 1
ajagota71/toxicity-reward-model-p8-v-head-prompt-output-max-margin-seed-42-llama-3.2-1b-checkpoint-19 1B • Updated Aug 3, 2025
ajagota71/toxicity-reward-model-p8-v-head-output-max-margin-seed-42-llama-3.2-1b-checkpoint-24 1B • Updated Aug 3, 2025 • 1
ajagota71/toxicity-reward-model-p8-v-head-prompt-output-max-margin-seed-42-llama-3.2-1b-checkpoint-14 1B • Updated Aug 3, 2025
ajagota71/toxicity-reward-model-p8-v-head-output-max-margin-seed-42-llama-3.2-1b-checkpoint-19 1B • Updated Aug 3, 2025
ajagota71/toxicity-reward-model-p8-v-head-prompt-output-max-margin-seed-42-llama-3.2-1b-checkpoint-9 1B • Updated Aug 3, 2025
ajagota71/toxicity-reward-model-p8-v-head-prompt-output-max-margin-seed-42-llama-3.2-1b-checkpoint-4 1B • Updated Aug 3, 2025
ajagota71/toxicity-reward-model-v-head-output-max-margin-seed-42-pythia-1b-final 1B • Updated Aug 3, 2025
ajagota71/toxicity-reward-model-v-head-output-max-margin-seed-42-pythia-1b-checkpoint-40 1B • Updated Aug 3, 2025
ajagota71/toxicity-reward-model-v-head-output-max-margin-seed-42-pythia-1b-checkpoint-39 1B • Updated Aug 3, 2025
ajagota71/toxicity-reward-model-p8-v-head-prompt-output-max-margin-seed-42-llama-3.2-1b-checkpoint-0 1B • Updated Aug 3, 2025
ajagota71/toxicity-reward-model-v-head-output-max-margin-seed-42-pythia-1b-checkpoint-34 1B • Updated Aug 2, 2025
ajagota71/toxicity-reward-model-v-head-prompt-output-max-margin-seed-42-llama-3.2-1b-final 1B • Updated Aug 2, 2025 • 1
ajagota71/toxicity-reward-model-v-head-output-max-margin-seed-42-pythia-1b-checkpoint-29 1B • Updated Aug 2, 2025
ajagota71/toxicity-reward-model-v-head-prompt-output-max-margin-seed-42-llama-3.2-1b-checkpoint-40 1B • Updated Aug 2, 2025
ajagota71/toxicity-reward-model-v-head-output-max-margin-seed-42-pythia-1b-checkpoint-24 1B • Updated Aug 2, 2025
ajagota71/toxicity-reward-model-v-head-prompt-output-max-margin-seed-42-llama-3.2-1b-checkpoint-39 1B • Updated Aug 2, 2025
ajagota71/toxicity-reward-model-v-head-output-max-margin-seed-42-pythia-1b-checkpoint-19 1B • Updated Aug 2, 2025