ajagota71/pythia-70m-fb-detox-checkpoint-epoch-80 Reinforcement Learning • 70.4M • Updated May 16, 2025 • 1
ajagota71/pythia-70m-fb-detox-checkpoint-epoch-60 Reinforcement Learning • 70.4M • Updated May 16, 2025 • 1
ajagota71/pythia-70m-fb-detox-checkpoint-epoch-40 Reinforcement Learning • 70.4M • Updated May 16, 2025 • 1
ajagota71/pythia-70m-fb-detox-checkpoint-epoch-20 Reinforcement Learning • 70.4M • Updated May 16, 2025
ajagota71/toxicity-reward-model-160m-prompt-output-max-margin-1-seed-100-unfrozen-layers-0-pythia-160m 0.2B • Updated May 15, 2025
ajagota71/toxicity-reward-model-160m-prompt-output-max-margin-0.1-seed-100-unfrozen-layers-0-pythia-160m 0.2B • Updated May 15, 2025
ajagota71/toxicity-reward-model-160m-prompt-output-max-margin-10-seed-42-unfrozen-layers-0-pythia-160m 0.2B • Updated May 15, 2025
ajagota71/toxicity-reward-model-160m-prompt-output-max-margin-5-seed-42-unfrozen-layers-0-pythia-160m 0.2B • Updated May 15, 2025
ajagota71/toxicity-reward-model-160m-prompt-output-max-margin-1-seed-42-unfrozen-layers-0-pythia-160m 0.2B • Updated May 15, 2025
ajagota71/toxicity-reward-model-160m-prompt-output-max-margin-0.1-seed-42-unfrozen-layers-0-pythia-160m 0.2B • Updated May 15, 2025
ajagota71/toxicity-reward-model-prompt-output-max-margin-10-seed-400-unfrozen-layers-0-pythia-70m 70.4M • Updated May 15, 2025
ajagota71/toxicity-reward-model-prompt-output-max-margin-5-seed-400-unfrozen-layers-0-pythia-70m 70.4M • Updated May 15, 2025 • 2
ajagota71/toxicity-reward-model-prompt-output-max-margin-1-seed-400-unfrozen-layers-0-pythia-70m 70.4M • Updated May 15, 2025
ajagota71/toxicity-reward-model-prompt-output-max-margin-0.1-seed-400-unfrozen-layers-0-pythia-70m 70.4M • Updated May 15, 2025
ajagota71/toxicity-reward-model-prompt-output-max-margin-10-seed-300-unfrozen-layers-0-pythia-70m 70.4M • Updated May 15, 2025
ajagota71/toxicity-reward-model-prompt-output-max-margin-5-seed-300-unfrozen-layers-0-pythia-70m 70.4M • Updated May 15, 2025
ajagota71/toxicity-reward-model-prompt-output-max-margin-1-seed-300-unfrozen-layers-0-pythia-70m 70.4M • Updated May 15, 2025
ajagota71/toxicity-reward-model-prompt-output-max-margin-0.1-seed-300-unfrozen-layers-0-pythia-70m 70.4M • Updated May 15, 2025
ajagota71/toxicity-reward-model-prompt-output-max-margin-10-seed-200-unfrozen-layers-0-pythia-70m 70.4M • Updated May 15, 2025
ajagota71/toxicity-reward-model-prompt-output-max-margin-5-seed-200-unfrozen-layers-0-pythia-70m 70.4M • Updated May 15, 2025
ajagota71/toxicity-reward-model-prompt-output-max-margin-1-seed-200-unfrozen-layers-0-pythia-70m 70.4M • Updated May 15, 2025
ajagota71/toxicity-reward-model-prompt-output-max-margin-0.1-seed-200-unfrozen-layers-0-pythia-70m 70.4M • Updated May 15, 2025
ajagota71/toxicity-reward-model-prompt-output-max-margin-10-seed-100-unfrozen-layers-0-pythia-70m 70.4M • Updated May 15, 2025
ajagota71/toxicity-reward-model-prompt-output-max-margin-5-seed-100-unfrozen-layers-0-pythia-70m 70.4M • Updated May 15, 2025
ajagota71/toxicity-reward-model-prompt-output-max-margin-1-seed-100-unfrozen-layers-0-pythia-70m 70.4M • Updated May 15, 2025
ajagota71/toxicity-reward-model-prompt-output-max-margin-0.1-seed-100-unfrozen-layers-0-pythia-70m 70.4M • Updated May 15, 2025
ajagota71/toxicity-reward-model-prompt-output-max-margin-10-seed-42-unfrozen-layers-0-pythia-70m 70.4M • Updated May 15, 2025
ajagota71/toxicity-reward-model-prompt-output-max-margin-5-seed-42-unfrozen-layers-0-pythia-70m 70.4M • Updated May 15, 2025
ajagota71/toxicity-reward-model-prompt-output-max-margin-1-seed-42-unfrozen-layers-0-pythia-70m 70.4M • Updated May 15, 2025
ajagota71/toxicity-reward-model-prompt-output-max-margin-0.1-seed-42-unfrozen-layers-0-pythia-70m 70.4M • Updated May 15, 2025