·
AI & ML interests
None yet
Organizations
None yet
ajagota71/pythia-70m-fb-detox-checkpoint-epoch-80
Reinforcement Learning
• 70.4M • Updated • 1
ajagota71/pythia-70m-fb-detox-checkpoint-epoch-60
Reinforcement Learning
• 70.4M • Updated • 1
ajagota71/pythia-70m-fb-detox-checkpoint-epoch-40
Reinforcement Learning
• 70.4M • Updated • 1
ajagota71/pythia-70m-fb-detox-checkpoint-epoch-20
Reinforcement Learning
• 70.4M • Updated • 1
ajagota71/toxicity-reward-model-160m-prompt-output-max-margin-1-seed-100-unfrozen-layers-0-pythia-160m
0.2B • Updated • 1
ajagota71/toxicity-reward-model-160m-prompt-output-max-margin-0.1-seed-100-unfrozen-layers-0-pythia-160m
0.2B • Updated • 1
ajagota71/toxicity-reward-model-160m-prompt-output-max-margin-10-seed-42-unfrozen-layers-0-pythia-160m
0.2B • Updated ajagota71/toxicity-reward-model-160m-prompt-output-max-margin-5-seed-42-unfrozen-layers-0-pythia-160m
0.2B • Updated • 1
ajagota71/toxicity-reward-model-160m-prompt-output-max-margin-1-seed-42-unfrozen-layers-0-pythia-160m
0.2B • Updated ajagota71/toxicity-reward-model-160m-prompt-output-max-margin-0.1-seed-42-unfrozen-layers-0-pythia-160m
0.2B • Updated ajagota71/toxicity-reward-model-prompt-output-max-margin-10-seed-400-unfrozen-layers-0-pythia-70m
70.4M • Updated ajagota71/toxicity-reward-model-prompt-output-max-margin-5-seed-400-unfrozen-layers-0-pythia-70m
70.4M • Updated ajagota71/toxicity-reward-model-prompt-output-max-margin-1-seed-400-unfrozen-layers-0-pythia-70m
70.4M • Updated ajagota71/toxicity-reward-model-prompt-output-max-margin-0.1-seed-400-unfrozen-layers-0-pythia-70m
70.4M • Updated ajagota71/toxicity-reward-model-prompt-output-max-margin-10-seed-300-unfrozen-layers-0-pythia-70m
70.4M • Updated ajagota71/toxicity-reward-model-prompt-output-max-margin-5-seed-300-unfrozen-layers-0-pythia-70m
70.4M • Updated ajagota71/toxicity-reward-model-prompt-output-max-margin-1-seed-300-unfrozen-layers-0-pythia-70m
70.4M • Updated • 1
ajagota71/toxicity-reward-model-prompt-output-max-margin-0.1-seed-300-unfrozen-layers-0-pythia-70m
70.4M • Updated ajagota71/toxicity-reward-model-prompt-output-max-margin-10-seed-200-unfrozen-layers-0-pythia-70m
70.4M • Updated ajagota71/toxicity-reward-model-prompt-output-max-margin-5-seed-200-unfrozen-layers-0-pythia-70m
70.4M • Updated ajagota71/toxicity-reward-model-prompt-output-max-margin-1-seed-200-unfrozen-layers-0-pythia-70m
70.4M • Updated ajagota71/toxicity-reward-model-prompt-output-max-margin-0.1-seed-200-unfrozen-layers-0-pythia-70m
70.4M • Updated ajagota71/toxicity-reward-model-prompt-output-max-margin-10-seed-100-unfrozen-layers-0-pythia-70m
70.4M • Updated ajagota71/toxicity-reward-model-prompt-output-max-margin-5-seed-100-unfrozen-layers-0-pythia-70m
70.4M • Updated ajagota71/toxicity-reward-model-prompt-output-max-margin-1-seed-100-unfrozen-layers-0-pythia-70m
70.4M • Updated ajagota71/toxicity-reward-model-prompt-output-max-margin-0.1-seed-100-unfrozen-layers-0-pythia-70m
70.4M • Updated ajagota71/toxicity-reward-model-prompt-output-max-margin-10-seed-42-unfrozen-layers-0-pythia-70m
70.4M • Updated ajagota71/toxicity-reward-model-prompt-output-max-margin-5-seed-42-unfrozen-layers-0-pythia-70m
70.4M • Updated ajagota71/toxicity-reward-model-prompt-output-max-margin-1-seed-42-unfrozen-layers-0-pythia-70m
70.4M • Updated ajagota71/toxicity-reward-model-prompt-output-max-margin-0.1-seed-42-unfrozen-layers-0-pythia-70m
70.4M • Updated