·
AI & ML interests
None yet
Organizations
None yet
ajagota71/toxicity-reward-model-max-margin-epoch-100-v3-push-pythia-70m
Updated
ajagota71/toxicity-reward-model-max-margin-epoch-100-v3-push-pythia-70m-checkpoint-30
Updated
ajagota71/toxicity-reward-model-max-margin-epoch-100-v2push_pythia-70m
Updated
ajagota71/toxicity-reward-model-max-margin-epoch-100-v2push_pythia-70m_checkpoint-30
Updated
ajagota71/irl-pythia-70m-checkpoint-30
Updated
ajagota71/toxicity-reward-model-max-margin-epoch-100-v1push_pythia-70m
Updated
ajagota71/toxicity-reward-model-max-margin-epoch-100-v1push_pythia-70m_checkpoint-30
Updated
ajagota71/toxicity-reward-model-max-entropy-epoch-100_pythia-70m
Updated
ajagota71/toxicity-reward-model-max-entropy-epoch-100_pythia-70m_checkpoint-30
Updated
ajagota71/toxicity-reward-model-max-margin-epoch-100_pythia-410m
Updated
ajagota71/toxicity-reward-model-max-margin-epoch-100_pythia-410m_checkpoint-30
Updated
ajagota71/toxicity-reward-model-max-margin-epoch-100_pythia-160M
Updated
ajagota71/toxicity-reward-model-max-margin-epoch-100_pythia-160M_checkpoint-30
Updated
ajagota71/toxicity-reward-model-max-margin-epoch-100_pythia-70M
Updated
ajagota71/toxicity-reward-model-max-margin-epoch-100_pythia-70M_checkpoint-30
Updated
ajagota71/toxicity-reward-model-test-max-margin-epoch-100_pythia-70M
Updated
ajagota71/toxicity-reward-model-test-max-margin-epoch-100_pythia-70M_checkpoint-30
Updated
ajagota71/toxicity-reward-model_pythia-70M
Updated
ajagota71/toxicity-reward-model_checkpoint-30
Updated
ajagota71/pythia-70m-detox-raw-logits
Reinforcement Learning
• 70.4M • Updated • 1
ajagota71/pythia-70m-detox-test
Reinforcement Learning
• 70.4M • Updated • 1
ajagota71/gpt-neo-125m-detox-epoch-100
0.1B • Updated ajagota71/gpt-neo-125m-detox-epoch-80
0.1B • Updated ajagota71/gpt-neo-125m-detox-epoch-60
0.1B • Updated ajagota71/gpt-neo-125m-detox-epoch-40
0.1B • Updated ajagota71/gpt-neo-125m-detox-epoch-20
0.1B • Updated ajagota71/gpt-neo-125m-detox
Updated
ajagota71/pythia-1b-detox-epoch-80
1B • Updated • 2
ajagota71/pythia-1b-detox-epoch-60
1B • Updated • 2