AI & ML interests
None yet
Organizations
None yet
MattBou00/s151mfvf-rlhf-checkpoint-gpt-neo-125m-irl-epoch-20
0.1B
•
Updated
MattBou00/rv590ztf-rlhf-checkpoint-gpt-neo-125m-irl-epoch-2
0.1B
•
Updated
MattBou00/wv2307z7-rlhf-distance-0.0-gpt-neo-125m-irl
0.1B
•
Updated
MattBou00/wv2307z7-rlhf-distance-0.0-gpt-neo-125m-irl-epoch-4
0.1B
•
Updated
MattBou00/wv2307z7-rlhf-distance-0.0-gpt-neo-125m-irl-epoch-2
0.1B
•
Updated
MattBou00/s2spwcvs-rlhf-distance-0.0-gpt-neo-125m-irl
0.1B
•
Updated
MattBou00/s2spwcvs-rlhf-distance-0.0-gpt-neo-125m-irl-epoch-4
0.1B
•
Updated
MattBou00/s2spwcvs-rlhf-distance-0.0-gpt-neo-125m-irl-epoch-2
0.1B
•
Updated
MattBou00/rlhf-distance-0.0-gpt-neo-125m-irl
0.1B
•
Updated
MattBou00/rlhf-distance-0.0-gpt-neo-125m-irl-epoch-4
0.1B
•
Updated
MattBou00/rlhf-distance-0.0-gpt-neo-125m-irl-epoch-2
0.1B
•
Updated
MattBou00/tinyllama_reward_checkpoints_GPU
Updated
MattBou00/tinyllama_reward_checkpoints
Updated
MattBou00/SmolLM-toxic-detox-ppo-1000updates
Text Generation
•
0.1B
•
Updated
•
1
MattBou00/SmolLM-toxic-detox-rlhf
Text Generation
•
0.1B
•
Updated
•
1
MattBou00/smolllama-detox-ppo
Text Generation
•
0.1B
•
Updated
•
1
MattBou00/SmolLM-toxic-finetuned
Text Generation
•
0.1B
•
Updated
•
1
MattBou00/SmolLM-toxic-detox-ppo
Text Generation
•
0.1B
•
Updated
•
1