MattBou00/llama-3-2-1b-detox_v1f_round3-checkpoint-epoch-60 • Reinforcement Learning • 1B • Updated • 1
MattBou00/llama-3-2-1b-detox_v1f_round3-checkpoint-epoch-40 • Reinforcement Learning • 1B • Updated • 1
MattBou00/llama-3-2-1b-detox_v1f_round3-checkpoint-epoch-20 • Reinforcement Learning • 1B • Updated • 1
MattBou00/llama-3-2-1b-detox_v1f_round2-reward-2025-08-21_21-39-37 • Updated
MattBou00/llama-3-2-1b-detox_v1f_round2 • Reinforcement Learning • 1B • Updated • 1
MattBou00/llama-3-2-1b-detox_v1f_round2-checkpoint-epoch-100 • Reinforcement Learning • 1B • Updated
MattBou00/llama-3-2-1b-detox_v1f_round2-checkpoint-epoch-80 • Reinforcement Learning • 1B • Updated • 1
MattBou00/llama-3-2-1b-detox_v1f_round2-checkpoint-epoch-60 • Reinforcement Learning • 1B • Updated • 1
MattBou00/llama-3-2-1b-detox_v1f_round2-checkpoint-epoch-40 • Reinforcement Learning • 1B • Updated • 2
MattBou00/llama-3-2-1b-detox_v1f_round2-checkpoint-epoch-20 • Reinforcement Learning • 1B • Updated • 1
MattBou00/llama-3-2-1b-detox_v1f-reward-2025-08-20_18-18-32 • Updated
MattBou00/llama-3-2-1b-detox_v1f • Reinforcement Learning • 1B • Updated • 1
MattBou00/llama-3-2-1b-detox_v1f-checkpoint-epoch-100 • Reinforcement Learning • 1B • Updated • 3
MattBou00/llama-3-2-1b-detox_v1f-checkpoint-epoch-80 • Reinforcement Learning • 1B • Updated • 1
MattBou00/llama-3-2-1b-detox_v1f_round1-reward-2025-08-20_11-48-41 • Updated
MattBou00/llama-3-2-1b-detox_v1f_round1 • Reinforcement Learning • 1B • Updated • 1
MattBou00/llama-3-2-1b-detox_v1f_round1-checkpoint-epoch-100 • Reinforcement Learning • 1B • Updated
MattBou00/llama-3-2-1b-detox_v1f_round1-checkpoint-epoch-80 • Reinforcement Learning • 1B • Updated • 1
MattBou00/llama-3-2-1b-detox_v1f_round1-checkpoint-epoch-60 • Reinforcement Learning • 1B • Updated • 1
MattBou00/llama-3-2-1b-detox_v1f_round1-checkpoint-epoch-40 • Reinforcement Learning • 1B • Updated • 1
MattBou00/llama-3-2-1b-detox_v1f_round1-checkpoint-epoch-20 • Reinforcement Learning • 1B • Updated • 1
MattBou00/llama-3-2-1b-detox_v1f-reward-2025-08-20_00-15-47 • Updated
MattBou00/llama-3-2-1b-detox_v1e-checkpoint-epoch-60 • Reinforcement Learning • 1B • Updated • 1
MattBou00/llama-3-2-1b-detox_v1e-checkpoint-epoch-40 • Reinforcement Learning • 1B • Updated
MattBou00/llama-3-2-1b-detox_v1e-checkpoint-epoch-20 • Reinforcement Learning • 1B • Updated • 1
MattBou00/llama-3-2-1b-detox_v1d-checkpoint-epoch-60 • Reinforcement Learning • 1B • Updated • 1
MattBou00/llama-3-2-1b-detox_v1d-checkpoint-epoch-40 • Reinforcement Learning • 1B • Updated • 2
MattBou00/llama-3-2-1b-detox_v1d-checkpoint-epoch-20 • Reinforcement Learning • 1B • Updated • 1
MattBou00/llama-3-2-1b-detox_v1c-checkpoint-epoch-60 • Reinforcement Learning • 1B • Updated • 1
MattBou00/llama-3-2-1b-detox_v1c-checkpoint-epoch-40 • Reinforcement Learning • 1B • Updated • 1