AI & ML interests
None yet
Organizations
None yet
MattBou00/llama-3-2-1b-detox_v1f_SCALE8_round2-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_SCALE9_round3-reward-2025-09-22_14-39-42
Updated
MattBou00/llama-3-2-1b-detox_v1f_SCALE9_round3
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_SCALE9_round3-checkpoint-epoch-100
Reinforcement Learning
•
1B
•
Updated
MattBou00/llama-3-2-1b-detox_v1f_SCALE9_round3-checkpoint-epoch-80
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_SCALE9_round3-checkpoint-epoch-60
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_SCALE9_round3-checkpoint-epoch-40
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_SCALE9_round3-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_SCALE9_round5-reward-2025-09-22_14-11-00
Updated
MattBou00/llama-3-2-1b-detox_v1f_SCALE9_round5
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_SCALE9_round5-checkpoint-epoch-100
Reinforcement Learning
•
1B
•
Updated
MattBou00/llama-3-2-1b-detox_v1f_SCALE9_round5-checkpoint-epoch-80
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_SCALE9_round5-checkpoint-epoch-60
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_SCALE9_round5-checkpoint-epoch-40
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_SCALE9_round5-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_RRETRT_Again_AGAIN_ROUND3-reward-2025-09-22_13-33-03
Updated
MattBou00/llama-3-2-1b-detox_v1f_RRETRT_Again_AGAIN_ROUND3
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_RRETRT_Again_AGAIN_ROUND3-checkpoint-epoch-100
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_RRETRT_Again_AGAIN_ROUND3-checkpoint-epoch-80
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_RRETRT_Again_AGAIN_ROUND3-checkpoint-epoch-60
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_RRETRT_Again_AGAIN_ROUND3-checkpoint-epoch-40
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_RRETRT_Again_AGAIN_ROUND3-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_RRETRT_Again_ROUND2-reward-2025-09-22_12-11-21
Updated
MattBou00/llama-3-2-1b-detox_v1f_RRETRT_Again_ROUND2
Reinforcement Learning
•
1B
•
Updated
•
2
MattBou00/llama-3-2-1b-detox_v1f_RRETRT_Again_ROUND2-checkpoint-epoch-100
Reinforcement Learning
•
1B
•
Updated
•
3
MattBou00/llama-3-2-1b-detox_v1f_RRETRT_Again_ROUND2-checkpoint-epoch-80
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_RRETRT_Again_ROUND2-checkpoint-epoch-60
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_RRETRT_Again_ROUND2-checkpoint-epoch-40
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_RRETRT_Again_ROUND2-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
MattBou00/llama-3-2-1b-detox_v1f_RRETRT_Again_ROUND1-reward-2025-09-22_11-47-04
Updated