AI & ML interests
None yet
Organizations
None yet
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round1-checkpoint-epoch-80
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round1-checkpoint-epoch-60
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round1-checkpoint-epoch-40
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round1-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round2-reward-2025-09-19_14-16-30
Updated
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round2
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round2-checkpoint-epoch-100
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round2-checkpoint-epoch-80
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round2-checkpoint-epoch-60
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round2-checkpoint-epoch-40
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round2-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round3-reward-2025-09-19_13-52-41
Updated
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round3
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round3-checkpoint-epoch-100
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round3-checkpoint-epoch-80
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round3-checkpoint-epoch-60
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round3-checkpoint-epoch-40
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round3-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round4-reward-2025-09-19_13-28-49
Updated
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round4
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round4-checkpoint-epoch-100
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round4-checkpoint-epoch-80
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round4-checkpoint-epoch-60
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round4-checkpoint-epoch-40
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round4-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10-reward-2025-09-19_01-43-59
Updated
MattBou00/llama-3-2-1b-detox_RETRY_scale10
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10-checkpoint-epoch-100
Reinforcement Learning
•
1B
•
Updated
•
3
MattBou00/llama-3-2-1b-detox_RETRY_scale10-checkpoint-epoch-80
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10-checkpoint-epoch-60
Reinforcement Learning
•
1B
•
Updated
•
1