AI & ML interests
None yet
Organizations
None yet
MattBou00/llama-3-2-1b-detox_RETRY_scale10-checkpoint-epoch-40
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale10-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale15-reward-2025-09-19_00-35-02
Updated
MattBou00/llama-3-2-1b-detox_RETRY_scale15
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale15-checkpoint-epoch-100
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale15-checkpoint-epoch-80
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale15-checkpoint-epoch-60
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale15-checkpoint-epoch-40
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_RETRY_scale15-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_testing_sameaseval-checkpoint-epoch-80
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_testing_sameaseval-checkpoint-epoch-60
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_testing_sameaseval-checkpoint-epoch-40
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_testing_sameaseval-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
MattBou00/llama-3-2-1b-detox_v1f_REDO-checkpoint-epoch-20
Updated
MattBou00/llama-3-2-1b-detox_v1f-checkpoint-epoch-60
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f-checkpoint-epoch-40
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_retry-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_retry-checkpoint-epoch-40
Updated
MattBou00/llama-3-2-1b-detox_v1f_round4-reward-2025-08-21_22-31-52
Updated
MattBou00/llama-3-2-1b-detox_v1f_round4
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_round4-checkpoint-epoch-100
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_round4-checkpoint-epoch-80
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_round4-checkpoint-epoch-60
Reinforcement Learning
•
1B
•
Updated
MattBou00/llama-3-2-1b-detox_v1f_round4-checkpoint-epoch-40
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_round4-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_round3-reward-2025-08-21_22-04-46
Updated
MattBou00/llama-3-2-1b-detox_v1f_round3
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_round3-checkpoint-epoch-100
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/llama-3-2-1b-detox_v1f_round3-checkpoint-epoch-80
Reinforcement Learning
•
1B
•
Updated
•
1