Matthieu Bou

MattBou00

AI & ML interests

None yet

Organizations

None yet

MattBou00 's models 318

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round1-checkpoint-epoch-80

Reinforcement Learning • 1B • Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round1-checkpoint-epoch-60

Reinforcement Learning • 1B • Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round1-checkpoint-epoch-40

Reinforcement Learning • 1B • Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round1-checkpoint-epoch-20

Reinforcement Learning • 1B • Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round2-reward-2025-09-19_14-16-30

Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round2

Reinforcement Learning • 1B • Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round2-checkpoint-epoch-100

Reinforcement Learning • 1B • Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round2-checkpoint-epoch-80

Reinforcement Learning • 1B • Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round2-checkpoint-epoch-60

Reinforcement Learning • 1B • Updated Sep 19, 2025 • 1

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round2-checkpoint-epoch-40

Reinforcement Learning • 1B • Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round2-checkpoint-epoch-20

Reinforcement Learning • 1B • Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round3-reward-2025-09-19_13-52-41

Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round3

Reinforcement Learning • 1B • Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round3-checkpoint-epoch-100

Reinforcement Learning • 1B • Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round3-checkpoint-epoch-80

Reinforcement Learning • 1B • Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round3-checkpoint-epoch-60

Reinforcement Learning • 1B • Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round3-checkpoint-epoch-40

Reinforcement Learning • 1B • Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round3-checkpoint-epoch-20

Reinforcement Learning • 1B • Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round4-reward-2025-09-19_13-28-49

Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round4

Reinforcement Learning • 1B • Updated Sep 19, 2025 • 1

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round4-checkpoint-epoch-100

Reinforcement Learning • 1B • Updated Sep 19, 2025 • 1

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round4-checkpoint-epoch-80

Reinforcement Learning • 1B • Updated Sep 19, 2025 • 1

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round4-checkpoint-epoch-60

Reinforcement Learning • 1B • Updated Sep 19, 2025 • 1

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round4-checkpoint-epoch-40

Reinforcement Learning • 1B • Updated Sep 19, 2025 • 1

MattBou00/llama-3-2-1b-detox_RETRY_scale10_Round4-checkpoint-epoch-20

Reinforcement Learning • 1B • Updated Sep 19, 2025 • 1

MattBou00/llama-3-2-1b-detox_RETRY_scale10-reward-2025-09-19_01-43-59

Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10

Reinforcement Learning • 1B • Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10-checkpoint-epoch-100

Reinforcement Learning • 1B • Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10-checkpoint-epoch-80

Reinforcement Learning • 1B • Updated Sep 19, 2025

MattBou00/llama-3-2-1b-detox_RETRY_scale10-checkpoint-epoch-60

Reinforcement Learning • 1B • Updated Sep 19, 2025