zztheaven / Llama-3.2-3B-Instruct-Open-R1-GRPO

Text Generation

Generated from Trainer

text-generation-inference

Llama-3.2-3B-Instruct-Open-R1-GRPO / trainer_state.json

Model save

54e00b3 verified about 1 year ago

339 kB

File too large to display; check the raw version instead.