LLMAligned
/

grpo_gsm8k_model

Reinforcement Learning

Model card Files Files and versions

grpo_gsm8k_model / tokenizer_config.json

Commit History

Upload folder using huggingface_hub

c3ebce3
verified

LLMAligned commited on 26 days ago