samhitha2601
/

llama3.2-3b-ppo

Reinforcement Learning

text-generation

Model card Files Files and versions

llama3.2-3b-ppo

Commit History

Upload checkpoint from step 467

58a60b1
verified

samhitha2601 commited on Oct 23, 2025

initial commit

5d60377
verified

samhitha2601 commited on Oct 23, 2025