surfiniaburger
/

llama-3b-pDIPG-GRPO-v2

Text Generation

text-generation-inference

Model card Files Files and versions

llama-3b-pDIPG-GRPO-v2

6.44 GB

Ctrl+K

Ctrl+K

1 contributor

History: 6 commits

surfiniaburger's picture

(Trained with Unsloth)

2d740ca verified 7 months ago