nirusanan/GRPO-llama3.1-reasoning

Text Generation

Repository size: 168 MB

1 contributor

History: 3 commits

Latest commit: Update README.md (cfa9a03, verified, about 1 year ago)

File                        Size          Last commit                          Updated
.gitattributes              1.52 kB       initial commit                       about 1 year ago
README.md                   1.74 kB       Update README.md                     about 1 year ago
adapter_config.json         814 Bytes     Upload folder using huggingface_hub  about 1 year ago
adapter_model.safetensors   168 MB (xet)  Upload folder using huggingface_hub  about 1 year ago