Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
SGEcon
/
MSC_Llama-3.1-8B_GRPO3
like
0
Text Generation
Transformers
Safetensors
llama
trl
grpo
text-generation-inference
4-bit precision
bitsandbytes
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
MSC_Llama-3.1-8B_GRPO3
5.7 GB
Ctrl+K
Ctrl+K
1 contributor
History:
2 commits
SGEcon
basic training
92e3818
verified
9 months ago
.gitattributes
Safe
1.52 kB
initial commit
9 months ago
README.md
Safe
5.18 kB
basic training
9 months ago
config.json
Safe
1.36 kB
basic training
9 months ago
generation_config.json
Safe
184 Bytes
basic training
9 months ago
model-00001-of-00002.safetensors
4.65 GB
xet
basic training
9 months ago
model-00002-of-00002.safetensors
Safe
1.05 GB
xet
basic training
9 months ago
model.safetensors.index.json
Safe
132 kB
basic training
9 months ago