Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

CodCodingCode
/
llama-3.1-8b-GRPO-V2.0

Transformers
TensorBoard
Safetensors
Generated from Trainer
grpo
trl
Model card Files Files and versions
xet
Metrics Training metrics Community
llama-3.1-8b-GRPO-V2.0 / runs /Jul02_20-32-11_192-222-59-149
7.84 kB
  • 1 contributor
History: 1 commit
CodCodingCode's picture
CodCodingCode
Upload folder using huggingface_hub
27a72dd verified 6 months ago
  • events.out.tfevents.1751488331.192-222-59-149.11742.0
    7.84 kB
    xet
    Upload folder using huggingface_hub 6 months ago