Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

AdoCleanCode
/
88136_GRPO

Transformers
Safetensors
Generated from Trainer
grpo
trl
Model card Files Files and versions
xet
Community

You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Gated model
You can list files but not access them

Preview of files found in this repository
  • .gitattributes
    1.52 kB
    initial commit 3 months ago
  • README.md
    2.3 kB
    AdoCleanCode/GRPO_V12_single 3 months ago
  • adapter_config.json
    948 Bytes
    AdoCleanCode/GRPO_V12_single 3 months ago
  • adapter_model.safetensors
    340 MB
    xet
    AdoCleanCode/GRPO_V12_single 3 months ago
  • special_tokens_map.json
    174 kB
    AdoCleanCode/GRPO-Scoreq 3 months ago
  • tokenizer.json
    2.11 MB
    AdoCleanCode/GRPO-Scoreq 3 months ago
  • tokenizer_config.json
    1.8 MB
    AdoCleanCode/GRPO-Scoreq 3 months ago
  • training_args.bin
    7.19 kB
    xet
    AdoCleanCode/GRPO_V12_single 3 months ago