MaIlz/outputs_grpo_improved

Transformers
Safetensors
Generated from Trainer
unsloth
trl
grpo
outputs_grpo_improved
353 MB
  • 1 contributor
History: 3 commits
Latest commit: MaIlz, MaIlz/molecular_grpo_improved2, e7aeea0 (verified), 9 months ago
  • .gitattributes
    1.57 kB
    MaIlz/molecular_grpo_improved 9 months ago
  • README.md
    2.07 kB
    MaIlz/molecular_grpo_improved 9 months ago
  • adapter_config.json
    866 Bytes
    MaIlz/molecular_grpo_improved2 9 months ago
  • adapter_model.safetensors
    336 MB
    xet
    MaIlz/molecular_grpo_improved2 9 months ago
  • special_tokens_map.json
    459 Bytes
    MaIlz/molecular_grpo_improved 9 months ago
  • tokenizer.json
    17.2 MB
    xet
    MaIlz/molecular_grpo_improved 9 months ago
  • tokenizer_config.json
    51.1 kB
    MaIlz/molecular_grpo_improved 9 months ago
  • training_args.bin
    5.82 kB
    xet
    MaIlz/molecular_grpo_improved2 9 months ago