Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

chchen
/
gemma-2-9b-it-ppo-1000

PEFT
Safetensors
Model card Files Files and versions
xet
Community
gemma-2-9b-it-ppo-1000
147 MB
  • 1 contributor
History: 2 commits
chchen's picture
chchen
Upload 14 files
20049ad verified about 1 year ago
  • .gitattributes
    1.57 kB
    Upload 14 files about 1 year ago
  • README.md
    5.09 kB
    Upload 14 files about 1 year ago
  • adapter_config.json
    721 Bytes
    Upload 14 files about 1 year ago
  • adapter_model.safetensors
    108 MB
    xet
    Upload 14 files about 1 year ago
  • llama3_lora_ppo.yaml
    861 Bytes
    Upload 14 files about 1 year ago
  • special_tokens_map.json
    636 Bytes
    Upload 14 files about 1 year ago
  • tokenizer.json
    34.4 MB
    xet
    Upload 14 files about 1 year ago
  • tokenizer.model
    4.24 MB
    xet
    Upload 14 files about 1 year ago
  • tokenizer_config.json
    47 kB
    Upload 14 files about 1 year ago
  • trainer_log.jsonl
    6.1 kB
    Upload 14 files about 1 year ago
  • trainer_state.json
    4.9 kB
    Upload 14 files about 1 year ago
  • training_args.bin
    5.56 kB
    xet
    Upload 14 files about 1 year ago
  • training_loss.png
    34.6 kB
    Upload 14 files about 1 year ago
  • training_reward.png
    36.4 kB
    Upload 14 files about 1 year ago
  • value_head.safetensors
    14.5 kB
    xet
    Upload 14 files about 1 year ago