Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

chchen
/
Llama-3.1-8B-Instruct-ppo-500

PEFT
Safetensors
Model card Files Files and versions
xet
Community
Llama-3.1-8B-Instruct-ppo-500
101 MB
  • 1 contributor
History: 2 commits
chchen's picture
chchen
Upload 13 files
ede28fe verified about 1 year ago
  • .gitattributes
    1.57 kB
    Upload 13 files about 1 year ago
  • README.md
    5.11 kB
    Upload 13 files about 1 year ago
  • adapter_config.json
    733 Bytes
    Upload 13 files about 1 year ago
  • adapter_model.safetensors
    83.9 MB
    xet
    Upload 13 files about 1 year ago
  • llama3_lora_ppo.yaml
    878 Bytes
    Upload 13 files about 1 year ago
  • special_tokens_map.json
    650 Bytes
    Upload 13 files about 1 year ago
  • tokenizer.json
    17.2 MB
    xet
    Upload 13 files about 1 year ago
  • tokenizer_config.json
    55.5 kB
    Upload 13 files about 1 year ago
  • trainer_log.jsonl
    2.95 kB
    Upload 13 files about 1 year ago
  • trainer_state.json
    2.63 kB
    Upload 13 files about 1 year ago
  • training_args.bin
    5.56 kB
    xet
    Upload 13 files about 1 year ago
  • training_loss.png
    34 kB
    Upload 13 files about 1 year ago
  • training_reward.png
    38.9 kB
    Upload 13 files about 1 year ago
  • value_head.safetensors
    16.6 kB
    xet
    Upload 13 files about 1 year ago