herumb-stanford / ppo-model-lora

PEFT
Safetensors
ppo-model-lora / checkpoint-2
1.87 GB
  • 1 contributor
History: 1 commit
herumb-stanford
Upload 18 files
dbec5fe verified 12 months ago
  • README.md
    5.12 kB
    Upload 18 files 12 months ago
  • adapter_config.json
    752 Bytes
    Upload 18 files 12 months ago
  • adapter_model.safetensors
    12.6 MB
    xet
    Upload 18 files 12 months ago
  • optimizer.pt
    1.85 GB
    xet
    Upload 18 files 12 months ago
  • rng_state.pth
    14.3 kB
    xet
    Upload 18 files 12 months ago
  • scheduler.pt
    1.06 kB
    xet
    Upload 18 files 12 months ago
  • special_tokens_map.json
    579 Bytes
    Upload 18 files 12 months ago
  • tokenizer.json
    3.56 MB
    Upload 18 files 12 months ago
  • tokenizer_config.json
    5.26 kB
    Upload 18 files 12 months ago
  • trainer_state.json
    2.26 kB
    Upload 18 files 12 months ago
  • training_args.bin
    6.26 kB
    xet
    Upload 18 files 12 months ago