Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

andreaskoepf
/
oasst-rl-1-v0

Text Generation
Transformers
PyTorch
gpt_neox
text-generation-inference
Model card Files Files and versions
xet
Community
1
oasst-rl-1-v0 / README.md
andreaskoepf's picture
andreaskoepf
Update README.md
5cbe2ee about 3 years ago
preview code
|
raw
history blame contribute delete
619 Bytes
metadata
license: apache-2.0
  • wandb: https://wandb.ai/open-assistant/rlhf/runs/ldxshxkt
  • checkpoint: 2500
  • reward model: andreaskoepf/oasst-rm-2-pythia-1.4b-10000
  • base model: andreaskoepf/oasst-sft-4-pythia-12b-epoch-3.5
  • sampling report