PKU-Alignment/beaver-7b-v1.0-cost

Reinforcement Learning

reinforcement-learning-from-human-feedback

Commit History

Adding `safetensors` variant of this model

951a3a1 (verified)

SFconvertbot committed on Apr 16, 2024

Update architecture name in config.json

42e2cbe

XuehaiPan committed on Dec 15, 2023

Update README.md

c2f25b2

RuiyangSun committed on Jul 12, 2023

docs: update readme

32e35c1

RuiyangSun committed on Jul 10, 2023

docs: update readme

588a9a4

RuiyangSun committed on Jul 10, 2023

hello beaver cost model

0e42156

RuiyangSun committed on Jul 10, 2023

hello beaver cost model

cf8170f

RuiyangSun committed on Jul 10, 2023

initial commit

0615288

RuiyangSun committed on Jul 10, 2023