berkeley-nest
/

Transformers
PyTorch
English
llama
reward model
RLHF
RLAIF
text-generation-inference
Luciano
fb18d4f