Request inference script for logps data

by tibetgao - opened Apr 23, 2024

Apr 23, 2024

Hi there,
Thanks for your awesome work on RLHF_V, I have noticed that you have provided weights for a pretrained model to generate logps, may I have the code for implementing this model to produce data?

And by the way, why do you call this model an SFT model, have you performed SFT on training this reward model?

Cheers!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment