Search Setup to Replicate Paper

#2
by sean-lamont - opened

Hi, thanks for the model! Just wondering what the search setup is when using the critic model, as done in the paper? For example, do you just take the highest scoring unexplored state every iteration?

Sign up or log in to comment