Search Setup to Replicate Paper
#2, opened by sean-lamont
Hi, thanks for the model! What search setup did the paper use with the critic model? For example, do you just expand the highest-scoring unexplored state at each iteration?
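To make the question concrete, here is a minimal sketch of the strategy I have in mind: a greedy best-first search that always pops the highest-scoring unexplored state. This is only my guess, not the paper's confirmed setup. The callables `expand`, `score`, and `is_goal`, and the `max_iterations` budget, are hypothetical placeholders for the policy, the critic model, and the proof checker.

```python
import heapq
from typing import Callable, Hashable, Iterable, Optional


def best_first_search(
    root: Hashable,
    expand: Callable[[Hashable], Iterable[Hashable]],  # hypothetical: successor states
    score: Callable[[Hashable], float],  # hypothetical: critic score, higher is better
    is_goal: Callable[[Hashable], bool],  # hypothetical: termination check
    max_iterations: int = 100,  # assumed search budget
) -> Optional[Hashable]:
    """Return the first goal state popped from the frontier, or None."""
    # heapq is a min-heap, so scores are negated; the counter breaks ties
    # so that states themselves are never compared.
    counter = 0
    frontier = [(-score(root), counter, root)]
    seen = {root}
    for _ in range(max_iterations):
        if not frontier:
            return None  # nothing left to explore
        _, _, state = heapq.heappop(frontier)  # highest-scoring unexplored state
        if is_goal(state):
            return state
        for child in expand(state):
            if child not in seen:
                seen.add(child)
                counter += 1
                heapq.heappush(frontier, (-score(child), counter, child))
    return None  # budget exhausted


if __name__ == "__main__":
    # Toy usage: reach 10 from 1 using the moves n + 1 and n * 2.
    found = best_first_search(
        root=1,
        expand=lambda n: [n + 1, n * 2],
        score=lambda n: -abs(10 - n),
        is_goal=lambda n: n == 10,
    )
    print(found)
```

Is this roughly what was done, or does the paper's search differ, for instance by using a different selection rule or combining the critic score with other signals?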