Hard to train

by CheN70 - opened Dec 20, 2024

CheN70

Owner Dec 20, 2024

I think this one is really hard to train, as it may converge to a local optimum.

einkai

Feb 9

I concur.
I am trying to write an A2C implementation to train this. Even with considerable effort, its results do not beat vanilla REINFORCE.
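
For anyone comparing the two methods, here is a minimal sketch of where they differ: the weight applied to each log-probability gradient. It is not the code from my experiments. The function names, the reward sequence, and the critic values are made up for illustration, and the A2C variant shown uses full Monte Carlo returns rather than bootstrapped n-step targets.

```python
def discounted_returns(rewards, gamma=0.99):
    """Compute G_t = r_t + gamma * G_{t+1} for every step of one episode."""
    returns = []
    running = 0.0
    # Walk the episode backwards so each return reuses the next one.
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def reinforce_weights(rewards, gamma=0.99):
    """Vanilla REINFORCE: weight grad log pi(a_t|s_t) by the return G_t."""
    return discounted_returns(rewards, gamma)


def a2c_weights(rewards, values, gamma=0.99):
    """A2C (Monte Carlo variant): weight by the advantage G_t - V(s_t)."""
    returns = discounted_returns(rewards, gamma)
    return [g - v for g, v in zip(returns, values)]


# Hypothetical three-step episode with a sparse terminal reward.
rewards = [0.0, 0.0, 1.0]
# Hypothetical critic estimates V(s_t); a real critic would be learned.
values = [0.2, 0.5, 0.9]

print(reinforce_weights(rewards, gamma=0.5))
print(a2c_weights(rewards, values, gamma=0.5))
```

If the critic is poorly fit, the advantages can be noisier than the raw returns, which is one possible reason an A2C implementation fails to beat REINFORCE.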
