The GPU memory is not being released after inference?

#74
by chf7410 - opened

After thousands of pair-similarity calculations, the available GPU memory keeps shrinking.
Is some kind of cache being used on the GPU?
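
One way to narrow this down is to log GPU memory while the loop runs. The sketch below assumes the model runs on PyTorch, which the post does not state. `gpu_memory_mb`, `release_cached_memory`, `run_pairs`, and `score_fn` are hypothetical names used only for illustration and are not part of any library. PyTorch keeps freed blocks in a caching allocator, so tools such as `nvidia-smi` can show memory as used even after tensors are gone. If the "allocated" figure keeps growing, something is probably still holding references to tensors. If only "reserved" grows, it is more likely the allocator's cache.

```python
import contextlib
import gc

try:
    import torch
except ImportError:  # assumption: PyTorch backend; the post does not name the framework
    torch = None


def _cuda_ready():
    """True only when PyTorch is installed and a CUDA device is visible."""
    return torch is not None and torch.cuda.is_available()


def gpu_memory_mb():
    """Return (allocated, reserved) GPU memory in MiB, or None without CUDA."""
    if not _cuda_ready():
        return None
    mib = 1024 ** 2
    return (torch.cuda.memory_allocated() / mib,
            torch.cuda.memory_reserved() / mib)


def release_cached_memory():
    """Collect unreachable Python objects, then hand cached blocks back to the driver."""
    gc.collect()
    if _cuda_ready():
        torch.cuda.empty_cache()
    return gpu_memory_mb()


def run_pairs(score_fn, pairs, report_every=1000):
    """Score each pair without autograd and keep results as plain floats.

    score_fn is a hypothetical callable that returns one similarity score
    (a number or a one-element tensor) for a pair.
    """
    no_grad = torch.no_grad() if torch is not None else contextlib.nullcontext()
    results = []
    with no_grad:
        for i, pair in enumerate(pairs, start=1):
            # float() copies the value to the host, so no GPU tensor is retained
            results.append(float(score_fn(pair)))
            if i % report_every == 0:
                print(i, gpu_memory_mb())
    return results
```

Calling `release_cached_memory()` after the loop should lower the "reserved" figure if the growth came from the cache. It cannot free tensors that are still referenced, for example scores appended to a list while still on the GPU.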
