Bug in attention map computation
#3 opened by gionii
In the following line: https://huggingface.co/Synthyra/ESMplusplus_small/blob/main/modeling_esm_plusplus.py#L324, you are updating `attention_mask` rather than `attn_bias`, which is what is actually used to mask the attention values.
I am assuming you followed the template from the PyTorch documentation: https://pytorch.org/docs/stable/generated/torch.nn.functional.scaled_dot_product_attention.html
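
For context, here is a minimal sketch of that docs template (the names `attn_bias` and `attn_mask` follow the PyTorch example, not the exact identifiers in `modeling_esm_plusplus.py`), with a comment marking where the buggy variant diverges:

```python
import math
import torch

def attention_with_mask(query, key, value, attn_mask=None):
    # Pattern from the torch.nn.functional.scaled_dot_product_attention docs:
    # a zero bias tensor is built, and the mask is folded into it.
    L, S = query.size(-2), key.size(-2)
    scale = 1.0 / math.sqrt(query.size(-1))
    attn_bias = torch.zeros(L, S, dtype=query.dtype)

    if attn_mask is not None:
        # Correct: fold the (boolean, True = keep) mask into attn_bias,
        # which is what gets added to the attention scores below.
        attn_bias = attn_bias.masked_fill(attn_mask.logical_not(), float("-inf"))
        # Buggy variant (as reported): updating attn_mask itself instead,
        # e.g. `attn_mask = attn_mask.masked_fill(...)`, leaves attn_bias
        # all zeros, so the scores are never actually masked.

    attn_weight = (query @ key.transpose(-2, -1)) * scale
    attn_weight = attn_weight + attn_bias  # only attn_bias is applied here
    attn_weight = torch.softmax(attn_weight, dim=-1)
    return attn_weight @ value
```

Since only `attn_bias` is ever added to the scores, writing the masked values into `attention_mask` has no effect on the output.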