Update chatNT.py
chatNT.py CHANGED
@@ -1330,6 +1330,9 @@ class MultiHeadAttention(nn.Module):
             )
         else:
             attention_weights = F.softmax(attention_weights, dim=-1)
+
+        print(f"Attention weights : {attention_weights.dtype}")
+        print(f"Value heads : {value_heads.dtype}")
         value_out = torch.einsum(
             "...htT, ...Thd->...thd", attention_weights, value_heads
         )
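The commit adds dtype diagnostics just before the attention-value contraction, presumably to chase a mixed-precision mismatch between the softmaxed attention weights and the value heads. Below is a minimal standalone sketch of that step; the tensor shapes and setup code are assumptions for illustration, and only the einsum pattern and the two print lines come from the diff.

import torch
import torch.nn.functional as F

# Hypothetical shapes for illustration; the real module's shapes are not in the diff.
batch, heads, seq, head_dim = 2, 4, 8, 16

# Attention logits over (batch, heads, query_len, key_len), normalized as in the diff.
attention_weights = F.softmax(torch.randn(batch, heads, seq, seq), dim=-1)

# Value heads laid out as (batch, key_len, heads, head_dim) to match "...Thd".
value_heads = torch.randn(batch, seq, heads, head_dim)

# The two diagnostics added by the commit: under mixed precision these dtypes
# can diverge (e.g. float32 vs bfloat16), and torch.einsum then raises.
print(f"Attention weights : {attention_weights.dtype}")
print(f"Value heads : {value_heads.dtype}")

# "...htT, ...Thd->...thd" contracts over the key axis T, yielding
# (batch, query_len, heads, head_dim).
value_out = torch.einsum("...htT, ...Thd->...thd", attention_weights, value_heads)
print(value_out.shape)  # torch.Size([2, 8, 4, 16])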