anas-awadalla/mpt-1b-redpajama-200b-dolly

anas-awadalla committed on Jul 30, 2023

Commit f0a13e4 • 1 Parent(s): 38fdb61

turn attention_mask to bool in forward pass

Files changed (1)

mosaic_gpt.py CHANGED

@@ -247,6 +247,7 @@ class MosaicGPT(PreTrainedModel):
             use_cache: Optional[bool] = None):
         return_dict = return_dict if return_dict is not None else self.config.return_dict
         use_cache = use_cache if use_cache is not None else self.config.use_cache
+        attention_mask = attention_mask.bool()
         # These args are passed in by keyword in huggingface's generate function
         # https://github.com/huggingface/transformers/blob/68287689f2f0d8b7063c400230b3766987abf18d/src/transformers/generation/utils.py#L2201-L2206
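
The commit message does not say why the cast is needed. One plausible reason, stated here as an assumption rather than taken from the source, is that tokenizers usually return `attention_mask` as an integer tensor of 0s and 1s, and inverting such a tensor with `~` flips bits instead of truth values. The sketch below illustrates that difference with plain PyTorch. `to_bool_mask` is a hypothetical helper, not part of `mosaic_gpt.py`, and its `None` guard is an added assumption: the committed line calls `.bool()` unconditionally, so it would raise if `attention_mask` were `None`.

```python
from typing import Optional

import torch


def to_bool_mask(attention_mask: Optional[torch.Tensor]) -> Optional[torch.Tensor]:
    """Hypothetical helper: cast a 0/1 mask to torch.bool, passing None through."""
    if attention_mask is None:
        return None
    return attention_mask.bool()


# Assumed input: an integer mask where 1 marks a real token and 0 marks padding.
int_mask = torch.tensor([[1, 1, 0]])
bool_mask = to_bool_mask(int_mask)

# Bitwise NOT on an integer tensor flips bits, so every entry stays non-zero.
inverted_int = ~int_mask    # values: [[-2, -2, -1]]

# Logical NOT on a bool tensor selects only the padding position.
inverted_bool = ~bool_mask  # values: [[False, False, True]]
```

If the model inverts or indexes with the mask anywhere downstream, casting once at the top of `forward` keeps those operations consistent regardless of the dtype the caller passes in.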