Update modeling_gpt_refact.py
modeling_gpt_refact.py  CHANGED  +1 -1
@@ -151,7 +151,7 @@ class Attention(nn.Module):
         upcast = dtype != softmax_dtype
         unscale = self.layer_idx + 1 if self.scale_attention_softmax_in_fp32 and upcast else 1
 
-        attn_weights = alibi + torch.matmul(query * self.scale, key)
+        attn_weights = (alibi + torch.matmul(query * self.scale, key)).to(query.dtype)
 
         if upcast:
             if attention_mask is None:
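For context, a minimal sketch of the dtype behaviour this one-line change targets. The tensor shapes and dtypes below are illustrative assumptions, not taken from the model; "scores" stands in for torch.matmul(query * self.scale, key).

import torch

# Minimal sketch of the dtype mismatch the cast addresses (toy tensors;
# shapes and dtypes here are assumptions, not the model's real configuration).
query_dtype = torch.float16
alibi = torch.zeros(4, 4, dtype=torch.float32)    # positional bias kept in fp32 (assumed)
scores = torch.zeros(4, 4, dtype=query_dtype)     # stand-in for the matmul output

# Without the cast, PyTorch type promotion makes the sum float32.
promoted = alibi + scores
print(promoted.dtype)                             # torch.float32

# With the cast added in this commit, attn_weights stays in query's dtype,
# so the later `if upcast:` branch sees the dtype it expects.
attn_weights = (alibi + scores).to(query_dtype)
print(attn_weights.dtype)                         # torch.float16

If alibi and the matmul result already share query's dtype, the added .to(query.dtype) is a no-op, so the change only matters when the bias is held in a higher precision than the attention computation.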