Something goes wrong when the latest code generates longer text
#19 by lvkaokao - opened
I have a similar issue. BTW, how can I revert to the previous version? Where can I find the revision info?
Hi @lvkaokao, could you please share the code and torch version you are using? As a hunch: if you are altering `config.max_seq_len`, that will initialize the `attn_bias` to that shape at model init time, and you won't be able to generate sequences longer than that. So if you'd like to generate sequences of size K, make sure that `config.max_seq_len` is set >K.
The versions, which can be passed using the `revision` kwarg, are here.
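Pinning a revision looks like the following (a hedged sketch: the helper name is hypothetical, and the actual revision string should be taken from the repo's commit history on the Hub):

```python
def load_pinned(name: str, revision: str):
    """Load a pinned model revision instead of the moving `main` branch.

    `revision` may be a branch name, tag, or commit hash from the repo's
    commit history on the Hub.
    """
    from transformers import AutoModelForCausalLM  # deferred import

    # MPT checkpoints use custom modeling code, hence trust_remote_code=True.
    return AutoModelForCausalLM.from_pretrained(
        name, revision=revision, trust_remote_code=True
    )

# Example (not run here; downloads weights):
# model = load_pinned("mosaicml/mpt-7b", "main")
```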
@abhi-mosaic and @sam-mosaic, could you help show me an example of how to run this model on Colab's CPU?
daking changed discussion status to closed