Why can this program successfully predict the next word by only passing in the token generated last time? No complete prompt token was passed in

#21

by LJUN9988 - opened 9 days ago

9 days ago

Why can this program successfully predict the next word by only passing in the token generated last time? No complete prompt token was passed in

9 days ago

i got it，because
llama have cache_k 和 cache -v

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment