Does the 32K context in this image apply to the llama-3 model?
No, there are plans to train Qwen 72B.
8K for llama-3, 32K for qwen.
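For reference, you can read the configured context window straight from each model's config on the Hub. A minimal sketch, assuming the `transformers` library is installed and you have access to the (gated) Llama-3 repo; the repo IDs are illustrative:

```python
from transformers import AutoConfig

# Illustrative repo IDs; meta-llama/Meta-Llama-3-8B is gated and needs an access token.
for repo_id in ("meta-llama/Meta-Llama-3-8B", "Qwen/Qwen1.5-72B"):
    cfg = AutoConfig.from_pretrained(repo_id)
    # max_position_embeddings is the model's configured context length in tokens.
    print(f"{repo_id}: {cfg.max_position_embeddings} tokens")
```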