What's the decoder_start_token_id and eos_token_id used in training?

#7 by cqchangm

In large-v3 the decoder start and end-of-sequence tokens are (50258, 50257), which are ("<|startoftranscript|>", "<|endoftext|>").
In this model they are (50257, 50256), which are ("<|endoftext|>", "") according to added_tokens.json.

Was the model fine-tuned this way, i.e. with <|endoftext|> at the start? Or was it just a typo?
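
For reference, a minimal sketch of how these values can be checked with transformers (the model id below points at large-v3 for comparison; swap in this repo's id to inspect its config):

```python
from transformers import AutoConfig, AutoTokenizer

# Placeholder: large-v3 for comparison; replace with this repo's model id.
model_id = "openai/whisper-large-v3"

config = AutoConfig.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

print("decoder_start_token_id:", config.decoder_start_token_id)  # 50258 for large-v3
print("eos_token_id:", config.eos_token_id)                      # 50257 for large-v3

# Map the ids back to token strings to see what the decoder actually starts and ends with.
print(tokenizer.convert_ids_to_tokens(
    [config.decoder_start_token_id, config.eos_token_id]
))
# For large-v3: ['<|startoftranscript|>', '<|endoftext|>']
```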