Training Params

#3
by MrAiran - opened

I found this model very interesting. Would you be able to share the training parameters used? I'm trying to reproduce some parts of it with a fine-tune. I read the paper, but I didn't understand some specific points.

Another question is about the EOS token: does the model use <|endoftext|>?
