Training Params
#3, opened by MrAiran
I found this model very interesting. Could you share the training parameters you used? I'm trying to reproduce parts of it with a fine-tune. I read the paper, but I didn't understand some specific points.
Another question, regarding the EOS token: does the model use <|endoftext|>?