Datasets used to train this model

#8
by pavlichenko - opened

Could you clarify what datasets were used to train this model? I see mentions of "vicuna", "dolly15k", "grade_school_math_instructions", and "code_alpaca" besides oasst in the config. Is it true that "vicuna" corresponds to ShareGPT and this model was trained on a mixture of oasst, ShareGPT and other datasets?

Sign up or log in to comment