Is 1.1 trained from the same SFT model as 1.0?

#18

by chujiezheng - opened Apr 17

Discussion

chujiezheng

Apr 17

Thanks for your work. Is gemma-1.1-7b-it trained from the same SFT model as gemma-7b-it?

Renu11

Google org Aug 2

Hi @chujiezheng , Gemma-1.1-7b-itis an improved version of Gemma-7b-it that benefits from additional training and optimization using Reinforcement Learning from Human Feedback (RLHF). Please have a look at this gemma-1.1-7b-it release notes for more details. Thank you.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment