view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 • 135
Minuva Models Collection Fast and light models for conversational data. • 12 items • Updated Apr 11, 2024 • 1
BTLM-3B-8K: 7B Parameter Performance in a 3B Parameter Model Paper • 2309.11568 • Published Sep 20, 2023 • 10