trl-lib
/

llama-7b-se-rl-peft

Model card Files Files and versions Community

Update README.md

#1

by ybelkada - opened Apr 5, 2023

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -8,6 +8,8 @@ tags:
 - reinforcement-learning
 ---
 # Llama-se-rl-peft
 Adapter weights of an RL fine-tuned model based on LLaMa. Authored by Edward Beeching, Younes Belkada, Kashif Rasul, Lewis Tunstall and Leandro von Werra.
 For more info check out the [blog post]() and [github example]().

 - reinforcement-learning
 ---
+![pull_figure](https://huggingface.co/datasets/trl-internal-testing/example-images/resolve/main/images/stack-llama.png)
 # Llama-se-rl-peft
 Adapter weights of an RL fine-tuned model based on LLaMa. Authored by Edward Beeching, Younes Belkada, Kashif Rasul, Lewis Tunstall and Leandro von Werra.
 For more info check out the [blog post]() and [github example]().