Update README.md
Browse files
README.md
CHANGED
@@ -9,7 +9,7 @@ tags:
|
|
9 |
---
|
10 |
|
11 |
# Llama-se-rl-peft
|
12 |
-
Adapter weights of an RL fine-tuned model based on LLaMa. Authored by Edward Beeching, Younes Belkada,
|
13 |
For more info check out the [blog post]() and [github example]().
|
14 |
|
15 |
|
@@ -33,7 +33,7 @@ The **Llama-se-rl** model inherits limitations and biases from the Llama model a
|
|
33 |
```bibtex
|
34 |
@misc{beeching2023llama,
|
35 |
title={StackLLaMa: An RL Fine-tuned LLaMa Model for Stack Exchange Question and Answering},
|
36 |
-
author={Beeching, Edward and Belkada, Younes and Rasul,
|
37 |
year={2023}
|
38 |
}
|
39 |
```
|
|
|
9 |
---
|
10 |
|
11 |
# Llama-se-rl-peft
|
12 |
+
Adapter weights of an RL fine-tuned model based on LLaMa. Authored by Edward Beeching, Younes Belkada, Kashif Rasul, Lewis Tunstall and Leandro von Werra.
|
13 |
For more info check out the [blog post]() and [github example]().
|
14 |
|
15 |
|
|
|
33 |
```bibtex
|
34 |
@misc{beeching2023llama,
|
35 |
title={StackLLaMa: An RL Fine-tuned LLaMa Model for Stack Exchange Question and Answering},
|
36 |
+
author={Beeching, Edward and Belkada, Younes and Rasul, Kashif and Tunstall, Lewis and von Werra, Leandro},
|
37 |
year={2023}
|
38 |
}
|
39 |
```
|