akifumiwachi
commited on
Commit
•
2901c06
1
Parent(s):
2bbc875
Update README.md
Browse files
README.md
CHANGED
@@ -32,6 +32,7 @@ tags:
|
|
32 |
- **Fine-tuned from model:** [Alpaca (reprod.)](https://huggingface.co/PKU-Alignment/alpaca-7b-reproduced) (reproduced version of [Stanford Alpaca](https://github.com/tatsu-lab/stanford_alpaca))
|
33 |
- **Dataset:** [PKU-SafeRLHF-30K](https://huggingface.co/datasets/PKU-Alignment/PKU-SafeRLHF-30K)
|
34 |
- **SACPO Paper:** <https://arxiv.org/abs/2404.11049>
|
|
|
35 |
- **Model Alias:** P-SACPO 0.75
|
36 |
|
37 |
## Usage: How to Talk with the Model
|
|
|
32 |
- **Fine-tuned from model:** [Alpaca (reprod.)](https://huggingface.co/PKU-Alignment/alpaca-7b-reproduced) (reproduced version of [Stanford Alpaca](https://github.com/tatsu-lab/stanford_alpaca))
|
33 |
- **Dataset:** [PKU-SafeRLHF-30K](https://huggingface.co/datasets/PKU-Alignment/PKU-SafeRLHF-30K)
|
34 |
- **SACPO Paper:** <https://arxiv.org/abs/2404.11049>
|
35 |
+
- **GitHub:** <https://github.com/line/sacpo>
|
36 |
- **Model Alias:** P-SACPO 0.75
|
37 |
|
38 |
## Usage: How to Talk with the Model
|