chuyi777 commited on
Commit
775458a
1 Parent(s): fea0be1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -6,6 +6,8 @@ Datasets and Hyperparameters
6
  Reward Model:https://huggingface.co/OpenLLMAI/Llama-3-8b-rm-700k
7
  SFT Model: https://huggingface.co/OpenLLMAI/Llama-3-8b-sft-mixture
8
  Prompt Dataset: https://huggingface.co/datasets/OpenLLMAI/prompt-collection-v0.1
 
 
9
  best_of_n: 2 (2 samples for each prompt)
10
  Learning Rate: 5e-7
11
  Beta: 0.1
 
6
  Reward Model:https://huggingface.co/OpenLLMAI/Llama-3-8b-rm-700k
7
  SFT Model: https://huggingface.co/OpenLLMAI/Llama-3-8b-sft-mixture
8
  Prompt Dataset: https://huggingface.co/datasets/OpenLLMAI/prompt-collection-v0.1
9
+ Max Prompt Length: 2048
10
+ Max Response Length: 2048
11
  best_of_n: 2 (2 samples for each prompt)
12
  Learning Rate: 5e-7
13
  Beta: 0.1