jondurbin committed on
Commit
d525877
1 Parent(s): 719c144

Update README.md

  1. README.md +4 -0
README.md CHANGED
@@ -6,6 +6,8 @@ datasets:
 
 ### Overview
 
+__*I haven't tested this at all yet, quality could be great or absolute trash, I really don't know, but feel free to try.*__
+
 This is an instruction fine-tuned llama-2 model, using synthetic data generated by [airoboros](https://github.com/jondurbin/airoboros)
 
 - Experimental RP style instruction set, with two categories: rp and gtkm
@@ -46,6 +48,8 @@ ASSISTANT:
 
 So in other words, it's the preamble/system prompt, followed by a single space or newline, then "USER: " (single space after colon) then the prompt (which can have multiple lines, spaces, whatever), then a single space or newline, followed by "ASSISTANT: " (with a single space after the colon).
 
+__*I strongly suggest adding stopping criteria/early inference stopping on "USER:", because the training data includes many multi-round chats and could otherwise start simulating a conversation!*__
+
 ### Helpful usage tips
 
 *The prompts shown here are are just the text that would be included after USER: and before ASSISTANT: in the full prompt format above, the system prompt and USER:/ASSISTANT: have been omited for readability.*
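
The prompt format and the "USER:" stopping suggestion described in the README text can be sketched roughly as follows. This is a minimal illustration, not code from the repository: `build_prompt` and `truncate_at_stop` are hypothetical helper names, and the truncation shown is a post-processing step applied to already generated text rather than true early stopping inside the inference loop.

```python
def build_prompt(system: str, user: str, sep: str = " ") -> str:
    """Assemble the prompt: system prompt, separator, "USER: " plus the
    user prompt, separator, then "ASSISTANT: " (hypothetical helper)."""
    # The README allows either a single space or a newline as the separator.
    if sep not in (" ", "\n"):
        raise ValueError("separator must be a single space or a newline")
    return f"{system}{sep}USER: {user}{sep}ASSISTANT: "


def truncate_at_stop(generated: str, stop: str = "USER:") -> str:
    """Keep only the text before the first occurrence of the stop string,
    so a simulated follow-up turn is discarded (hypothetical helper)."""
    index = generated.find(stop)
    if index == -1:
        # No stop string found: return the whole completion.
        return generated.strip()
    return generated[:index].strip()


prompt = build_prompt("A chat between a user and an assistant.", "Hello!")
reply = truncate_at_stop("Hi there! USER: And what about...")
```

Truncating after generation still spends compute on the discarded tokens, so where the inference library offers its own stop-sequence or stopping-criteria mechanism, stopping on "USER:" during generation is likely the better choice.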