Update README.md
README.md
CHANGED
@@ -7,6 +7,9 @@ tags:
---
# Merge_XL_model_Stock
+Of course, the model is still fully focused on long-context roleplay and story writing.
+By far the best iteration.
+
This model switches to Smaug Instruct 32K for the base model.
Expanded with Giraffe and Gradient to keep a robust long context window.
Higgs and Cat cover most of the story and RP aspects.
@@ -43,4 +46,19 @@ models:
merge_method: model_stock
base_model: \Smaug-Llama-3-70B-Instruct-32K
dtype: bfloat16
```
+
+Any suggestions are very welcome.
+My personal sampling settings are:
+"temp": 1,
+"temperature_last": true,
+"top_p": 1,
+"top_k": 0,
+"top_a": 0,
+"tfs": 1,
+"typical_p": 1,
+"min_p": 0.05,
+"rep_pen": 1.05,
+"rep_pen_range": 4096,
+"rep_pen_decay": 0,
+"rep_pen_slope": 1,
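The hunk above only shows the tail end of the mergekit recipe, so the `models:` list itself is not visible. As a rough sketch (not the actual recipe), a `model_stock` config combining the components named in the description would look something like this; the model paths are placeholders, since the real repo names are not part of this diff:

```yaml
# Illustrative model_stock config for mergekit.
# The entries under `models:` are placeholders standing in for the
# Giraffe, Gradient, Higgs, and Cat components mentioned in the card;
# the actual paths used for this merge are not shown in the diff above.
models:
  - model: path/to/giraffe-long-context-component
  - model: path/to/gradient-long-context-component
  - model: path/to/higgs-rp-component
  - model: path/to/cat-rp-component
merge_method: model_stock
base_model: \Smaug-Llama-3-70B-Instruct-32K   # base model as listed in the config above
dtype: bfloat16
```

A config in this shape is what mergekit's `mergekit-yaml` command consumes to produce the merged weights.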
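The sampler list added at the end of the README is written as loose key/value lines; gathered into a single valid JSON object (the key names look like KoboldAI/SillyTavern-style preset fields, though the exact frontend is not stated in the diff), it would read:

```json
{
  "temp": 1,
  "temperature_last": true,
  "top_p": 1,
  "top_k": 0,
  "top_a": 0,
  "tfs": 1,
  "typical_p": 1,
  "min_p": 0.05,
  "rep_pen": 1.05,
  "rep_pen_range": 4096,
  "rep_pen_decay": 0,
  "rep_pen_slope": 1
}
```

With top_p, top_k, top_a, tfs, and typical_p left at their neutral values, these settings effectively rely on min_p 0.05 plus a mild repetition penalty (1.05 over the last 4096 tokens), with temperature applied last.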