MarsupialAI committed
Commit cafad1a
Parent(s): 77234f9
Update README.md

README.md CHANGED
@@ -3,8 +3,8 @@ license: llama2
 language:
 - en
 tags:
--
--
+- rp
+- erp
 - chat
 - storywriting
 ---
@@ -14,7 +14,7 @@ tags:
 This model is a rotating-stack merge of three 70b models in a 103b (120 layer) configuration inspired by Venus 103b. The result of
 this "frankenmerge" is a large model that contains a little bit of everything - including the kitchen sink. RP, chat, storywriting,
 and instruct are all well supported. It may or may not code well - I lack the expertise to test it in that capacity, but considering
-the source models, it is unlikely.
+the source models, I suspect it is unlikely.
 
 Component models for the rotating stack are
 - royallab/Aetheria-L2-70B
@@ -24,8 +24,13 @@ Component models for the rotating stack are
 Components of those models are purported to include: Nous-Hermes-Llama2-70b, Xwin-LM-7B-V0.1, Mythospice-70b, Euryale-1.3-L2-70B,
 tulu-2-dpo-70b, GOAT-70B-Storytelling, Platypus2-70B-instruct, Lila-70B, SunsetBoulevard, and some private LoRAs.
 
+As all components are based on Llama2 70b, native context length is 4k tokens. Coherency out to 8k is extremely good with rope scaling,
+but starts to decline beyond that.
+
 This model is uncensored and perfectly capable of generating objectionable material. However, it is not an explicitly-NSFW model,
-and it has never "gone rogue" and tried to insert NSFW content into SFW prompts in my experience.
+and it has never "gone rogue" and tried to insert NSFW content into SFW prompts in my experience. As with any LLM, no factual claims
+made by the model should be taken at face value. You know that boilerplate safety disclaimer that most professional models have?
+Assume this has it too. This model is for entertainment purposes only.
 
 
 
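For readers unfamiliar with the rotating-stack idea described in the card, the sketch below shows how a 120-layer (103b-class) stack can be assembled from three 80-layer Llama2-70b donors by walking the donor list in rotation over overlapping layer slices. This is a minimal illustration, not the recipe actually used for this model: the slice boundaries and the two placeholder donor names are assumptions, and only royallab/Aetheria-L2-70B is named in the card.

```python
# Illustrative layer plan for a 120-layer (103b-class) rotating-stack merge
# built from three 80-layer Llama2-70b donors. Slice boundaries are
# hypothetical; only the first donor name comes from the card above.
DONOR_DEPTH = 80    # layer count of a Llama2-70b model
TARGET_DEPTH = 120  # layer count of the 103b frankenmerge

plan = [
    ("royallab/Aetheria-L2-70B", 0, 40),   # named component; slice bounds assumed
    ("donor-b-70b",              20, 60),  # placeholder donor
    ("donor-c-70b",              40, 80),  # placeholder donor
]

# Sanity-check that the stacked slices add up to the target depth and stay
# within each donor's real layer range.
assert sum(hi - lo for _, lo, hi in plan) == TARGET_DEPTH
assert all(0 <= lo < hi <= DONOR_DEPTH for _, lo, hi in plan)

for model, lo, hi in plan:
    print(f"{model}: layers {lo}..{hi}")
```

Merge tools such as mergekit express this kind of layout as a passthrough merge over per-model layer ranges; the distinguishing feature of a rotating stack is that it cycles back through the donors with overlapping slices rather than stacking each donor once.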
|
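The context note above (4k native, good coherency out to 8k with rope scaling) corresponds to a linear RoPE scaling factor of about 2. Below is a minimal loading sketch with Hugging Face transformers; the repo ID is a placeholder, not the model's confirmed name.

```python
# Minimal sketch: load the merged model with linear RoPE scaling so the 4k
# native Llama2 context stretches to roughly 8k tokens. The repo ID is a
# placeholder, not the model's confirmed name.
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "MarsupialAI/rotating-stack-103b"  # hypothetical repo ID

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype="auto",
    device_map="auto",                               # requires accelerate
    rope_scaling={"type": "linear", "factor": 2.0},  # 4k native * 2 = ~8k usable
)

prompt = "Write the opening paragraph of a gothic mystery."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=200)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```

Past a factor of 2 (roughly 8k tokens) the card says coherency starts to decline, so larger factors trade quality for extra length.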