---
datasets:
- pg19
metrics:
- perplexity
library_name: transformers
---

# Model Card: Nous-Yarn-Llama-2-13b-128k

## Model Description

Nous-Yarn-Llama-2-13b-128k is a state-of-the-art language model for long context, further pretrained on long-context data for 600 steps. This model is the Flash Attention 2 patched version of the original model: https://huggingface.co/conceptofmind/Yarn-Llama-2-13b-128k

Note that this model **requires** the [Flash Attention library](https://pypi.org/project/flash-attn/) in order to function correctly; see the Usage and Prompt Format section below for installation instructions.
18 |
|
19 |
## Model Training
|
20 |
|
|
|
30 |
The authors would like to thank Stability AI, Carper AI, and Eleuther AI for their generous support of significant computing resources that enabled the training of these models and the completion of this research. We would also like to thank Jonathan Tow and Dakota Mahan directly for their help in advising on the use of the Stability AI compute cluster. Additionally, we would like to thank a16z, and PygmalionAI, for providing resources to run evaluations and experiments on the models.
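
The front matter above lists perplexity on PG19 as the evaluation metric. As a rough illustration only, here is a minimal sliding-window perplexity sketch; the repo id, split, window, and stride are assumptions for demonstration, not the authors' evaluation harness.

```
# Hypothetical perplexity sketch on PG19 (assumed repo id and settings).
import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "NousResearch/Yarn-Llama-2-13b-128k"  # assumed Hugging Face repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto", trust_remote_code=True
).eval()

# One long PG19 test document; tokenize and score in overlapping windows.
text = load_dataset("pg19", split="test[:1]")[0]["text"]
ids = tokenizer(text, return_tensors="pt").input_ids.to(model.device)

window, stride = 8192, 4096  # assumed; well below the 128k maximum to keep memory modest
nlls = []
for start in range(0, max(ids.size(1) - window, 1), stride):
    chunk = ids[:, start : start + window]
    with torch.no_grad():
        # labels=chunk yields the mean next-token NLL over the window
        nlls.append(model(chunk, labels=chunk).loss)
print("approx. perplexity:", torch.exp(torch.stack(nlls).mean()).item())
```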

## Usage and Prompt Format

Install FA2 and Rotary Extensions:
```
# Flash Attention 2 kernels (needs a recent NVIDIA GPU and CUDA toolchain)
pip install flash-attn --no-build-isolation
# Rotary embedding extension from the flash-attention repository
pip install git+https://github.com/HazyResearch/flash-attention.git#subdirectory=csrc/rotary
```
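
The card stops at the installation step; a minimal loading-and-generation sketch follows, assuming the Hugging Face repo id `NousResearch/Yarn-Llama-2-13b-128k` and that the FA2-patched modeling code ships with the checkpoint (hence `trust_remote_code=True`). Dtype and sampling settings are illustrative.

```
# Minimal usage sketch (assumed repo id; flash-attn must already be installed).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "NousResearch/Yarn-Llama-2-13b-128k"  # assumption, not from the card
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # FA2 kernels require fp16/bf16
    device_map="auto",
    trust_remote_code=True,      # loads the custom FA2-patched attention code
)

prompt = "Once upon a time,"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=64, do_sample=True, temperature=0.8)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```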