GPT4All-Community
/

DeepSeek-R1-Distill-Llama-8B-GGUF

Inference Endpoints

Model card Files Files and versions Community

3Simplex commited on 10 days ago

Commit

98a4543

·

verified ·

1 Parent(s): 426f1b5

Update README.md

Files changed (1) hide show

README.md +1 -0

README.md CHANGED Viewed

@@ -52,6 +52,7 @@ We slightly change their configs and tokenizers. Please use our setting to run t
 2. **Avoid adding a system prompt; all instructions should be contained within the user prompt.**
 3. For mathematical problems, it is advisable to include a directive in your prompt such as: "Please reason step by step, and put your final answer within \boxed{}."
 4. When evaluating model performance, it is recommended to conduct multiple tests and average the results.
 ## License
 This code repository and the model weights are licensed under the [MIT License](https://github.com/deepseek-ai/DeepSeek-R1/blob/main/LICENSE).

 2. **Avoid adding a system prompt; all instructions should be contained within the user prompt.**
 3. For mathematical problems, it is advisable to include a directive in your prompt such as: "Please reason step by step, and put your final answer within \boxed{}."
 4. When evaluating model performance, it is recommended to conduct multiple tests and average the results.
+5. The context limit of the llama 3.1 8b variant is 128k (131072)
 ## License
 This code repository and the model weights are licensed under the [MIT License](https://github.com/deepseek-ai/DeepSeek-R1/blob/main/LICENSE).