Commit a5c7cb5 (verified) by nithiyn
1 Parent(s): 7e7f8ed

Update README.md

Files changed (1)
  1. README.md +11 -0
README.md CHANGED
@@ -1,3 +1,14 @@
+ This repository contains AWS Inferentia2 and neuronx compatible checkpoints for [Mistral-Large-Instruct](https://huggingface.co/mistralai/Mistral-Large-Instruct-2407). You can find detailed information about the base model on its [Model Card](https://huggingface.co/mistralai/Mistral-Large-Instruct-2407).
+
+ This model has been exported to the Neuron format using the specific `input_shapes` and compiler parameters listed below.
+
+ It has been compiled to run on an inf2.48xlarge instance on AWS. The inf2.48xlarge has 24 Neuron cores, and this compilation uses all 24.
+ ---
+
+ SEQUENCE_LENGTH = 4096
+ BATCH_SIZE = 4
+ NUM_CORES = 24
+ PRECISION = "bf16"
  ---
  license: other
  license_name: mrl
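
Reviewer note: the four compilation parameters added in this diff can be read as export settings. The sketch below is a minimal illustration, assuming the checkpoints were produced with the `optimum-neuron` library (the README does not say so). The helper names `export_kwargs` and `load_model` are hypothetical, and the mapping of README constants to keyword names is an assumption, not the author's confirmed method.

```python
# Compilation parameters copied from the README added in this commit.
SEQUENCE_LENGTH = 4096
BATCH_SIZE = 4
NUM_CORES = 24
PRECISION = "bf16"


def export_kwargs():
    """Map the README constants onto optimum-neuron style keyword names.

    Assumption: the keyword names below match the export arguments of
    NeuronModelForCausalLM.from_pretrained(..., export=True) in the
    installed optimum-neuron version.
    """
    return {
        "batch_size": BATCH_SIZE,
        "sequence_length": SEQUENCE_LENGTH,
        "num_cores": NUM_CORES,
        "auto_cast_type": PRECISION,
    }


def load_model(repo_id):
    """Load precompiled checkpoints from a Hub repository (hypothetical helper).

    Requires an inf2 instance with optimum-neuron installed. The import is
    done lazily so the rest of this sketch runs without Neuron hardware.
    """
    from optimum.neuron import NeuronModelForCausalLM

    # Precompiled artifacts already carry their shapes, so no export
    # arguments are passed here.
    return NeuronModelForCausalLM.from_pretrained(repo_id)
```

Calling `export_kwargs()` only builds a dictionary, so it can be inspected on any machine. `load_model` needs the repository id of these checkpoints, which the caller has to supply, and it will only work where the Neuron runtime is available.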