kartikmosaicml committed
Commit bad7973
Parent(s): 5dbf3b1
Adding data mix table to the readme

README.md CHANGED
@@ -172,6 +172,22 @@ The model has been modified from a standard transformer in the following ways:
 | vocab size | 50432 |
 | sequence length | 8192 |
 
+## Data Mix
+
+The model was trained on the following data mix:
+
+| Data Source | Number of Tokens in Source | Proportion |
+|-------------|----------------------------|------------|
+| competition_math | 1.6 M | 3.01% |
+| cot_gsm8k | 3.36 M | 6.32% |
+| dialogsum | 0.1 M | 0.19% |
+| dolly_hhrlhf | 5.89 M | 11.07% |
+| duorc | 8.2 M | 15.51% |
+| qasper | 10.97 M | 20.63% |
+| quality | 11.31 M | 21.28% |
+| scrolls/summ_screen_fd | 11.56 M | 21.82% |
+| spider | 0.089 M | 0.16% |
+
 ## PreTraining Data
 
 For more details on the pretraining process, see [MPT-30B](https://huggingface.co/mosaicml/mpt-30b).
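The listed proportions track each source's share of the total token count. A quick sanity check of the table's arithmetic (a hypothetical script, not part of the model card — the commit itself only adds the markdown table):

```python
# Hypothetical sanity check: confirm each listed proportion is approximately
# tokens_in_source / total_tokens, and that the proportions sum to ~100%.
# Token counts are in millions, as in the table.
data_mix = {
    "competition_math": (1.6, 3.01),
    "cot_gsm8k": (3.36, 6.32),
    "dialogsum": (0.1, 0.19),
    "dolly_hhrlhf": (5.89, 11.07),
    "duorc": (8.2, 15.51),
    "qasper": (10.97, 20.63),
    "quality": (11.31, 21.28),
    "scrolls/summ_screen_fd": (11.56, 21.82),
    "spider": (0.089, 0.16),
}

total_tokens = sum(tokens for tokens, _ in data_mix.values())

for source, (tokens, listed_pct) in data_mix.items():
    computed_pct = 100 * tokens / total_tokens
    # Listed values match the computed token shares to within ~0.1 percentage
    # points (small differences likely come from rounding upstream).
    assert abs(computed_pct - listed_pct) < 0.1, source

# The listed proportions themselves sum to essentially 100%.
assert abs(sum(pct for _, pct in data_mix.values()) - 100) < 0.05
```

Running this confirms the mix totals roughly 53 M tokens and that the Proportion column is consistent with the token counts.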