Update README.md
README.md CHANGED
@@ -11,6 +11,8 @@ license: apache-2.0

DCLM-1B is a 1.4 billion parameter language model trained on the DCLM-Baseline dataset, which was curated as part of the DataComp for Language Models (DCLM) benchmark. This model is designed to showcase the effectiveness of systematic data curation techniques for improving language model performance.

+ The instruction tuned version of this model is available here: https://huggingface.co/TRI-ML/DCLM-1B-IT
+
## Evaluation

We evaluate DCLM-1B using the [llm-foundry](https://github.com/mosaicml/llm-foundry) eval suite, and compare to recently released small models on key benchmarks.
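
For quick experimentation with the model described above, here is a minimal generation sketch. It assumes the base checkpoint is hosted as `TRI-ML/DCLM-1B` and loads through the standard `transformers` causal-LM interface; the exact repo id and any additional dependency (e.g. `open_lm`) are assumptions to verify against the model card.

```python
# Minimal sketch: load DCLM-1B and generate a short continuation.
# Assumptions: the checkpoint is hosted as "TRI-ML/DCLM-1B" and exposes the
# standard causal-LM interface; some DCLM checkpoints may also require open_lm.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "TRI-ML/DCLM-1B"  # assumed repo id; check the model card

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

prompt = "Careful data curation improves language models because"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=50, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```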