Rijgersberg
/

Mistral-7B-v0.1-chat-nl

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

Rijgersberg commited on Dec 15, 2023

Commit

dfbd8e1

·

1 Parent(s): e09938e

Add info

Files changed (1) hide show

README.md +14 -10

README.md CHANGED Viewed

@@ -3,9 +3,16 @@ license: apache-2.0
 base_model: mistralai/Mistral-7B-v0.1
 tags:
 - generated_from_trainer
 model-index:
 - name: Mistral-7B-v0.1-chat-nl
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -13,21 +20,18 @@ should probably proofread and complete it, then remove this comment. -->
 # Mistral-7B-v0.1-chat-nl
-This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
 It achieves the following results on the evaluation set:
 - Loss: 1.0263
 ## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
 ## Training procedure
@@ -71,4 +75,4 @@ The following hyperparameters were used during training:
 - Transformers 4.36.0.dev0
 - Pytorch 2.1.1+cu121
 - Datasets 2.15.0
-- Tokenizers 0.15.0

 base_model: mistralai/Mistral-7B-v0.1
 tags:
 - generated_from_trainer
+- GEITje
 model-index:
 - name: Mistral-7B-v0.1-chat-nl
   results: []
+datasets:
+- Rijgersberg/no_robots_nl
+- Rijgersberg/ultrachat_10k_nl
+language:
+- nl
+pipeline_tag: conversational
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # Mistral-7B-v0.1-chat-nl
+This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the Rijgersberg/no_robots_nl and Rijgersberg/ultrachat_10k_nl datasets.
 It achieves the following results on the evaluation set:
 - Loss: 1.0263
 ## Model description
+In order to investigate the effect of pretraining [Rijgersberg/GEITje-7B](https://huggingface.co/Rijgersberg/GEITje-7B-chat) on the finetuning of [Rijgersberg/GEITje-7B-chat](https://huggingface.co/Rijgersberg/GEITje-7B-chat),
+I also subjected the base model Mistral 7B v0.1 to the exact same training.
+This model is called Mistral-7B-v0.1-chat-nl.
+## More info
+Read more about GEITje and GEITje-chat in the [📄 README](https://github.com/Rijgersberg/GEITje/blob/main/README-en.md) on GitHub.
 ## Training procedure
 - Transformers 4.36.0.dev0
 - Pytorch 2.1.1+cu121
 - Datasets 2.15.0
+- Tokenizers 0.15.0