README.md CHANGED
@@ -1,29 +1,22 @@
1
  ---
 
 
2
  license: apache-2.0
3
- base_model: Rijgersberg/GEITje-7B
4
  tags:
5
  - generated_from_trainer
6
  - GEITje
7
- - conversational
8
- model-index:
9
- - name: GEITje-7B-chat-v2
10
- results: []
11
  datasets:
12
  - Rijgersberg/no_robots_nl
13
  - Rijgersberg/ultrachat_10k_nl
14
  - BramVanroy/dutch_chat_datasets
15
- language:
16
- - nl
17
- pipeline_tag: text-generation
 
 
18
  ---
19
  # GEITje-7B-chat-v2
20
 
21
- > [!CAUTION]
22
- > **⚠️ At the pressing request of Stichting BREIN, GEITje is no longer available, starting immediately. ⚠️**
23
- >
24
- > All model files (the _weights_) and checkpoints have been deleted from this repo.
25
- > See my blog post ([Dutch](https://goingdutch.ai/nl/posts/geitje-takedown/), [English](https://goingdutch.ai/en/posts/geitje-takedown/)) for further clarification.
26
-
27
  **🤖️ Try the chat model in [🤗 Hugging Face Spaces](https://huggingface.co/spaces/Rijgersberg/GEITje-7B-chat)!**
28
 
29
  # GEITje-7B
@@ -54,11 +47,10 @@ Like Mistral, GEITje has a _context length_ of 8,192 tokens.
54
  As a demonstration of GEITje's capabilities for chat applications, two initial chat variants of GEITje have also been finetuned: GEITje-chat and GEITje-chat-v2.
55
  They can follow instructions, answer questions, and hold dialogues on a variety of topics.
56
 
 
57
  ## More info
58
  Read more about GEITje-chat in the [📄 README](https://github.com/Rijgersberg/GEITje/blob/main/README-en.md) on GitHub.
59
 
60
- ## Checkpoints
61
- An intermediate checkpoint is available in the `checkpoints` branch.
62
 
63
  ## Training procedure
64
 
@@ -107,4 +99,17 @@ The following hyperparameters were used during training:
107
  - Transformers 4.36.0.dev0
108
  - Pytorch 2.1.1+cu121
109
  - Datasets 2.15.0
110
- - Tokenizers 0.15.0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language:
3
+ - nl
4
  license: apache-2.0
 
5
  tags:
6
  - generated_from_trainer
7
  - GEITje
 
 
 
 
8
  datasets:
9
  - Rijgersberg/no_robots_nl
10
  - Rijgersberg/ultrachat_10k_nl
11
  - BramVanroy/dutch_chat_datasets
12
+ base_model: Rijgersberg/GEITje-7B
13
+ pipeline_tag: conversational
14
+ model-index:
15
+ - name: GEITje-7B-chat-v2
16
+ results: []
17
  ---
18
  # GEITje-7B-chat-v2
19
 
 
 
 
 
 
 
20
  **🤖️ Try the chat model in [🤗 Hugging Face Spaces](https://huggingface.co/spaces/Rijgersberg/GEITje-7B-chat)!**
21
 
22
  # GEITje-7B
 
47
  As a demonstration of GEITje's capabilities for chat applications, two initial chat variants of GEITje have also been finetuned: GEITje-chat and GEITje-chat-v2.
48
  They can follow instructions, answer questions, and hold dialogues on a variety of topics.
49
 
50
+
51
  ## More info
52
  Read more about GEITje-chat in the [📄 README](https://github.com/Rijgersberg/GEITje/blob/main/README-en.md) on GitHub.
53
 
 
 
54
 
55
  ## Training procedure
56
 
 
99
  - Transformers 4.36.0.dev0
100
  - Pytorch 2.1.1+cu121
101
  - Datasets 2.15.0
102
+ - Tokenizers 0.15.0
103
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
104
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Rijgersberg__GEITje-7B-chat-v2)
105
+
106
+ | Metric |Value|
107
+ |---------------------------------|----:|
108
+ |Avg. |50.79|
109
+ |AI2 Reasoning Challenge (25-Shot)|50.34|
110
+ |HellaSwag (10-Shot) |74.13|
111
+ |MMLU (5-Shot) |49.00|
112
+ |TruthfulQA (0-shot) |43.55|
113
+ |Winogrande (5-shot) |71.51|
114
+ |GSM8k (5-shot) |16.22|
115
+
checkpoint-12185/model-00001-of-00003.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:62fe5bb153098b41ef7fc06b72b6b1bd8e887e61ad33c9e2b033357655057713
3
+ size 4943162336
checkpoint-12185/model-00002-of-00003.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dca10b56bba30f9bb4fc5c3fcd02896368d22da08a52a39538f4dbf3c2ef71df
3
+ size 4999819336
checkpoint-12185/model-00003-of-00003.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4c693a1e370b10f5c6393e43e9318ea0cd06dc2ee4aa9bafb9dd5d297d80716e
3
+ size 4540516344
checkpoint-12185/optimizer.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bfa624e9702a749bbc16c4ab780b43698584b956a5ae5f31afe76efc7ad1c7da
3
+ size 14512103560
model-00001-of-00003.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:62fe5bb153098b41ef7fc06b72b6b1bd8e887e61ad33c9e2b033357655057713
3
+ size 4943162336
model-00002-of-00003.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dca10b56bba30f9bb4fc5c3fcd02896368d22da08a52a39538f4dbf3c2ef71df
3
+ size 4999819336
model-00003-of-00003.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4c693a1e370b10f5c6393e43e9318ea0cd06dc2ee4aa9bafb9dd5d297d80716e
3
+ size 4540516344