MarcGrumpyOlejak committed on
Commit
5061a2c
1 Parent(s): bc10c0d

Update README.md

Added explanations of the merged models.

Files changed (1)
  1. README.md +10 -1
README.md CHANGED
@@ -16,7 +16,7 @@ tags:
16
  This model is used as an intermediate model for future merges.
17
  It is a merge of 4 pre-trained language models based upon Mistral-7B-v0.1 created using [mergekit](https://github.com/cg123/mergekit).
18
 
19
- I used it in a second step in combination with [DiscoLM_German_7b_v1](https://huggingface.co/DiscoResearch/DiscoLM_German_7b_v1) for the successor of 'talky_7B': [VerwaltungsAnthologie_Disco_7B](https://huggingface.co/MarcGrumpyOlejak/VerwaltungsAnthologie_Disco_7B)
20
 
21
  ## Merge Details
22
  ### Merge Method
@@ -30,7 +30,16 @@ The following models were included in the merge:
30
  * [DRXD1000/Phoenix](https://huggingface.co/DRXD1000/Phoenix)
31
  * [OpenPipe/mistral-ft-optimized-1227](https://huggingface.co/OpenPipe/mistral-ft-optimized-1227)
32
  * [VAGOsolutions/SauerkrautLM-7b-LaserChat](https://huggingface.co/VAGOsolutions/SauerkrautLM-7b-LaserChat)
 
33
34
  ### Configuration
35
 
36
  The following YAML configuration was used to produce this model:
 
16
  This model is used as an intermediate model for future merges.
17
  It is a merge of 4 pre-trained language models based upon Mistral-7B-v0.1 created using [mergekit](https://github.com/cg123/mergekit).
18
 
19
+ In combination with [DiscoLM_German_7b_v1](https://huggingface.co/DiscoResearch/DiscoLM_German_7b_v1), this 'clear' model is the base model for the successor of my first 'VA_talky_7B', namely 'VA_Disco_7B': [VerwaltungsAnthologie_Disco_7B](https://huggingface.co/MarcGrumpyOlejak/VerwaltungsAnthologie_Disco_7B)
20
 
21
  ## Merge Details
22
  ### Merge Method
 
30
  * [DRXD1000/Phoenix](https://huggingface.co/DRXD1000/Phoenix)
31
  * [OpenPipe/mistral-ft-optimized-1227](https://huggingface.co/OpenPipe/mistral-ft-optimized-1227)
32
  * [VAGOsolutions/SauerkrautLM-7b-LaserChat](https://huggingface.co/VAGOsolutions/SauerkrautLM-7b-LaserChat)
33
+ * [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1)
34
 
35
+ ### Explanations of the models used
36
+
37
+ * The main focus of this "clear" model is [hiig-piai/simba-v01c](https://huggingface.co/hiig-piai/simba-v01c). The Humboldt Institute for Internet and Society built this model for "simplified language" (Leichte Sprache in German).
38
+ * [DRXD1000/Phoenix](https://huggingface.co/DRXD1000/Phoenix) was fine-tuned on many German legal texts; it can even "hallucinate" almost perfect URLs for the official archive of German laws: [Gesetze im Internet](https://www.gesetze-im-internet.de/)
39
+ * [OpenPipe/mistral-ft-optimized-1227](https://huggingface.co/OpenPipe/mistral-ft-optimized-1227) performed best with mixed languages when combined with [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) as the base model.
40
+ * [VAGOsolutions/SauerkrautLM-7b-LaserChat](https://huggingface.co/VAGOsolutions/SauerkrautLM-7b-LaserChat) covers a wider range of colloquial German.
41
+ * [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) is the base model. Funny but true: using only [OpenPipe/mistral-ft-optimized-1227](https://huggingface.co/OpenPipe/mistral-ft-optimized-1227) as the base model is not as good as combining both.
42
+
43
  ### Configuration
44
 
45
  The following YAML configuration was used to produce this model: