Ericu950
/

Papy_2_Llama-3.1-8B-Instruct_text

Text Generation

Ancient Greek (to 1453)

textual criticism

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Ericu950 commited on Oct 13

Commit

7542075

•

1 Parent(s): 8849b3f

Upload README.md with huggingface_hub

Files changed (1) hide show

README.md +43 -0

README.md ADDED Viewed

	@@ -0,0 +1,43 @@

+---
+base_model: []
+library_name: transformers
+tags:
+- mergekit
+- merge
+---
+# PapyLlamaMerged
+This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+## Merge Details
+### Merge Method
+This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using /mimer/NOBACKUP/groups/naiss2024-22-361/Eric_Pap/Llama-3.1-8B-Instruct as a base.
+### Models Merged
+The following models were included in the merge:
+* /mimer/NOBACKUP/groups/naiss2024-22-201/PapInsc3/Papyllama2
+### Configuration
+The following YAML configuration was used to produce this model:
+```yaml
+models:
+  - model: /mimer/NOBACKUP/groups/naiss2024-22-361/Eric_Pap/Llama-3.1-8B-Instruct
+  - model: /mimer/NOBACKUP/groups/naiss2024-22-201/PapInsc3/Papyllama2
+    parameters:
+      density: 1.1  # Fixed density, slightly more sparse than the original
+      weight: 0.6  # Fixed weight to keep the fine-tuned model's influence high
+merge_method: ties
+base_model: /mimer/NOBACKUP/groups/naiss2024-22-361/Eric_Pap/Llama-3.1-8B-Instruct
+parameters:
+  normalize: true
+dtype: bfloat16
+```