Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,58 @@
|
|
1 |
---
|
2 |
license: mit
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: mit
|
3 |
+
language:
|
4 |
+
- de
|
5 |
+
tags:
|
6 |
+
- gptj
|
7 |
+
- title generation
|
8 |
+
- headline-generation
|
9 |
+
- teaser generation
|
10 |
+
- keyword generation
|
11 |
+
- tweet generation
|
12 |
+
- news
|
13 |
+
inference: false
|
14 |
---
|
15 |
+
|
16 |
+
# snip-igel-500
|
17 |
+
|
18 |
+
<!-- Provide a quick summary of what the model is/does. -->
|
19 |
+
|
20 |
+
snip-igel-500
|
21 |
+
Version 1.0 / 13 April 2023
|
22 |
+
|
23 |
+
An adapter for [IGEL](https://huggingface.co/philschmid/instruct-igel-001) to generate german news snippets with human written instructions.
|
24 |
+
|
25 |
+
# Model Details
|
26 |
+
|
27 |
+
## Model Description
|
28 |
+
|
29 |
+
<!-- Provide a longer summary of what this model is. -->
|
30 |
+
|
31 |
+
Test generation capabilities here: https://snipaid.tech
|
32 |
+
|
33 |
+
SNIP-IGEL is a continued instruction-tuned LoRa-Adapter to generate titles, teasers, summaries, tweets and keywords from the text of a news article in german language. [IGEL](https://huggingface.co/philschmid/instruct-igel-001) is an instruction-tuned model on top of the pre-trained german version of BLOOM ([bloom-6b4-clp-german](https://huggingface.co/malteos/bloom-6b4-clp-german)). It was developed by fine-tuning with a machine translated instruction-dataset, aimed to explore the potential of the BLOOM architecture for language modeling tasks requiring instruction-based responses.
|
34 |
+
|
35 |
+
- **Developed by:** snipaid
|
36 |
+
- **Model type:** bloom
|
37 |
+
- **Language(s) (NLP):** de
|
38 |
+
- **License:** MIT
|
39 |
+
- **Finetuned from model:** [IGEL](https://huggingface.co/philschmid/instruct-igel-001)
|
40 |
+
|
41 |
+
# Uses
|
42 |
+
|
43 |
+
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
44 |
+
|
45 |
+
SNIP-IGEL is intended to be used for generating snippets for german news articles. It can be used by researchers, journalists, content creators and news agencies to automatically generate snippets for their articles in german language.
|
46 |
+
|
47 |
+
# Bias, Risks, and Limitations
|
48 |
+
|
49 |
+
<!-- This section is meant to convey both technical and sociotechnical limitations. -->
|
50 |
+
Several common deficiencies can be observed, including hallucination, toxicity and stereotypes.
|
51 |
+
|
52 |
+
# Training Details
|
53 |
+
|
54 |
+
## Training Data
|
55 |
+
|
56 |
+
<!-- This should link to a Data Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
|
57 |
+
|
58 |
+
SNIP-IGEL has been fine-tuned on [instruct-snippet-mlsum](https://huggingface.co/datasets/snipaid/instruct-snippet-mlsum). MLSUM is a dataset containing a german subset with text, title and teaser for news articles from the newspaper "Süddeutsche Zeitung". The dataset has been augmented with snippet data generated using a composite prompt which involves generating a SERP, keywords and a tweet for the news articles using a student-teacher-approach. Also see [snippet-mlsum-500](https://huggingface.co/datasets/snipaid/snippet-mlsum-500) for the dataset without instructions and our [blogpost](https://snipaid-nlg.github.io/2023/04/13/SNIP-IGEL.html) for more information about the construction of the dataset.
|