LKriesch committed on
Commit
7c24a3d
·
verified ·
1 Parent(s): 13691ee

Update README.md

Files changed (1)
  1. README.md +38 -4
README.md CHANGED
@@ -1,13 +1,47 @@
1
  ---
 
2
  license: llama3.1
3
  language:
4
  - de
5
  base_model:
6
- - meta-llama/Llama-3.1-8B-Instruct
7
  - spacy/de_core_news_lg
8
- library_name: spacy
9
  ---
10
 
11
- ## Model Details
12
- THIS IS STILL WORK IN PROGRESS! This is a finetuned spacy/de_core_news_lg model. It was finetuned on 50,000 LLM-annotated German news articles. The model is trained to extract names of cities and municipalities from German news articles.
13
 
1
  ---
2
+ '[object Object]': null
3
  license: llama3.1
4
  language:
5
  - de
6
  base_model:
 
7
  - spacy/de_core_news_lg
8
+ - meta-llama/Llama-3.1-8B-Instruct
9
  ---
10
 
11
+ ### Model Description
12
+
13
+ This is a spaCy model fine-tuned to extract names of cities and municipalities from German news articles. It was trained on 50,000 LLM-annotated (LLAMA 3.1 8B-Instruct) German news articles from the CommonCrawl news dataset.
14
+
15
+ - **Developed by:** Lukas Kriesch
16
+ - **Model type:** Named Entity Recognition (NER)
17
+ - **Language(s) (NLP):** German
18
+ - **License:** Llama 3.1 Community License Agreement
19
+ - **Finetuned from model [optional]:** spacy/de_core_news_lg
20
+
21
+ ## Uses
22
+
23
+ ### Direct Use
24
+ This model can be directly used to extract city and municipality names from any German-language text source, particularly news articles. Researchers and developers working in geospatial analysis or regional studies may find this useful for location-based analyses.
25
+
26
+ ### Downstream Use
27
+
28
+ Fine-tuned applications might involve integration into larger workflows for geospatial data processing, population studies, regional analysis, or sentiment analysis in the context of location data.
29
+
30
+ ### Out-of-Scope Use
31
+ The model should not be used to infer broader geographical trends or to analyze texts unrelated to city or municipal locations in Germany. Additionally, it may not perform well on non-news domains or texts that lack clear references to cities or municipalities.
32
+
33
+ ## Bias, Risks, and Limitations
34
+
35
+ The model is trained specifically on German news articles and may not generalize well to other domains such as social media, legal texts, or scientific literature. Additionally, there may be biases in the training data, particularly if certain regions are underrepresented in the dataset. Users should be cautious about the model's performance across different subpopulations of city or location mentions (e.g., historical names or rare municipalities).
36
+
37
+ ## How to Get Started with the Model
38
+
39
+ Use the code below to get started with the model.
40
+
41
+ {{ get_started_code | default("[More Information Needed]", true)}}
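
The `get_started_code` template placeholder was left unfilled in this commit. As a rough sketch of what getting started might look like, the snippet below loads a spaCy pipeline and keeps only location entities. The path `path/to/model`, the helper names `load_pipeline` and `extract_place_names`, and the `LOC` label (the location label used by `spacy/de_core_news_lg`) are assumptions for illustration; the model card does not confirm the published package name or label scheme.

```python
def load_pipeline(model_path):
    """Load a spaCy pipeline from a package name or a local directory."""
    import spacy  # lazy import: the filter below has no spaCy dependency
    return spacy.load(model_path)


def extract_place_names(nlp, text, labels=("LOC",)):
    """Return the text of every entity whose label is in `labels`.

    `nlp` is any callable that returns an object with an `ents` sequence,
    such as a loaded spaCy pipeline. The default label is an assumption.
    """
    doc = nlp(text)
    return [ent.text for ent in doc.ents if ent.label_ in labels]


# Hypothetical usage; replace the path with wherever the model is installed:
# nlp = load_pipeline("path/to/model")
# extract_place_names(nlp, "Der Stadtrat von Marburg tagt am Montag.")
```

Keeping the label filter separate from the loading step makes it easy to adjust if the fine-tuned pipeline turns out to use a different label for cities and municipalities.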
42
 
43
+ #### Training Hyperparameters
44
+ Batch size: 64
45
+ Epochs: Up to 100, with early stopping (patience of 3 epochs)
46
+ Optimizer: Default spaCy optimizer for NER fine-tuning
47
+ Training regime: Mixed precision (fp16)
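
The "up to 100 epochs, with early stopping (patience of 3 epochs)" setting can be read as the loop sketched below. This is a minimal illustration, not the author's training script: the `evaluate_epoch` callback is hypothetical, and the sketch assumes the monitored quantity is a validation score where higher is better, which the README does not state.

```python
def train_with_early_stopping(evaluate_epoch, max_epochs=100, patience=3):
    """Run up to `max_epochs` epochs; stop after `patience` epochs without improvement.

    `evaluate_epoch(epoch)` is a hypothetical callback that trains for one
    epoch and returns a validation score (assumed: higher is better).
    Returns (best_epoch, best_score, last_epoch_run).
    """
    best_score = float("-inf")
    best_epoch = None
    epochs_without_improvement = 0
    epoch = 0
    for epoch in range(1, max_epochs + 1):
        score = evaluate_epoch(epoch)
        if score > best_score:
            # New best: remember it and reset the patience counter.
            best_score, best_epoch = score, epoch
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                break  # no improvement for `patience` consecutive epochs
    return best_epoch, best_score, epoch
```

With these defaults, training ends at the first point where three consecutive epochs fail to beat the best score seen so far, or at epoch 100, whichever comes first.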