MikkelWK commited on
Commit
2204267
·
verified ·
1 Parent(s): 9899850

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -0
README.md ADDED
@@ -0,0 +1,8 @@
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: tokenizers
3
+ tags: [Danish, Morphological Tokenization, CerebrasGPT]
4
+ ---
5
+
6
+ ### DA-MORPH-CEREBRAS-TOKEN
7
+
8
+ This morphological tokenizer is designed for the CerebrasGPT architecture and focuses on segmenting Danish text based on linguistic principles, enabling more meaningful subword tokenization.