ipipan
/

nlpre_trankit_ud_xlm-roberta-large_nkjp-by-name

Model card Files Files and versions Community

martynawck commited on Oct 3, 2023

Commit

1285fcd

•

1 Parent(s): 7f9c385

Create README.md

Files changed (1) hide show

README.md +30 -0

README.md ADDED Viewed

	@@ -0,0 +1,30 @@

+# Model description
+- Morphosyntactic analyzer: Trankit
+- Tagset: UD
+- Embedding vectors: XLM-RoBERTa-Large
+- Dataset: NLPrePL-NKJP-fair-by-name (https://huggingface.co/datasets/ipipan/nlprepl)
+# How to use
+## Clone
+```
+git clone [email protected]:ipipan/nlpre_trankit_ud_xlm-roberta-large_nkjp-by-name
+```
+## Load model
+```
+import trankit
+model_path = './nlpre_trankit_ud_xlm-roberta-large_nkjp-by-name'
+trankit.verify_customized_pipeline(
+    category='customized-mwt', # pipeline category
+    save_dir=model_path, # directory used for saving models in previous steps
+    embedding_name='xlm-roberta-large' # embedding version that we use for training our customized pipeline, by default, it is `xlm-roberta-base`
+)
+model = trankit.Pipeline(lang='customized-mwt', cache_dir=model_path, embedding='xlm-roberta-large')
+```