File size: 1,673 Bytes
cab7bae
 
 
773d051
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
---
license: mit
---

# CAMeLBERT-UD-parser

## Model description
The **CAMeLBERT-UD-parser** is a neural dependency parsing model for Arabic text, specifically designed for the Universal Dependencies (UD) dependency formalism. 
It is based on the Biaffine Attention Dependency Parsing model introduced by [Dozat and Manning (2017)](https://arxiv.org/pdf/1611.01734.pdf) and implemented in 
[SuPar](https://github.com/yzhangcs/parser), which has been shown to be very effective for dependency parsing in many languages. 
The model is trained on the NUDAR (NYUAD Universal Dependency for Arabic) train set, which is a large Arabic corpus annotated with UD dependency labels. 
The model uses a CamelBERT-MSA word embedding layer, which is a pre-trained language model that has been trained on a massive dataset of Arabic text. 
The model was introduced in our paper "CamelParser2.0: A State-of-the-Art Dependency Parser for Arabic". 
The paper describes the model in detail and evaluates its performance on various Arabic dependency parsing tasks.


## Intended uses
The CAMeLBERT-UD-parser is shipped with the [CAMeLParser](https://github.com/CAMeL-Lab/camel_parser) as one of the default parsing models, 
and can be selected when parsing texts using the UD formalism.

## Citation
```bibtex
@inproceedings{Elshabrawy:2023:camelparser,
    title = "{CamelParser2.0: A State-of-the-Art Dependency Parser for Arabic}",
    author = {Ahmed Elshabrawy and
              Muhammed AbuOdeh and
              Go Inoue and
              Nizar Habash} ,
    booktitle = {Proceedings of The First Arabic Natural Language Processing Conference (ArabicNLP 2023)},
    year = "2023"
}
```