File size: 2,223 Bytes
e265f10
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
---
license: cc-by-nc-nd-4.0
pipeline_tag: text-classification
tags:
- deep learning
- law article retrieval
- natural language processing
- BERT
- information retrieval
- legal ai
- legal bert
- gdpr
- general data protection regulation
language:
- en
library_name: transformers
---

### Abstract
The General Data Protection Regulation (GDPR) is an European regulation on data protection and privacy for all individuals within the European Union (EU) and the European Economic Area (EEA), and for all foreign subjects dealing with European citizens data. Therefore, the GDPR has important legislation implications that hold beyond EU member states. In this paper, we address the problem of GDPR article retrieval through the use of pre-trained language models (PLMs). Our approach features several key aspects, which include both domain-general and domain-specific pre-trained BERT models, further powered by self-supervised task-adaptive pre-training stages, with or without data enrichment based on recitals. Our study endeavors to demonstrate the potential of PLMs in addressing the challenges posed by the GDPR’s intricate legal framework, thus ultimately facilitating efficient access to GDPR provisions for government agencies, law firms, legal professionals, and citizens alike.
### GDPR Article Retrieval based on Domain-adaptive and Task-adaptive Legal Pre-trained Language Models.

![image/webp](https://cdn-uploads.huggingface.co/production/uploads/62867cb4504d3770030ae173/Bn1qvPxZVLmM7tdyCYWzi.webp)

### BibTeX Entry and Citation Info
```
@article{Lamberta,
  author    = {Andrea Tagarelli and Andrea Simeri},
  title     = {{Unsupervised law article mining based on deep pre-trained language representation models with application to the Italian civil code}},
  journal   = {Artif. Intell. Law},
  volume    = {30(3)}, 
  pages     = {417--473. Published: 15 September 2021},
  year      = {2022}, 
  doi ={10.1007/s10506-021-09301-8}
}

```

### References
- Tagarelli, A., Simeri, A. Unsupervised law article mining based on deep pre-trained language representation models with application to the Italian civil code. Artif Intell Law 30, 417–473 (2022). https://doi.org/10.1007/s10506-021-09301-8

---