quanghuy123 commited on
Commit
ed69fd1
·
verified ·
1 Parent(s): 11790ed

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +27 -1
README.md CHANGED
@@ -12,4 +12,30 @@ library_name: transformers
12
  new_version: google-bert/bert-base-multilingual-cased
13
  tags:
14
  - legal
15
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
12
  new_version: google-bert/bert-base-multilingual-cased
13
  tags:
14
  - legal
15
+ ---
16
+
17
+ # BERT-Law: Information Extraction Model for Legal Texts
18
+
19
+ ## Model Description
20
+
21
+ **BERT-Law** is a fine-tuned version of **BERT (Bidirectional Encoder Representations from Transformers)**, focusing on information extraction from legal documents. The model is specifically trained on a custom dataset called **UTE_LAW**, which consists of approximately 30,000 pairs of legal questions and related documents. The main goal of this model is to extract relevant information from legal text while reducing the costs associated with using third-party APIs.
22
+
23
+ ### Key Features
24
+ - **Base Model**: The model is built on top of `google-bert/bert-base-multilingual-cased`, which is a pre-trained multilingual BERT model.
25
+ - **Fine-tuning**: It has been fine-tuned with the **UTE_LAW** dataset, focusing on extracting relevant information from legal texts.
26
+ - **Model Type**: BERT-based model for **question-answering** tasks.
27
+ - **Task**: The model is optimized for information extraction tasks, specifically designed to handle legal documents.
28
+
29
+ ### Model Specifications
30
+ - **Maximum Sequence Length**: 512 tokens
31
+ - **Output Dimensionality**: 512 tokens
32
+ - **Language**: Primarily focused on **Vietnamese** legal texts.
33
+ - **License**: Apache-2.0 License
34
+
35
+ ## Usage
36
+
37
+ This model is suitable for applications in legal domains, such as:
38
+ - **Legal document analysis**: Extracting relevant information from legal texts.
39
+ - **Question answering**: Providing answers to legal questions based on the content of legal documents.
40
+
41
+ The model aims to reduce reliance on third-party APIs, which can incur higher costs, by providing a locally deployable solution for legal document processing.