pierreguillou committed
Commit: fdad9ab
1 parent: 58a4aba
Update README.md
README.md
CHANGED
@@ -43,8 +43,6 @@ widget:
 
 **ner-bert-base-portuguese-cased-lenerbr** is a NER model (token classification) in the legal domain in Portuguese that was finetuned on 20/12/2021 in Google Colab from the model [pierreguillou/bert-base-cased-pt-lenerbr](https://huggingface.co/pierreguillou/bert-base-cased-pt-lenerbr) on the dataset [LeNER_br](https://huggingface.co/datasets/lener_br) by using a NER objective.
 
-The notebook of finetuning ([HuggingFace_Notebook_token_classification_NER_LeNER_Br.ipynb](https://github.com/piegu/language-models/blob/master/HuggingFace_Notebook_token_classification_NER_LeNER_Br.ipynb)) is in github.
-
 Due to the small size of BERTimbau base and finetuning dataset, the model overfitted before to reach the end of training. Here are the overall final metrics on the validation dataset (*note: see the paragraph "Validation metrics by Named Entity" to get detailed metrics*):
 - **f1**: 0.8733423827921062
 - **precision**: 0.8487923685812868
@@ -106,6 +104,10 @@ ner(input_text)
 ````
 ## Training procedure
 
+### Notebook
+
+The notebook of finetuning ([HuggingFace_Notebook_token_classification_NER_LeNER_Br.ipynb](https://github.com/piegu/language-models/blob/master/HuggingFace_Notebook_token_classification_NER_LeNER_Br.ipynb)) is in github.
+
 ### Training results
 
 ````