Aleksandar commited on
Commit
12be85e
1 Parent(s): 28e0629
Files changed (5) hide show
  1. README.md +27 -37
  2. config.json +2 -8
  3. pytorch_model.bin +2 -2
  4. tokenizer.json +0 -0
  5. training_args.bin +1 -1
README.md CHANGED
@@ -2,24 +2,26 @@
2
  tags:
3
  - generated_from_trainer
4
  datasets:
5
- - null
6
  metrics:
7
  - precision
8
  - recall
9
  - f1
10
  - accuracy
11
- language:
12
- - sr
13
  model_index:
14
  - name: bert-srb-ner
15
  results:
16
  - task:
17
  name: Token Classification
18
  type: token-classification
 
 
 
 
19
  metric:
20
  name: Accuracy
21
  type: accuracy
22
- value: 0.9641060273510046
23
  ---
24
 
25
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -27,33 +29,21 @@ should probably proofread and complete it, then remove this comment. -->
27
 
28
  # bert-srb-ner
29
 
30
- This model was finetuned from Aleksandar/bert-srb-cased-oscar on the setimes.SR dataset.
31
  It achieves the following results on the evaluation set:
32
- - Loss: 0.1647
33
- - Precision: 0.8247
34
- - Recall: 0.8454
35
- - F1: 0.8349
36
- - Accuracy: 0.9641
37
 
38
  ## Model description
39
 
40
- Default settings for BERT model, finetuned with batch size of 16.
41
 
42
  ## Intended uses & limitations
43
- | Tag (IOB) | Numerical representation | Meaning (Beginning = B., Inside = I.) |
44
- |-------------|--------------------------|------------------------------------------|
45
- | O | 0 | Other |
46
- | B-per | 1 | B.Person |
47
- | I-per | 2 | I. Person |
48
- | B-org | 3 | B. organization |
49
- | I-org | 4 | I. organization |
50
- | B-loc | 5 | B. location |
51
- | I-loc | 6 | I. location |
52
- | B-misc | 7 | B. Miscellaneous |
53
- | I-misc | 8 | I. Miscellaneous |
54
- | B-deriv-per | 9 | B. Derived Person |
55
-
56
- MIT license
57
 
58
  ## Training and evaluation data
59
 
@@ -74,18 +64,18 @@ The following hyperparameters were used during training:
74
 
75
  ### Training results
76
 
77
- | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
78
- |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
79
- | No log | 1.0 | 207 | 0.2040 | 0.7006 | 0.7466 | 0.7228 | 0.9411 |
80
- | No log | 2.0 | 414 | 0.1561 | 0.7299 | 0.7868 | 0.7573 | 0.9519 |
81
- | 0.2313 | 3.0 | 621 | 0.1455 | 0.7693 | 0.7992 | 0.7840 | 0.9567 |
82
- | 0.2313 | 4.0 | 828 | 0.1628 | 0.7760 | 0.8037 | 0.7896 | 0.9570 |
83
- | 0.0828 | 5.0 | 1035 | 0.1424 | 0.7997 | 0.8299 | 0.8145 | 0.9604 |
84
- | 0.0828 | 6.0 | 1242 | 0.1512 | 0.7983 | 0.8361 | 0.8168 | 0.9618 |
85
- | 0.0828 | 7.0 | 1449 | 0.1587 | 0.8084 | 0.8415 | 0.8246 | 0.9627 |
86
- | 0.0362 | 8.0 | 1656 | 0.1613 | 0.8154 | 0.8358 | 0.8255 | 0.9632 |
87
- | 0.0362 | 9.0 | 1863 | 0.1685 | 0.8211 | 0.8429 | 0.8319 | 0.9632 |
88
- | 0.0174 | 10.0 | 2070 | 0.1647 | 0.8247 | 0.8454 | 0.8349 | 0.9641 |
89
 
90
 
91
  ### Framework versions
 
2
  tags:
3
  - generated_from_trainer
4
  datasets:
5
+ - wikiann
6
  metrics:
7
  - precision
8
  - recall
9
  - f1
10
  - accuracy
 
 
11
  model_index:
12
  - name: bert-srb-ner
13
  results:
14
  - task:
15
  name: Token Classification
16
  type: token-classification
17
+ dataset:
18
+ name: wikiann
19
+ type: wikiann
20
+ args: sr
21
  metric:
22
  name: Accuracy
23
  type: accuracy
24
+ value: 0.9542715764169646
25
  ---
26
 
27
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
29
 
30
  # bert-srb-ner
31
 
32
+ This model was trained from scratch on the wikiann dataset.
33
  It achieves the following results on the evaluation set:
34
+ - Loss: 0.3045
35
+ - Precision: 0.8922
36
+ - Recall: 0.9050
37
+ - F1: 0.8986
38
+ - Accuracy: 0.9543
39
 
40
  ## Model description
41
 
42
+ More information needed
43
 
44
  ## Intended uses & limitations
45
+
46
+ More information needed
 
 
 
 
 
 
 
 
 
 
 
 
47
 
48
  ## Training and evaluation data
49
 
 
64
 
65
  ### Training results
66
 
67
+ | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
68
+ |:-------------:|:-----:|:-----:|:---------------:|:---------:|:------:|:------:|:--------:|
69
+ | 0.276 | 1.0 | 1250 | 0.2359 | 0.8355 | 0.8334 | 0.8344 | 0.9276 |
70
+ | 0.1722 | 2.0 | 2500 | 0.2016 | 0.8731 | 0.8685 | 0.8708 | 0.9426 |
71
+ | 0.1155 | 3.0 | 3750 | 0.1897 | 0.8707 | 0.8860 | 0.8783 | 0.9463 |
72
+ | 0.0849 | 4.0 | 5000 | 0.2151 | 0.8755 | 0.8980 | 0.8866 | 0.9494 |
73
+ | 0.0554 | 5.0 | 6250 | 0.2373 | 0.8820 | 0.8923 | 0.8871 | 0.9495 |
74
+ | 0.039 | 6.0 | 7500 | 0.2644 | 0.8808 | 0.8953 | 0.8880 | 0.9505 |
75
+ | 0.0286 | 7.0 | 8750 | 0.2737 | 0.8915 | 0.8961 | 0.8938 | 0.9520 |
76
+ | 0.018 | 8.0 | 10000 | 0.2879 | 0.8860 | 0.9039 | 0.8948 | 0.9526 |
77
+ | 0.0116 | 9.0 | 11250 | 0.2973 | 0.8930 | 0.9032 | 0.8981 | 0.9542 |
78
+ | 0.0079 | 10.0 | 12500 | 0.3045 | 0.8922 | 0.9050 | 0.8986 | 0.9543 |
79
 
80
 
81
  ### Framework versions
config.json CHANGED
@@ -15,10 +15,7 @@
15
  "3": "LABEL_3",
16
  "4": "LABEL_4",
17
  "5": "LABEL_5",
18
- "6": "LABEL_6",
19
- "7": "LABEL_7",
20
- "8": "LABEL_8",
21
- "9": "LABEL_9"
22
  },
23
  "initializer_range": 0.02,
24
  "intermediate_size": 3072,
@@ -29,10 +26,7 @@
29
  "LABEL_3": 3,
30
  "LABEL_4": 4,
31
  "LABEL_5": 5,
32
- "LABEL_6": 6,
33
- "LABEL_7": 7,
34
- "LABEL_8": 8,
35
- "LABEL_9": 9
36
  },
37
  "layer_norm_eps": 1e-12,
38
  "max_position_embeddings": 512,
 
15
  "3": "LABEL_3",
16
  "4": "LABEL_4",
17
  "5": "LABEL_5",
18
+ "6": "LABEL_6"
 
 
 
19
  },
20
  "initializer_range": 0.02,
21
  "intermediate_size": 3072,
 
26
  "LABEL_3": 3,
27
  "LABEL_4": 4,
28
  "LABEL_5": 5,
29
+ "LABEL_6": 6
 
 
 
30
  },
31
  "layer_norm_eps": 1e-12,
32
  "max_position_embeddings": 512,
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:23af3861d166a887a918e95698c32ced452fc73f94cbff4177baed6877d83eee
3
- size 430992881
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8c32dc26e67d4fa77db3bc506b926488345693fe3c58fa1540780fc65fddbe82
3
+ size 430983665
tokenizer.json CHANGED
The diff for this file is too large to render. See raw diff
 
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c5226a2a6a226ae0d234901a9cfc1963e0232659994075db9c8d3cfc87b233c6
3
  size 2671
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:75f1b23b7155993c980bae67c150064f0ae25e706d1369893636b31f3daf6136
3
  size 2671