New version with explicit predicate marking
- README.md +68 -59
- config.json +42 -44
- pytorch_model.bin +2 -2
- tokenizer.json +1 -6
- training_args.bin +2 -2
README.md
CHANGED
@@ -15,59 +15,67 @@ should probably proofread and complete it, then remove this comment. -->

This model is a fine-tuned version of [ai-forever/ruElectra-medium](https://huggingface.co/ai-forever/ruElectra-medium) on an unknown dataset.
It achieves the following results on the evaluation set:
- Loss: 0.
- Addressee Precision: 0.
- Addressee Recall: 0.
- Addressee F1: 0.
- Addressee Number:
- Benefactive Precision: 0.
- Benefactive Recall: 0.
- Benefactive F1: 0.
- Benefactive Number:
- Causator Precision: 0.
- Causator Recall: 0.
- Causator F1: 0.
- Causator Number:
- Cause Precision: 0.
- Cause Recall: 0.7353
- Cause F1: 0.
- Cause Number:
- Contrsubject Precision: 0.
- Contrsubject Recall: 0.
- Contrsubject F1: 0.
- Contrsubject Number:
- Deliberative Precision: 0.
- Deliberative Recall: 0.
- Deliberative F1: 0.
- Deliberative Number:
- Destinative Precision: 0
- Destinative Recall: 0.
- Destinative F1: 0.
- Destinative Number:
- Directivefinal Precision: 0.
- Directivefinal Recall: 0.6667
- Directivefinal F1: 0.
- Directivefinal Number:
- Experiencer Precision: 0.
- Experiencer Recall: 0.
- Experiencer F1: 0.
- Experiencer Number:
- Instrument Precision:
- Instrument Recall: 0.
- Instrument F1: 0.
- Instrument Number: 9
- Mediative Recall: 0.0
- Mediative F1: 0.0
- Mediative Number: 1
- Object Precision: 0.8735
- Object Recall: 0.8924
- Object F1: 0.8828
- Object Number: 1524
- Overall Precision: 0.8571
- Overall Recall: 0.8749
- Overall F1: 0.8659
- Overall Accuracy: 0.9711

## Model description

@@ -86,25 +94,26 @@ More information needed

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate:
- train_batch_size:
- eval_batch_size: 1
- seed:
- gradient_accumulation_steps:
- total_train_batch_size: 8
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 5

### Training results

| Training Loss | Epoch | Step | Validation Loss | Addressee Precision | Addressee Recall | Addressee F1 | Addressee Number | Benefactive Precision | Benefactive Recall | Benefactive F1 | Benefactive Number | Causator Precision | Causator Recall | Causator F1 | Causator Number | Cause Precision | Cause Recall | Cause F1 | Cause Number | Contrsubject Precision | Contrsubject Recall | Contrsubject F1 | Contrsubject Number | Deliberative Precision | Deliberative Recall | Deliberative F1 | Deliberative Number | Destinative Precision | Destinative Recall | Destinative F1 | Destinative Number | Directivefinal Precision | Directivefinal Recall | Directivefinal F1 | Directivefinal Number | Experiencer Precision | Experiencer Recall | Experiencer F1 | Experiencer Number | Instrument Precision | Instrument Recall | Instrument F1 | Instrument Number |
| 0.
| 0.
| 0.
| 0.
| 0.

### Framework versions

This model is a fine-tuned version of [ai-forever/ruElectra-medium](https://huggingface.co/ai-forever/ruElectra-medium) on an unknown dataset.
It achieves the following results on the evaluation set:
- Loss: 0.1471
- Addressee Precision: 0.9583
- Addressee Recall: 0.9020
- Addressee F1: 0.9293
- Addressee Number: 51
- Benefactive Precision: 0.8
- Benefactive Recall: 0.25
- Benefactive F1: 0.3810
- Benefactive Number: 16
- Causator Precision: 0.8971
- Causator Recall: 0.8714
- Causator F1: 0.8841
- Causator Number: 70
- Cause Precision: 0.6466
- Cause Recall: 0.7353
- Cause F1: 0.6881
- Cause Number: 102
- Contrsubject Precision: 0.832
- Contrsubject Recall: 0.7879
- Contrsubject F1: 0.8093
- Contrsubject Number: 132
- Deliberative Precision: 0.6269
- Deliberative Recall: 0.84
- Deliberative F1: 0.7179
- Deliberative Number: 50
- Destinative Precision: 1.0
- Destinative Recall: 0.3871
- Destinative F1: 0.5581
- Destinative Number: 31
- Directivefinal Precision: 0.5455
- Directivefinal Recall: 0.6667
- Directivefinal F1: 0.6
- Directivefinal Number: 9
- Experiencer Precision: 0.8669
- Experiencer Recall: 0.8609
- Experiencer F1: 0.8639
- Experiencer Number: 726
- Instrument Precision: 0.5
- Instrument Recall: 0.3333
- Instrument F1: 0.4
- Instrument Number: 9
- Limitative Precision: 0.0
- Limitative Recall: 0.0
- Limitative F1: 0.0
- Limitative Number: 4
- Object Precision: 0.8676
- Object Recall: 0.8703
- Object F1: 0.8689
- Object Number: 1611
- Overall Precision: 0.8515
- Overall Recall: 0.8467
- Overall F1: 0.8491
- Overall Accuracy: 0.9687
- Directiveinitial Recall: 0.0
- Directiveinitial Number: 0.0
- Directiveinitial Precision: 0.0
- Directiveinitial F1: 0.0
- Mediative Recall: 0.0
- Mediative Number: 0.0
- Mediative Precision: 0.0
- Mediative F1: 0.0
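The per-role Precision/Recall/F1/Number figures above are entity-level metrics of the kind produced by seqeval for BIO-tagged token classification. As a point of reference, the sketch below shows how numbers of this shape can be computed; the use of seqeval and the example tag sequences are assumptions for illustration, not taken from this commit.

```python
# Sketch: entity-level precision/recall/F1 and support ("Number") per role,
# assuming seqeval as the metric backend (an assumption; the commit does not show the metric code).
from seqeval.metrics import classification_report

# Hypothetical gold and predicted BIO tag sequences using labels from this model's config.json.
y_true = [["O", "B-Object", "I-Object", "O", "B-Experiencer"]]
y_pred = [["O", "B-Object", "I-Object", "O", "B-Cause"]]

# Prints per-role precision, recall, F1 and support, plus micro-averaged overall scores,
# which correspond to the "Overall Precision/Recall/F1" fields in the card.
print(classification_report(y_true, y_pred, digits=4))
```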

## Model description

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 0.000261433658985083
- train_batch_size: 1
- eval_batch_size: 1
- seed: 510754
- gradient_accumulation_steps: 8
- total_train_batch_size: 8
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- lr_scheduler_warmup_ratio: 0.3
- num_epochs: 5
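For orientation, the hyperparameter list above maps fairly directly onto a transformers TrainingArguments object. The sketch below illustrates that mapping, assuming the model was trained with the Hugging Face Trainer; the output_dir is a placeholder.

```python
from transformers import TrainingArguments

# Sketch of a TrainingArguments object matching the hyperparameters listed above
# (an assumption about the training setup; output_dir is a placeholder name).
args = TrainingArguments(
    output_dir="ruelectra-medium-semantic-roles",  # placeholder
    learning_rate=0.000261433658985083,
    per_device_train_batch_size=1,
    per_device_eval_batch_size=1,
    gradient_accumulation_steps=8,                 # effective train batch size: 1 * 8 = 8
    seed=510754,
    lr_scheduler_type="linear",
    warmup_ratio=0.3,
    num_train_epochs=5,
)
```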

### Training results

| Training Loss | Epoch | Step | Validation Loss | Addressee Precision | Addressee Recall | Addressee F1 | Addressee Number | Benefactive Precision | Benefactive Recall | Benefactive F1 | Benefactive Number | Causator Precision | Causator Recall | Causator F1 | Causator Number | Cause Precision | Cause Recall | Cause F1 | Cause Number | Contrsubject Precision | Contrsubject Recall | Contrsubject F1 | Contrsubject Number | Deliberative Precision | Deliberative Recall | Deliberative F1 | Deliberative Number | Destinative Precision | Destinative Recall | Destinative F1 | Destinative Number | Directivefinal Precision | Directivefinal Recall | Directivefinal F1 | Directivefinal Number | Experiencer Precision | Experiencer Recall | Experiencer F1 | Experiencer Number | Instrument Precision | Instrument Recall | Instrument F1 | Instrument Number | Limitative Precision | Limitative Recall | Limitative F1 | Limitative Number | Object Precision | Object Recall | Object F1 | Object Number | Overall Precision | Overall Recall | Overall F1 | Overall Accuracy | Directiveinitial Recall | Directiveinitial Number | Directiveinitial Precision | Directiveinitial F1 | Mediative Recall | Mediative Number | Mediative Precision | Mediative F1 |
|:-------------:|:-----:|:----:|:---------------:|:-------------------:|:----------------:|:------------:|:----------------:|:---------------------:|:------------------:|:--------------:|:------------------:|:------------------:|:---------------:|:-----------:|:---------------:|:---------------:|:------------:|:--------:|:------------:|:----------------------:|:-------------------:|:---------------:|:-------------------:|:----------------------:|:-------------------:|:---------------:|:-------------------:|:---------------------:|:------------------:|:--------------:|:------------------:|:------------------------:|:---------------------:|:-----------------:|:---------------------:|:---------------------:|:------------------:|:--------------:|:------------------:|:--------------------:|:-----------------:|:-------------:|:-----------------:|:--------------------:|:-----------------:|:-------------:|:-----------------:|:----------------:|:-------------:|:---------:|:-------------:|:-----------------:|:--------------:|:----------:|:----------------:|:-----------------------:|:-----------------------:|:--------------------------:|:-------------------:|:----------------:|:----------------:|:-------------------:|:------------:|
| 0.2154 | 1.0 | 763 | 0.2074 | 0.6842 | 0.5098 | 0.5843 | 51 | 0.0 | 0.0 | 0.0 | 16 | 0.1946 | 0.8286 | 0.3152 | 70 | 1.0 | 0.0098 | 0.0194 | 102 | 0.2 | 0.0076 | 0.0146 | 132 | 0.0 | 0.0 | 0.0 | 50 | 0.0 | 0.0 | 0.0 | 31 | 0.0 | 0.0 | 0.0 | 9 | 0.6747 | 0.7713 | 0.7198 | 726 | 0.0 | 0.0 | 0.0 | 9 | 0.0 | 0.0 | 0.0 | 4 | 0.8199 | 0.7263 | 0.7702 | 1611 | 0.6987 | 0.6460 | 0.6713 | 0.9433 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.2294 | 2.0 | 1526 | 0.2028 | 0.7460 | 0.9216 | 0.8246 | 51 | 0.0 | 0.0 | 0.0 | 16 | 0.0 | 0.0 | 0.0 | 70 | 0.3333 | 0.0098 | 0.0190 | 102 | 0.7791 | 0.5076 | 0.6147 | 132 | 0.22 | 0.88 | 0.352 | 50 | 0.0 | 0.0 | 0.0 | 31 | 0.6667 | 0.6667 | 0.6667 | 9 | 0.8822 | 0.6708 | 0.7621 | 726 | 0.0 | 0.0 | 0.0 | 9 | 0.0 | 0.0 | 0.0 | 4 | 0.7332 | 0.7914 | 0.7612 | 1611 | 0.7255 | 0.6855 | 0.7050 | 0.9417 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.132 | 3.0 | 2290 | 0.1485 | 0.7188 | 0.9020 | 0.8 | 51 | 0.0 | 0.0 | 0.0 | 16 | 0.6854 | 0.8714 | 0.7673 | 70 | 0.4079 | 0.3039 | 0.3483 | 102 | 0.6562 | 0.7955 | 0.7192 | 132 | 0.5263 | 0.4 | 0.4545 | 50 | 0.0 | 0.0 | 0.0 | 31 | 0.6 | 0.6667 | 0.6316 | 9 | 0.8289 | 0.8609 | 0.8446 | 726 | 0.0 | 0.0 | 0.0 | 9 | 0.0 | 0.0 | 0.0 | 4 | 0.8013 | 0.8610 | 0.8300 | 1611 | 0.7806 | 0.8115 | 0.7957 | 0.9574 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.0748 | 4.0 | 3053 | 0.1382 | 0.9038 | 0.9216 | 0.9126 | 51 | 0.1905 | 0.25 | 0.2162 | 16 | 0.9104 | 0.8714 | 0.8905 | 70 | 0.5859 | 0.7353 | 0.6522 | 102 | 0.825 | 0.75 | 0.7857 | 132 | 0.4875 | 0.78 | 0.6 | 50 | 0.0 | 0.0 | 0.0 | 31 | 0.4615 | 0.6667 | 0.5455 | 9 | 0.9033 | 0.8237 | 0.8617 | 726 | 0.4 | 0.2222 | 0.2857 | 9 | 0.0 | 0.0 | 0.0 | 4 | 0.8468 | 0.8678 | 0.8571 | 1611 | 0.8321 | 0.8285 | 0.8303 | 0.9659 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 0.0504 | 5.0 | 3815 | 0.1471 | 0.9583 | 0.9020 | 0.9293 | 51 | 0.8 | 0.25 | 0.3810 | 16 | 0.8971 | 0.8714 | 0.8841 | 70 | 0.6466 | 0.7353 | 0.6881 | 102 | 0.832 | 0.7879 | 0.8093 | 132 | 0.6269 | 0.84 | 0.7179 | 50 | 1.0 | 0.3871 | 0.5581 | 31 | 0.5455 | 0.6667 | 0.6 | 9 | 0.8669 | 0.8609 | 0.8639 | 726 | 0.5 | 0.3333 | 0.4 | 9 | 0.0 | 0.0 | 0.0 | 4 | 0.8676 | 0.8703 | 0.8689 | 1611 | 0.8515 | 0.8467 | 0.8491 | 0.9687 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |

### Framework versions
config.json
CHANGED
@@ -12,54 +12,52 @@
"hidden_size": 576,
"id2label": {
    "0": "O",
    "1": "B-
    "2": "B-
    "3": "B-
    "4": "B-
    "5": "B-
    "6": "B-
    "7": "B-
    "8": "B-
    "9": "
    "10": "
    "11": "
    "12": "
    "13": "
    "14": "
    "15": "B-
    "16": "B-
    "17": "
    "18": "
    "19": "
    "20": "I-
    "21": "I-
    "22": "I-Cause"
},
"initializer_range": 0.02,
"intermediate_size": 2304,
"label2id": {
    "B-Addressee":
    "B-Benefactive":
    "B-Causator":
    "B-Cause":
    "B-ContrSubject":
    "B-Deliberative":
    "B-Destinative":
    "B-DirectiveFinal":
    "B-DirectiveInitial":
    "B-Experiencer":
    "B-Instrument":
    "B-Limitative":
    "B-Mediative":
    "B-Object":
    "
    "I-
    "I-
    "I-
    "I-
    "I-
    "I-
    "I-Object": 10,
    "O": 0
},
"layer_norm_eps": 1e-12,
"hidden_size": 576,
"id2label": {
    "0": "O",
    "1": "B-Object",
    "2": "B-Experiencer",
    "3": "B-Cause",
    "4": "B-Deliberative",
    "5": "B-Causator",
    "6": "B-ContrSubject",
    "7": "B-Benefactive",
    "8": "B-Addressee",
    "9": "I-Object",
    "10": "B-Destinative",
    "11": "I-ContrSubject",
    "12": "B-Instrument",
    "13": "I-Deliberative",
    "14": "B-Limitative",
    "15": "B-DirectiveFinal",
    "16": "B-Mediative",
    "17": "I-DirectiveFinal",
    "18": "B-DirectiveInitial",
    "19": "I-DirectiveInitial",
    "20": "I-Experiencer",
    "21": "I-Cause"
},
"initializer_range": 0.02,
"intermediate_size": 2304,
"label2id": {
    "B-Addressee": 8,
    "B-Benefactive": 7,
    "B-Causator": 5,
    "B-Cause": 3,
    "B-ContrSubject": 6,
    "B-Deliberative": 4,
    "B-Destinative": 10,
    "B-DirectiveFinal": 15,
    "B-DirectiveInitial": 18,
    "B-Experiencer": 2,
    "B-Instrument": 12,
    "B-Limitative": 14,
    "B-Mediative": 16,
    "B-Object": 1,
    "I-Cause": 21,
    "I-ContrSubject": 11,
    "I-Deliberative": 13,
    "I-DirectiveFinal": 17,
    "I-DirectiveInitial": 19,
    "I-Experiencer": 20,
    "I-Object": 9,
    "O": 0
},
"layer_norm_eps": 1e-12,
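The rewritten id2label/label2id tables above are what transformers uses to turn per-token logits into role tags. A small sketch of that lookup is given below; the checkpoint path is a placeholder and the dummy logits are only for illustration.

```python
import torch
from transformers import AutoConfig

# Placeholder path; the config.json shown above ships with the checkpoint itself.
config = AutoConfig.from_pretrained("path/to/this-checkpoint")

# id2label maps class indices to BIO role tags; label2id is the inverse mapping.
assert config.id2label[8] == "B-Addressee"
assert config.label2id["B-Addressee"] == 8

# Typical decoding step: argmax over per-token logits, then the id2label lookup.
logits = torch.randn(1, 5, len(config.id2label))   # dummy logits: (batch, tokens, num_labels)
predicted_ids = logits.argmax(dim=-1)[0].tolist()
print([config.id2label[i] for i in predicted_ids])
```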
pytorch_model.bin
CHANGED
@@ -1,3 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:
size

version https://git-lfs.github.com/spec/v1
oid sha256:1b8ef50ce5f1294afd3d610e06e06f236b2f231750523ba0c335cb1269c621e1
size 340226345
tokenizer.json
CHANGED
@@ -1,11 +1,6 @@
{
  "version": "1.0",
  "truncation": {
    "direction": "Right",
    "max_length": 2048,
    "strategy": "LongestFirst",
    "stride": 0
  },
  "padding": null,
  "added_tokens": [
    {

{
  "version": "1.0",
  "truncation": null,
  "padding": null,
  "added_tokens": [
    {
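The tokenizer.json change replaces the serialized truncation block (right-side truncation at 2048 tokens) with "truncation": null, so the saved fast tokenizer no longer truncates by default. If the previous behaviour is wanted it can be re-enabled at load time; the sketch below uses the tokenizers library and assumes the path points at the checkpoint's tokenizer.json.

```python
from tokenizers import Tokenizer

# Load the serialized fast tokenizer (path assumed to be the checkpoint's tokenizer.json).
tokenizer = Tokenizer.from_file("tokenizer.json")

# With "truncation": null nothing is truncated; explicitly restore the previous settings
# if needed (truncation direction defaults to the right side, as in the old file).
tokenizer.enable_truncation(max_length=2048, strategy="longest_first")
```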
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:
size

version https://git-lfs.github.com/spec/v1
oid sha256:d6b76fd56499805942ca588f2f290c4c0a3e7c80b80ef2c2b659e065090c0acb
size 4091
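Taken together, the commit updates the label inventory, the weights, and the training metadata. Since the card itself carries no usage snippet, a minimal token-classification inference sketch is added below; the checkpoint id and the example sentence are placeholders, and the pipeline-based usage is an assumption rather than the author's documented interface.

```python
from transformers import AutoModelForTokenClassification, AutoTokenizer, pipeline

# Placeholder id; substitute the actual repo of this fine-tuned ruElectra-medium checkpoint.
checkpoint = "path/to/this-checkpoint"

tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForTokenClassification.from_pretrained(checkpoint)

# Group sub-word predictions into whole spans labelled with the B-/I- roles from config.json.
tagger = pipeline(
    "token-classification",
    model=model,
    tokenizer=tokenizer,
    aggregation_strategy="simple",
)

print(tagger("Мальчик подарил девочке книгу."))  # placeholder sentence
```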