NiGuLa commited on
Commit
c059f40
1 Parent(s): 73d5476

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -6
README.md CHANGED
@@ -64,12 +64,20 @@ The dataset partially manually labeled samples and partially semi-automatically
64
  If you find this repository helpful, feel free to cite our publication:
65
 
66
  ```
67
- @inproceedings{babakov-etal-2021-bsnlp,
68
- title = "Detecting Inappropriate Messages on Sensitive Topics that Could Harm a Company's Reputation",
69
- author = "Babakov, Nikolay and Logacheva, Varvara and Kozlova, Olga and Semenov, Nikita and Panchenko, Alexander",
70
- booktitle = "To appear in the Proceedings of the 8th Workshop on Balto-Slavic Natural Language Processing",
71
- month = April,
 
 
 
 
72
  year = "2021",
73
- address = "Kyiv, Ukraine"
 
 
 
 
74
  }
75
  ```
 
64
  If you find this repository helpful, feel free to cite our publication:
65
 
66
  ```
67
+ @inproceedings{babakov-etal-2021-detecting,
68
+ title = "Detecting Inappropriate Messages on Sensitive Topics that Could Harm a Company{'}s Reputation",
69
+ author = "Babakov, Nikolay and
70
+ Logacheva, Varvara and
71
+ Kozlova, Olga and
72
+ Semenov, Nikita and
73
+ Panchenko, Alexander",
74
+ booktitle = "Proceedings of the 8th Workshop on Balto-Slavic Natural Language Processing",
75
+ month = apr,
76
  year = "2021",
77
+ address = "Kiyv, Ukraine",
78
+ publisher = "Association for Computational Linguistics",
79
+ url = "https://www.aclweb.org/anthology/2021.bsnlp-1.4",
80
+ pages = "26--36",
81
+ abstract = "Not all topics are equally {``}flammable{''} in terms of toxicity: a calm discussion of turtles or fishing less often fuels inappropriate toxic dialogues than a discussion of politics or sexual minorities. We define a set of sensitive topics that can yield inappropriate and toxic messages and describe the methodology of collecting and labelling a dataset for appropriateness. While toxicity in user-generated data is well-studied, we aim at defining a more fine-grained notion of inappropriateness. The core of inappropriateness is that it can harm the reputation of a speaker. This is different from toxicity in two respects: (i) inappropriateness is topic-related, and (ii) inappropriate message is not toxic but still unacceptable. We collect and release two datasets for Russian: a topic-labelled dataset and an appropriateness-labelled dataset. We also release pre-trained classification models trained on this data.",
82
  }
83
  ```