首页    期刊浏览 2025年01月07日 星期二
登录注册

文章基本信息

  • 标题:Detecting Inappropriate Messages on Sensitive Topics that Could Harm a Company’s Reputation
  • 本地全文:下载
  • 作者:Nikolay Babakov ; Varvara Logacheva ; Olga Kozlova
  • 期刊名称:Conference on European Chapter of the Association for Computational Linguistics (EACL)
  • 出版年度:2021
  • 卷号:2021
  • 页码:26-36
  • 语种:English
  • 出版社:ACL Anthology
  • 摘要:Not all topics are equally “flammable” in terms of toxicity: a calm discussion of turtles or fishing less often fuels inappropriate toxic dialogues than a discussion of politics or sexual minorities. We define a set of sensitive topics that can yield inappropriate and toxic messages and describe the methodology of collecting and labelling a dataset for appropriateness. While toxicity in user-generated data is well-studied, we aim at defining a more fine-grained notion of inappropriateness. The core of inappropriateness is that it can harm the reputation of a speaker. This is different from toxicity in two respects: (i) inappropriateness is topic-related, and (ii) inappropriate message is not toxic but still unacceptable. We collect and release two datasets for Russian: a topic-labelled dataset and an appropriateness-labelled dataset. We also release pre-trained classification models trained on this data.
国家哲学社会科学文献中心版权所有