Abstract: User posts whose perceived toxicity depends on conversational context are rare in current toxicity detection datasets. Hence, toxicity detectors trained on existing datasets will also tend to disregard context, making the detection of context-sensitive toxicity harder when it does occur. We construct and publicly release a dataset of 10,000 posts with two kinds of toxicity labels: (i) labels from annotators who considered each post together with its previous post as context; and (ii) labels from annotators who were given no additional context. Based on this dataset, we introduce a new task, context sensitivity estimation, which aims to identify posts whose perceived toxicity changes when the context (previous post) is also considered. We then evaluate machine learning systems on this task, showing that classifiers of practical quality can be developed, and we show that data augmentation with knowledge distillation can further improve performance. Such systems could be used to enrich toxicity detection datasets with more context-dependent posts, or to suggest when moderators should consult the parent posts, which is often unnecessary and would otherwise introduce significant additional cost.
Keywords: Natural Language Processing; Abusive Language Detection; Offensive Language Detection