首页    期刊浏览 2024年12月02日 星期一
登录注册

文章基本信息

  • 标题:Decision Algorithm for the Automatic Determination of the Use of Non-Inclusive Terms in Academic Texts
  • 本地全文:下载
  • 作者:Pedro Orgeira-Crespo ; Carla Míguez-Álvarez ; Miguel Cuevas-Alonso
  • 期刊名称:Publications
  • 电子版ISSN:2304-6775
  • 出版年度:2020
  • 卷号:8
  • 期号:3
  • 页码:41-65
  • DOI:10.3390/publications8030041
  • 出版社:MDPI Publishing
  • 摘要:The use of inclusive language, among many other gender equality initiatives in society, has garnered great attention in recent years. Gender equality offices in universities and public administration cannot cope with the task of manually checking the use of non-inclusive language in the documentation that those institutions generate. In this research, an automated solution for the detection of non-inclusive uses of the Spanish language in doctoral theses generated in Spanish universities is introduced using machine learning techniques. A large dataset has been used to train, validate, and analyze the use of inclusive language; the result is an algorithm that detects, within any Spanish text document, non-inclusive uses of the language with error, false positive, and false negative ratios slightly over 10%, and precision, recall, and F-measure percentages over 86%. Results also show the evolution with time of the ratio of non-inclusive usages per document, having a pronounced reduction in the last years under study.
  • 关键词:inclusive language; Spanish language; natural language processing; classification algorithm; machine learning inclusive language ; Spanish language ; natural language processing ; classification algorithm ; machine learning
国家哲学社会科学文献中心版权所有