首页    期刊浏览 2024年12月14日 星期六
登录注册

文章基本信息

  • 标题:Importance of the Single-Span Task Formulation to Extractive Question-answering
  • 本地全文:下载
  • 作者:Marie-Anne Xu ; Rahul Khanna
  • 期刊名称:Computer Science & Information Technology
  • 电子版ISSN:2231-5403
  • 出版年度:2020
  • 卷号:10
  • 期号:18
  • 页码:106-115
  • DOI:10.5121/csit.2020.101809
  • 出版社:Academy & Industry Research Collaboration Center (AIRCC)
  • 摘要:Recent progress in machine reading comprehension and question-answering has allowed machines to reach and even surpass human question-answering. However, the majority of these questions have only one answer, and more substantial testing on questions with multiple answers, or multi-span questions, has not yet been applied. Thus, we introduce a newly compiled dataset consisting of questions with multiple answers that originate from previously existing datasets. In addition, we run BERT-based models pre-trained for question-answering on our constructed dataset to evaluate their reading comprehension abilities. Among the three of BERT-based models we ran, RoBERTa exhibits the highest consistent performance, regardless of size. We find that all our models perform similarly on this new, multi-span dataset (21.492% F1) compared to the single-span source datasets (~33.36% F1). While the models tested on the source datasets were slightly fine-tuned, performance is similar enough to judge that task formulation does not drastically affect question-answering abilities. Our evaluations indicate that these models are indeed capable of adjusting to answer questions that require multiple answers. We hope that our findings will assist future development in questionanswering and improve existing question-answering products and methods.
  • 关键词:Natural Language Processing ;Question Answering ;Machine Reading Comprehension.
国家哲学社会科学文献中心版权所有