首页    期刊浏览 2024年12月15日 星期日
登录注册

文章基本信息

  • 标题:Machine Reading at Scale: A Search Engine for Scientific and Academic Research
  • 本地全文:下载
  • 作者:Norberto Sousa ; Nuno Oliveira ; Isabel Praça
  • 期刊名称:Systems
  • 电子版ISSN:2079-8954
  • 出版年度:2022
  • 卷号:10
  • 期号:2
  • 页码:43
  • DOI:10.3390/systems10020043
  • 语种:English
  • 出版社:MDPI Publishing
  • 摘要:The Internet, much like our universe, is ever-expanding. Information, in the most varied formats, is continuously added to the point of information overload. Consequently, the ability to navigate this ocean of data is crucial in our day-to-day lives, with familiar tools such as search engines carving a path through this unknown. In the research world, articles on a myriad of topics with distinct complexity levels are published daily, requiring specialized tools to facilitate the access and assessment of the information within. Recent endeavors in artificial intelligence, and in natural language processing in particular, can be seen as potential solutions for breaking information overload and provide enhanced search mechanisms by means of advanced algorithms. As the advent of transformer-based language models contributed to a more comprehensive analysis of both text-encoded intents and true document semantic meaning, there is simultaneously a need for additional computational resources. Information retrieval methods can act as low-complexity, yet reliable, filters to feed heavier algorithms, thus reducing computational requirements substantially. In this work, a new search engine is proposed, addressing machine reading at scale in the context of scientific and academic research. It combines state-of-the-art algorithms for information retrieval and reading comprehension tasks to extract meaningful answers from a corpus of scientific documents. The solution is then tested on two current and relevant topics, cybersecurity and energy, proving that the system is able to perform under distinct knowledge domains while achieving competent performance.
国家哲学社会科学文献中心版权所有