首页    期刊浏览 2024年12月05日 星期四
登录注册

文章基本信息

  • 标题:An Enhanced Lucene based System for Efficient Document/Information Retrieval
  • 本地全文:下载
  • 作者:Alaidine Ben Ayed ; Ismaïl Biskri ; Jean-Guy Meunier
  • 期刊名称:Computer Science & Information Technology
  • 电子版ISSN:2231-5403
  • 出版年度:2020
  • 卷号:10
  • 期号:9
  • 页码:161-167
  • DOI:10.5121/csit.2020.100913
  • 出版社:Academy & Industry Research Collaboration Center (AIRCC)
  • 摘要:In this paper we implement a document retrieval system using the Lucene tool and we conduct some experiments in order to compare the efficiency of two different weighting schema: the well-known TF-IDF and the BM25. Then, we expand queries using a comparable corpus (wikipedia) and word embeddings. Obtained results show that the latter method (word embeddings) is a good way to achieve higher precision rates and retrieve more accurate documents.
  • 关键词:Internet and Web Applications ;Data and knowledge Representation ;Document Retrieval.
国家哲学社会科学文献中心版权所有