文章基本信息

标题：An Enhanced Lucene based System for Efficient Document/Information Retrieval
本地全文：下载
作者：Alaidine Ben Ayed ; Ismaïl Biskri ; Jean-Guy Meunier 等
期刊名称：Computer Science & Information Technology
电子版ISSN：2231-5403
出版年度：2020
卷号：10
期号：9
页码：161-167
DOI：10.5121/csit.2020.100913
出版社：Academy & Industry Research Collaboration Center (AIRCC)
摘要：In this paper we implement a document retrieval system using the Lucene tool and we conduct some experiments in order to compare the efficiency of two different weighting schema: the well-known TF-IDF and the BM25. Then, we expand queries using a comparable corpus (wikipedia) and word embeddings. Obtained results show that the latter method (word embeddings) is a good way to achieve higher precision rates and retrieve more accurate documents.
关键词：Internet and Web Applications ;Data and knowledge Representation ;Document Retrieval.