首页    期刊浏览 2024年12月02日 星期一
登录注册

文章基本信息

  • 标题:DEEP LEARNING FOR DOCUMENT CLUSTERING: A SURVEY, TAXONOMY AND RESEARCH TREND
  • 本地全文:下载
  • 作者:LUBNA GABRALLA ; HARUNA CHIROMA
  • 期刊名称:Journal of Theoretical and Applied Information Technology
  • 印刷版ISSN:1992-8645
  • 电子版ISSN:1817-3195
  • 出版年度:2020
  • 卷号:98
  • 期号:22
  • 页码:3602-3624
  • 出版社:Journal of Theoretical and Applied
  • 摘要:Information is stored in several forms such as pictures, web pages, sound and video, but 80% is stored as a text. Quick searching for a specific text document depends totally on the accuracy of the classification of the document's subject with a similar group of documents. This process is called documents clustering. Recently, deep learning techniques have achieved distinguish results in solving the problems facing documents clustering such as complex semantics and high dimensionality. This paper aims to examines a comprehensive review related to documents clustering, and survey the recent work in document clustering using deep learning methods. The proposed taxonomy represents knowledge that helps researchers to understand and follow up previous works in this area, and developing or creating new methods and a comparative analysis was made between popular dataset, performance metrics, deep learning frameworks and library used in deep learning clustering documents.
  • 关键词:Clustering;Deep Learning;Documents;Texts
国家哲学社会科学文献中心版权所有