首页    期刊浏览 2024年12月05日 星期四
登录注册

文章基本信息

  • 标题:Novelty Detection via Topic Modeling in Research Articles
  • 本地全文:下载
  • 作者:S. Sendhilkumar ; Nachiyar S Nandhini ; G.S. Mahalakshmi
  • 期刊名称:Computer Science & Information Technology
  • 电子版ISSN:2231-5403
  • 出版年度:2013
  • 卷号:3
  • 期号:5
  • 页码:401-410
  • DOI:10.5121/csit.2013.3542
  • 出版社:Academy & Industry Research Collaboration Center (AIRCC)
  • 摘要:In today's world redundancy is the most vital problem faced in almost all domains. Novelty detection is the identification of new or unknown data or signal that a machine learning system is not aware of during training. The problem becomes more intense when it comes to "Research Articles". A method of identifying novelty at each sections of the article is highly required for determining the novel idea proposed in the research paper. Since research articles are semi-structured, detecting novelty of information from them requires more accurate systems. Topic model provides a useful means to process them and provides a simple way to analyze them. This work compares the most predominantly used topic model- Latent Dirichlet Allocation with the hierarchical Pachinko Allocation Model. The results obtained are promising towards hierarchical Pachinko Allocation Model when used for document retrieval
  • 关键词:Novelty detection; Topic modeling; LDA; hPAM; Novelty score; Concept maps
国家哲学社会科学文献中心版权所有