Basic Article Information

  • Title: Assessment of vector-host-pathogen relationships using data mining and machine learning
  • Authors: Diing D.M. Agany ; Jose E. Pietri ; Etienne Z. Gnimpieba
  • Journal: Computational and Structural Biotechnology Journal
  • Print ISSN: 2001-0370
  • Publication year: 2020
  • Volume: 18
  • Pages: 1704-1721
  • DOI: 10.1016/j.csbj.2020.06.031
  • Publisher: Computational and Structural Biotechnology Journal
  • Abstract: Infectious diseases, including vector-borne diseases transmitted by arthropods, are a leading cause of morbidity and mortality worldwide. In the era of big data, addressing broad-scale, fundamental questions regarding the complex dynamics of these diseases will increasingly require the integration of diverse datasets to produce new biological knowledge. This review provides a current snapshot of the systematic assessment of the relationships between microbial pathogens, arthropod vectors and mammalian hosts using data mining and machine learning. We employ PRISMA to identify 32 key papers relevant to this topic. Our analysis shows an increasing use of data mining and machine learning tasks and techniques, including prediction, classification, clustering, association rules mining, and deep learning, over the last decade. However, it also reveals a number of critical challenges in applying these to the study of vector-host-pathogen interactions at various systems biology levels. Here, relevant studies, current limitations and future directions are discussed. Furthermore, the quality of data in relevant papers was assessed using the FAIR (Findable, Accessible, Interoperable, Reusable) compliance criteria to evaluate and encourage reproducibility and shareability of research outcomes. Although shortcomings in their application remain, data mining and machine learning have significant potential to break new ground in understanding fundamental aspects of vector-host-pathogen relationships and their application in this field should be encouraged. In particular, while predictive modeling, feature engineering and supervised machine learning are already being used in the field, other data mining and machine learning methods such as deep learning and association rules analysis lag behind and should be implemented in combination with established methods to accelerate hypothesis and knowledge generation in the domain.
  • Keywords: Systems Bioscience ; OMICs ; Pathogenicity ; Transmission ; Adaptation ; Data Mining ; Big Data ; Machine Learning ; Association Mining ; Host-Pathogen ; Interaction ; Infectious Disease ; Vector-Borne Disease