Article information

  • Title: SU-QMI: A Feature Selection Method Based on Graph Theory for Prediction of Antimicrobial Resistance in Gram-Negative Bacteria
  • Authors: Abu Sayed Chowdhury; Douglas R. Call; Shira L. Broschat
  • Journal: Proceedings
  • Electronic ISSN: 2504-3900
  • Publication year: 2020
  • Volume: 54
  • Issue: 61
  • Page: 7
  • DOI: 10.3390/proceedings2020066007
  • Language: English
  • Publisher: MDPI AG
  • Abstract: Machine learning can be used as an alternative to similarity algorithms such as BLASTp when the latter fail to identify dissimilar antimicrobial-resistance genes (ARGs) in bacteria; however, determining the most informative characteristics, known as features, for antimicrobial resistance (AMR) is essential to obtain accurate predictions. In this paper, we introduce a feature selection algorithm called symmetrical uncertainty qualitative mutual information (SU-QMI), which selects features based on estimates of their relevance, redundancy, and interdependency. We use these together with graph theory to derive a feature selection method for identifying putative ARGs in Gram-negative bacteria. We extract physicochemical, evolutionary, and structural features from the protein sequences of five genera of Gram-negative bacteria (Acinetobacter, Klebsiella, Campylobacter, Salmonella, and Escherichia) which confer resistance to acetyltransferase (aac), β-lactamase (bla), and dihydrofolate reductase (dfr). Our SU-QMI algorithm is then used to find the best subset of features, and a support vector machine (SVM) model is trained for AMR prediction using this feature subset. We evaluate performance using an independent set of protein sequences from three Gram-negative bacterial genera (Pseudomonas, Vibrio, and Enterobacter) and achieve prediction accuracy ranging from 88% to 100%. Compared to the SU-QMI method, BLASTp requires similarity as low as 53% for comparable classification results. Our results indicate the effectiveness of the SU-QMI method for selecting the best protein features for AMR prediction in Gram-negative bacteria.
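
The abstract names symmetrical uncertainty as one ingredient of SU-QMI but does not spell out the computation. As a rough illustration only, the sketch below computes the standard symmetrical uncertainty, SU(X, Y) = 2 I(X; Y) / (H(X) + H(Y)), between discrete feature columns and a class label, and ranks features by that relevance score. It is not the authors' SU-QMI algorithm: the redundancy, interdependency, and graph-theory steps are omitted, and the function names, feature names, and toy data are all hypothetical.

```python
import math
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    counts = Counter(values)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def symmetrical_uncertainty(x, y):
    """Standard SU(X, Y) = 2 * I(X; Y) / (H(X) + H(Y)), a value in [0, 1]."""
    hx, hy = entropy(x), entropy(y)
    if hx + hy == 0:
        return 0.0  # both variables are constant, so there is nothing to share
    hxy = entropy(list(zip(x, y)))   # joint entropy H(X, Y)
    mutual_info = hx + hy - hxy      # I(X; Y) = H(X) + H(Y) - H(X, Y)
    return 2.0 * mutual_info / (hx + hy)


def rank_features(features, labels):
    """Sort (name, SU score) pairs by relevance to the label, highest first."""
    scores = {
        name: symmetrical_uncertainty(column, labels)
        for name, column in features.items()
    }
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical toy data: 1 = resistant, 0 = non-resistant (made up, not from the paper).
labels = [1, 1, 0, 0, 1, 0]
features = {
    "f_informative": [1, 1, 0, 0, 1, 0],  # mirrors the label exactly
    "f_noise": [0, 1, 0, 1, 1, 0],        # only weakly related to the label
}
ranking = rank_features(features, labels)
for name, score in ranking:
    print(f"{name}: SU = {score:.3f}")
```

Continuous features, such as the physicochemical descriptors mentioned in the abstract, would need to be discretized before applying this sketch. The top-ranked columns could then be passed to an SVM implementation of the reader's choice.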