首页    期刊浏览 2024年12月04日 星期三
登录注册

文章基本信息

  • 标题:Support Vector Machine model for hERG inhibitory activities based on the integrated hERG database using descriptor selection by NSGA-II
  • 本地全文:下载
  • 作者:Keiji Ogura ; Tomohiro Sato ; Hitomi Yuki
  • 期刊名称:Scientific Reports
  • 电子版ISSN:2045-2322
  • 出版年度:2019
  • 卷号:9
  • 期号:1
  • 页码:1-12
  • DOI:10.1038/s41598-019-47536-3
  • 出版社:Springer Nature
  • 摘要:Assessing the hERG liability in the early stages of drug discovery programs is important. The recent increase of hERG-related information in public databases enabled various successful applications of machine learning techniques to predict hERG inhibition. However, most of these researches constructed the datasets from only one database, limiting the predictability and scope of the models. In this study, a hERG classification model was constructed using the largest dataset for hERG inhibition built by integrating multiple databases. The integrated dataset consisted of more than 291,000 structurally diverse compounds derived from ChEMBL, GOSTAR, PubChem, and hERGCentral. The prediction model was built by support vector machine (SVM) with descriptor selection based on Non-dominated Sorting Genetic Algorithm-II (NSGA-II) to optimize the descriptor set for maximum prediction performance with the minimal number of descriptors. The SVM classification model using 72 selected descriptors and ECFP_4 structural fingerprints recorded kappa statistics of 0.733 and accuracy of 0.984 for the test set, substantially outperforming the prediction performance of the current commercial applications for hERG prediction. Finally, the applicability domain of the prediction model was assessed based on the molecular similarity between the training set and test set compounds.
国家哲学社会科学文献中心版权所有