首页    期刊浏览 2024年12月11日 星期三
登录注册

文章基本信息

  • 标题:The influence of solid state information and descriptor selection on statistical models of temperature dependent aqueous solubility
  • 本地全文:下载
  • 作者:Richard L. Marchese Robinson ; Kevin J. Roberts ; Elaine B. Martin
  • 期刊名称:Journal of Cheminformatics
  • 印刷版ISSN:1758-2946
  • 电子版ISSN:1758-2946
  • 出版年度:2018
  • 卷号:10
  • 期号:1
  • 页码:44
  • DOI:10.1186/s13321-018-0298-3
  • 语种:English
  • 出版社:BioMed Central
  • 摘要:Predicting the equilibrium solubility of organic, crystalline materials at all relevant temperatures is crucial to the digital design of manufacturing unit operations in the chemical industries. The work reported in our current publication builds upon the limited number of recently published quantitative structure–property relationship studies which modelled the temperature dependence of aqueous solubility. One set of models was built to directly predict temperature dependent solubility, including for materials with no solubility data at any temperature. We propose that a modified cross-validation protocol is required to evaluate these models. Another set of models was built to predict the related enthalpy of solution term, which can be used to estimate solubility at one temperature based upon solubility data for the same material at another temperature. We investigated whether various kinds of solid state descriptors improved the models obtained with a variety of molecular descriptor combinations: lattice energies or 3D descriptors calculated from crystal structures or melting point data. We found that none of these greatly improved the best direct predictions of temperature dependent solubility or the related enthalpy of solution endpoint. This finding is surprising because the importance of the solid state contribution to both endpoints is clear. We suggest our findings may, in part, reflect limitations in the descriptors calculated from crystal structures and, more generally, the limited availability of polymorph specific data. We present curated temperature dependent solubility and enthalpy of solution datasets, integrated with molecular and crystal structures, for future investigations.
  • 关键词:Quantitative structure–property relationships ; Solubility ; Temperature dependent solubility data ; Enthalpy of solution ; Machine learning ; Random forest ; Multiple linear regression ; Feature selection ; Crystal structure ; Lattice energy ; Melting point
国家哲学社会科学文献中心版权所有