首页    期刊浏览 2024年12月11日 星期三
登录注册

文章基本信息

  • 标题:A novel index to evaluate discretization methods: A case study of flood susceptibility assessment based on random forest
  • 本地全文:下载
  • 作者:Xianzhe Tang ; Takashi Machimura ; Wei Liu
  • 期刊名称:Geoscience Frontiers
  • 印刷版ISSN:1674-9871
  • 出版年度:2021
  • 卷号:12
  • 期号:6
  • 页码:1-13
  • DOI:10.1016/j.gsf.2021.101253
  • 语种:English
  • 出版社:Elsevier
  • 摘要:Graphical abstractDisplay OmittedHighlights•An index to evaluate the suitability of discretization methods (DMs).•Information Change Rate (ICR) ndex i proposed.•The ICR can identify rational DMs for spatially continuous variables.AbstractThe selection of a suitable discretization method (DM) to discretize spatially continuous variables (SCVs) is critical in ML-based natural hazard susceptibility assessment. However, few studies start to consider the influence due to the selected DMs and how to efficiently select a suitable DM for each SCV. These issues were well addressed in this study. The information loss rate (ILR), an index based on the information entropy, seems can be used to select optimal DM for each SCV. However, the ILR fails to show the actual influence of discretization because such index only considers the total amount of information of the discretized variables departing from the original SCV. Facing this issue, we propose an index, information change rate (ICR), that focuses on the changed amount of information due to the discretization based on each cell, enabling the identification of the optimal DM. We develop a case study with Random Forest (training/testing ratio of 7 : 3) to assess flood susceptibility in Wanan County, China. The area under the curve-based and susceptibility maps-based approaches were presented to compare the ILR and ICR. The results show the ICR-based optimal DMs are more rational than the ILR-based ones in both cases. Moreover, we observed the ILR values are unnaturally small (<1%), whereas the ICR values are obviously more in line with general recognition (usually 10%–30%). The above results all demonstrate the superiority of the ICR. We consider this study fills up the existing research gaps, improving the ML-based natural hazard susceptibility assessments.
  • 关键词:KeywordsMachine learningNatural hazardsInformation change rateDiscretization method
国家哲学社会科学文献中心版权所有