首页    期刊浏览 2025年02月28日 星期五
登录注册

文章基本信息

  • 标题:The robust nearest shrunken centroids classifier for high-dimensional heavy-tailed data
  • 本地全文:下载
  • 作者:Shaokang Ren ; Qing Mai
  • 期刊名称:Electronic Journal of Statistics
  • 印刷版ISSN:1935-7524
  • 出版年度:2022
  • 卷号:16
  • 期号:1
  • 页码:3343-3384
  • DOI:10.1214/22-EJS2022
  • 语种:English
  • 出版社:Institute of Mathematical Statistics
  • 摘要:The nearest shrunken centroids classifier (NSC) is a popular high-dimensional classifier. However, it is prone to inaccurate classification when the data is heavy-tailed. In this paper, we develop a robust generalization of NSC (RNSC) which remains effective under such circumstances. By incorporating the Huber loss both in the estimation and the calculation of the score function, we reduce the impacts of heavy tails. We rigorously show the variable selection, estimation, and prediction consistency in high dimensions under weak moment conditions. Empirically, our proposal greatly outperforms NSC and many other successful classifiers when data is heavy-tailed while remaining comparable to NSC in the absence of heavy tails. The favorable performance of RNSC is also demonstrated in a real data example.
  • 关键词:62H30;62J07;heavy-tailed data;High-dimensional classification;Huber loss;nearest shrunken centroids classifier;robust estimator
国家哲学社会科学文献中心版权所有