首页    期刊浏览 2025年02月22日 星期六
登录注册

文章基本信息

  • 标题:A Novel Method of Chinese Electronic Medical Records Entity Labeling Based on BIC model
  • 本地全文:下载
  • 作者:Yifan Wang ; Guowei Teng ; Xuehai Ding
  • 期刊名称:Journal of Software
  • 印刷版ISSN:1796-217X
  • 出版年度:2021
  • 卷号:16
  • 期号:1
  • 页码:24-38
  • DOI:10.17706/jsw.16.1.24-38
  • 语种:English
  • 出版社:Academy Publisher
  • 摘要:In the field of bio-medicine, mass data are generated every day, such as Chinese electronic medical record (EMR), containing massive medical terminology and specific categories of entities. The way to analyze and obtain effective information from these sparse data is a difficulty in research. As the foundation of analyzing huge amount of biomedical text data, Named Entity Recognition (NER) is essential in Natural Language Processing (NLP) complementing with effective labeling data. One of the two basic sequence labeling methods is rule-based bulk corpus tagging, requiring domain experts to establish targeted recognition rule base. However, in the application field, this method is single, and the portability does not make the expectation, bringing great limitations; The other is complete manual labeling, but it is time-consuming and laborious. Based on Bidirectional Long Short-Term Memory network (BiLSTM), Iterated Dilated Convolution Neural Network (IDCNN) and Conditional Random Field (CRF), we proposed the BIC model. This paper proposes a method for EMR entity labeling based on BIC model, realizing automatic annotation of Chinese EMR data. Machine labeling data can be used after manual review, which will reduce the workload of manual labeling bestially. Compared with other models, F1 value of BIC model reached 91.90% in CCKS2017 dataset, and 78% in PACS report data. Experiments show that our method is superior to the others.
  • 关键词:Chinese electronic medical record; named entity recognition; sequence labeling; BIC model; neural network.
国家哲学社会科学文献中心版权所有