首页    期刊浏览 2024年12月11日 星期三
登录注册

文章基本信息

  • 标题:Bio-NER: Biomedical Named Entity Recognition using Rule-Based and Statistical Learners
  • 本地全文:下载
  • 作者:Pir Dino Soomro ; Sanotsh Kumar ; Banbhrani
  • 期刊名称:International Journal of Advanced Computer Science and Applications(IJACSA)
  • 印刷版ISSN:2158-107X
  • 电子版ISSN:2156-5570
  • 出版年度:2017
  • 卷号:8
  • 期号:12
  • DOI:10.14569/IJACSA.2017.081220
  • 出版社:Science and Information Society (SAI)
  • 摘要:The purpose of extracting of Bio-Medical Entities is to recognize the particular entities, whether word or phrases, from the unstructured data contained in the text. This work proposes different approaches and methods, i.e. Machine Learning Hybrid Classification, Rule Based Non-tested Generalized Exemplars and Partial Decision Tree (PART) Learners for Bio-Medical Named Entity Recognition. The Prime objective is to consider, preferably, simple characteristics, such as, affixes and context. In addition, orthographic, Parts of Speech (POS) tags and N-grams are given secondary importance as for as their comparison with affixes and context is concerned. Further, for the very purpose of Bio-medical Diseased Named Recognition, proposal of Rule Based Classifiers along with the Statistical Machine Learning is given. Also, this paper proposes the blend of both preceding methods that jointly construct Hybrid Classification algorithm. Precision, Recall and F-measure – standard metrics- has been put into practice for the evaluation. The results prove that the technique used has far better performance results than the method used before - state-of-art Disease NER (Named Entity Recognition).
  • 关键词:Bio-medical text mining; machine learning; named entity recognition; naive bayesian; rule-based classifier; information extraction
国家哲学社会科学文献中心版权所有