期刊名称:International Journal of Advances in Soft Computing and Its Applications
印刷版ISSN:2074-8523
出版年度:2015
卷号:7
期号:1-Special
出版社:International Center for Scientific Research and Studies
摘要:In this paper, a method is proposed to discriminate acronyms and their full names or expansions in scientific and technical literature abstracts by learning Wikipedia definition statements. Through this study, we aim to verify the effective utilization of an open knowledge base in the knowledge processing of domain-specific fields. Experimental results confirm that a noun phrase (NP)-type feature has better performance than a noun (NN) type feature in terms of precision rate. On the contrary, the results of measuring query response rate indicate that a single NN-type feature has better performance than an NP-type feature. We also verify that additional collocation information can contribute to improve the response rate. This study is mainly divided into three parts: 1) a process of sense discrimination is classified into many steps according to feature types; 2) the measured results are combined and processed; and 3) a data fusion-based incremental approach is proposed for sense discrimination. Through the method, we can adjust a precision rate to a certain level while considering classifier response rate
关键词:acronym; Na.ve Bayesian; sense disambiguation; text mining; ; classification; data fusion; response rate; Wikipedia