期刊名称:International Journal of Computer Science Issues
印刷版ISSN:1694-0784
电子版ISSN:1694-0814
出版年度:2014
卷号:11
期号:3
出版社:IJCSI Press
摘要:The application of text classification systems on biomedical literature aims to select articles relevant to a specific issue from large corpora. As the amount of online biomedical literature grows, the task of finding relevant information becomes very complicated, due to the difficulties in browsing and searching the relevant information through the web. Ontology is useful for organizing and navigating the Web sites and also for improving the accuracy of Web searches. It provides a shared understanding of domain, to overcome differences in terminology such as synonym, term variants and terms ambiguity. However, one of the problems raised in ontology is the maintenance of these bases of concepts. Therefore, we investigate and propose ontology enrichment algorithm as one of the methods to modify an existing ontology. In this research, we present a new ontology enrichment algorithm for assigning or associating each concept in the training ontology with the relevant and informative features from biomedical information sources. Experiments are conducted to extract and select the meaningful features from different information sources such as the OHSUMED dataset, Medical Subject Heading (MeSH) terms and heart disease glossaries. Then, we expand these features into the training ontology. Finally, we evaluate the performance of our proposed ontology enrichment algorithm in classifying biomedical text abstracts. The results demonstrate that the macro-average for precision, recall and F measure are improved by employing ontology enrichment algorithm.
关键词:MeSH; OHSUMED; Ontology Enrichment; Text Classification; Text Mining