首页    期刊浏览 2024年12月12日 星期四
登录注册

文章基本信息

  • 标题:Ontological Lexicon Enrichment: The Badea System For Semi-Automated Extraction Of Antonymy Relations From Arabic Language Corpora
  • 本地全文:下载
  • 作者:Maha AlYahya ; Sawsan AlMalak ; Luluh Aldhubayi
  • 期刊名称:Malaysian Journal of Computer Science
  • 印刷版ISSN:0127-9084
  • 出版年度:2016
  • 卷号:29
  • 期号:1
  • 出版社:University of Malaya * Faculty of Computer Science and Information Technology
  • 摘要:language processing tools and applications; however, they are expensive to build, maintain, and extend. In this paper, we present the Badea system for the semiautomated extraction of lexical relations, specifically antonyms using a patternbased approach to support the task of ontological lexicon enrichment. The approach is based on an ontology of “seed” pairs of antonyms in the Arabic language; we identify patterns in which the pairs occur and then use the patterns identified to find new antonym pairs in an Arabic textual corpora. Experiments are conducted on Badea using texts from three Arabic textual corpuses: KSUCCA, KACSTAC, and CAC. The system is evaluated and the patterns’ reliability and system performance is measured. The results from our experiments on the three Arabic corpora show that the patternbased approach can be useful in the ontological enrichment task, as the evaluation of the system resulted in the ontology being updated with over 300 new antonym pairs, thereby enriching the lexicon and increasing its size by over 400%. Moreover, the results show important findings on the reliability of patterns in extracting antonyms for Arabic. The Badea system will facilitate the enrichment of ontological lexicons that can be very useful in any Arabic natural language processing system that requires semantic relation extraction.
  • 关键词:Antonym Extraction; Ontology; Arabic Lexicon; Semantic Relation; Arabic NLP
国家哲学社会科学文献中心版权所有