首页    期刊浏览 2024年12月12日 星期四
登录注册

文章基本信息

  • 标题:AUTOMATED ARABIC ANTONYM EXTRACTION USING A CORPUS ANALYSIS TOOL
  • 本地全文:下载
  • 作者:LULUH ALDHUBAYI ; MAHA ALYAHYA
  • 期刊名称:Journal of Theoretical and Applied Information Technology
  • 印刷版ISSN:1992-8645
  • 电子版ISSN:1817-3195
  • 出版年度:2014
  • 卷号:70
  • 期号:3
  • 出版社:Journal of Theoretical and Applied
  • 摘要:The automatic extraction of semantic relations between words from textual corpora is an extremely challenging task. The increasing need for language resources supporting Natural language processing (NLP) applications has encouraged the development of automated methods for the extraction of semantic relations between words. The use of corpus statistical and similarity distribution methods can help in the task of semantic relation extraction between pairs of words. In this paper, we present a pattern-based bootstrapping approach using Arabic language corpora and a corpus analysis tool (Sketch Engine) to extract the semantic relations (antonyms) between word pairs. The algorithm uses LogDice and pattern co-occurrence to classify the extracted pairs into antonyms. Results of evaluation show that our approach is able to extract the antonym relations with a precision of 76%.
  • 关键词:Antonym Extraction; Sketch Engine; Arabic Lexicon; Semantic Relation; Arabic NLP
国家哲学社会科学文献中心版权所有