Article information

  • Title: Design of a Rule Based Hindi Lemmatizer
  • Authors: Snigdha Paul; Mini Tandon; Nisheeth Joshi
  • Journal: Computer Science & Information Technology
  • Electronic ISSN: 2231-5403
  • Publication year: 2013
  • Volume: 3
  • Issue: 4
  • Pages: 67-74
  • DOI:10.5121/csit.2013.3408
  • Publisher: Academy & Industry Research Collaboration Center (AIRCC)
  • Abstract: Stemming is the process of clipping affixes from an input word to obtain its root word, but stemming does not necessarily yield a genuine, meaningful root. To overcome this problem, we propose a lemmatizer. Lemmatization is the process by which we carve out the lemma from a given word, and it allows additional rules to be applied so that the clipped word becomes a proper stem. In this paper we create an inflectional lemmatizer that generates rules for extracting suffixes, and we add further rules for generating a proper, meaningful root word.
  • Keywords: Stemming; Lemmatization; Lemma; Hindi; Over-stemming and Under-stemming