出版社:Academy & Industry Research Collaboration Center (AIRCC)
摘要:Stemming is the process of clipping off the affixes from the input word to obtain the respective root word, but it is not necessary that stemming provide us the genuine and meaningful root word. To overcome this problem we come up with a solution- Lemmatizer. It is the process by which we crave out the lemma from the given word and can also add additional rules to make the clipped word a proper stem. In this paper we have created an inflectional lemmatizer which generates the rules for extracting the suffixes and also added rules for generating a proper meaningful root word
关键词:Stemming; Lemmatization; Lemma; Hindi; Over-stemming and Under-stemming