期刊名称:International Journal of Advanced Computer Science and Applications(IJACSA)
印刷版ISSN:2158-107X
电子版ISSN:2156-5570
出版年度:2019
卷号:10
期号:12
页码:219-225
出版社:Science and Information Society (SAI)
摘要:Stemming in each language has a different process
and is determined according to the structure of the language.
Stemming is mostly used as a complete step in the processing of
words and phrases. There are many stemming algorithms
available, and some used as a process for word processing. One
function of stemming is to detect word errors in Indonesian. In
this study, researchers created the Indonesian words error
detection system using Nazief and Adriani algorithm. In the trials
conducted, the system will accept text input obtained from the
user. Then the system will preprocess the text. In this study, there
are three stages of preprocessing, namely tokenization, case
folding, and filtering. After the stages in preprocessing are
finished, the system will call each word for the process of
stemming. The results of the stemming will be compared with the
base words available in the database. If it does not match, then
the word is highlighted and is considered an error word. The first
finding is the Nazief Adriani's algorithm can be able to detect
words error until 100%. The second finding is the Nazief
Adriani's algorithm also detect non-words error, the accuracy of
detecting is 97.464%.
关键词:Indonesian; word error; stemming; Nazief and
Adriani stemmer algorithm; detection system