Journal:Journal of Theoretical and Applied Information Technology
Print ISSN:1992-8645
Electronic ISSN:1817-3195
Publication year:2015
Volume:76
Issue:2
Publisher:Journal of Theoretical and Applied
Abstract:Research on measuring the similarity of Indonesian-language documents is still limited. Most existing studies use the Karp-Rabin algorithm and string-matching methods, and the documents they examine are abstracts and articles of only one page. This study focuses on detecting similarity between Indonesian-language documents longer than one page, taking synonyms into account, and on measuring the processing speed. The system developed measures the similarity between a given document and other documents stored in an internal database. The similarity result is reported as a percentage. The speed result is reported as the detection speed achieved in processing the documents. The similarity and detection speed are calculated in the following steps: (i) examining the document titles, (ii) distributing the work, (iii) measuring document similarity, and (iv) measuring the speed of the similarity detection process. Tests were carried out on 15 Indonesian-language documents, each longer than one page. Document similarity and detection speed were evaluated on four types of documents. The study shows that the algorithm can check the similarity of documents with a maximum length of 56 pages. The speed measurement of the detection process was also successful.
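The abstract does not give the similarity formula, so the following is only a minimal sketch of the general idea: a synonym-aware similarity percentage together with a timing of the comparison. It assumes a Jaccard overlap of word sets, which is a swapped-in measure and not necessarily the one used in the study. The synonym table `SYNONYMS` and the function names `normalize`, `similarity_percent`, and `timed_similarity` are hypothetical and chosen for illustration.

```python
import re
import time

# Hypothetical toy synonym table (illustrative entries only, not from the study):
# each word maps to a canonical form so that synonyms compare as equal.
SYNONYMS = {
    "pandai": "pintar",
    "siswa": "murid",
}


def normalize(text):
    """Lowercase the text, extract alphabetic words, and map synonyms to one form."""
    words = re.findall(r"[a-z]+", text.lower())
    return {SYNONYMS.get(word, word) for word in words}


def similarity_percent(doc_a, doc_b):
    """Return the Jaccard overlap of the normalized word sets as a percentage.

    Assumption: Jaccard overlap stands in for the study's unspecified measure.
    """
    words_a, words_b = normalize(doc_a), normalize(doc_b)
    union = words_a | words_b
    if not union:
        return 0.0  # two empty documents: nothing to compare
    return 100.0 * len(words_a & words_b) / len(union)


def timed_similarity(doc_a, doc_b):
    """Return (similarity percentage, elapsed seconds) for one comparison."""
    start = time.perf_counter()
    score = similarity_percent(doc_a, doc_b)
    elapsed = time.perf_counter() - start
    return score, elapsed


if __name__ == "__main__":
    score, elapsed = timed_similarity("Murid itu pintar", "Siswa itu pandai")
    print(f"similarity: {score:.1f}%  time: {elapsed:.6f} s")
```

In this sketch the two example sentences differ only in synonymous words, so after normalization their word sets are identical. A comparison against an internal database, as the abstract describes, would repeat `timed_similarity` for each stored document.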