Journal: Journal of Theoretical and Applied Information Technology
Print ISSN: 1992-8645
Electronic ISSN: 1817-3195
Publication year: 2015
Volume: 79
Issue: 2
Publisher: Journal of Theoretical and Applied
Abstract: A writer's mother tongue is often said to strongly influence the pattern of a written document, but it does not guarantee that the resulting writing always follows a similar pattern. If similar patterns arise from intentional copying of documents, a detection tool is needed to identify the term patterns in those documents. This phenomenon motivates the present paper, which investigates pattern recognition of term occurrences in text documents by employing latent semantic analysis (LSA) coupled with the term distance between two documents. The study also describes how to determine the similarity of text documents, which in turn can be used for early plagiarism detection.
Keywords: Text Document; Pattern; Latent Semantic Analysis; Term Distance; Plagiarism.
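
The abstract names latent semantic analysis as the basis for comparing two documents but gives no implementation details. The following is a minimal sketch of generic LSA similarity, not the authors' method: it builds a raw term-count matrix, keeps the top k latent dimensions, and compares documents by cosine similarity in that reduced space. The function names, whitespace tokenization, raw-count weighting, and the choice k=2 are all assumptions made for illustration, and the paper's term-distance measure is not reproduced because the abstract does not define it. The truncated decomposition is obtained here by power iteration on the document Gram matrix so that the example needs only the standard library; a linear algebra library's SVD routine would normally be used instead.

```python
import math
from collections import Counter


def term_document_matrix(docs):
    """Rows are terms, columns are documents; entries are raw term counts."""
    counts = [Counter(doc.lower().split()) for doc in docs]
    vocab = sorted(set().union(*counts))
    return [[c[term] for c in counts] for term in vocab]


def top_eigenpairs(sym, k, iters=500):
    """Top-k eigenpairs of a symmetric PSD matrix (power iteration + deflation)."""
    n = len(sym)
    work = [[float(x) for x in row] for row in sym]
    pairs = []
    for _ in range(k):
        # Deterministic start vector, normalized to unit length.
        vec = [1.0 + 0.1 * j for j in range(n)]
        norm = math.sqrt(sum(x * x for x in vec))
        vec = [x / norm for x in vec]
        for _step in range(iters):
            nxt = [sum(work[r][c] * vec[c] for c in range(n)) for r in range(n)]
            norm = math.sqrt(sum(x * x for x in nxt))
            if norm < 1e-12:  # remaining matrix is numerically zero
                break
            vec = [x / norm for x in nxt]
        # Rayleigh quotient gives the eigenvalue for the converged vector.
        lam = sum(vec[r] * work[r][c] * vec[c] for r in range(n) for c in range(n))
        pairs.append((max(lam, 0.0), vec))
        # Deflate so the next pass finds the next-largest eigenpair.
        for r in range(n):
            for c in range(n):
                work[r][c] -= lam * vec[r] * vec[c]
    return pairs


def cosine(u, v):
    dot = sum(x * y for x, y in zip(u, v))
    nu = math.sqrt(sum(x * x for x in u))
    nv = math.sqrt(sum(x * x for x in v))
    return dot / (nu * nv) if nu and nv else 0.0


def lsa_similarity(docs, k=2):
    """Pairwise cosine similarity of documents in a k-dimensional latent space."""
    a = term_document_matrix(docs)
    n = len(docs)
    # Gram matrix A^T A; its eigenpairs give the right singular vectors of A.
    gram = [[sum(row[i] * row[j] for row in a) for j in range(n)] for i in range(n)]
    pairs = top_eigenpairs(gram, k)
    # Document j has coordinate sqrt(lambda_i) * v_i[j] on latent axis i.
    coords = [[math.sqrt(lam) * vec[j] for lam, vec in pairs] for j in range(n)]
    return [[cosine(coords[i], coords[j]) for j in range(n)] for i in range(n)]


if __name__ == "__main__":
    docs = [
        "the cat sat on the mat",
        "the cat sat on the mat",
        "stock markets fell sharply today",
    ]
    for row in lsa_similarity(docs, k=2):
        print([round(s, 3) for s in row])
```

In this sketch, a similarity near 1 between two documents flags them as candidates for closer inspection, while a value near 0 indicates no shared latent topic. Any threshold for early plagiarism detection would have to be chosen empirically.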