期刊名称:International Journal of Advanced Computer Science and Applications(IJACSA)
印刷版ISSN:2158-107X
电子版ISSN:2156-5570
出版年度:2021
卷号:12
期号:4
页码:429-436
DOI:10.14569/IJACSA.2021.0120455
出版社:Science and Information Society (SAI)
摘要:Computational Linguistic refers to the development of the computer systems that deal with human languages. In this paper, different Computational Linguistic Techniques such as Parts of Speech (POS) tagger, ‘Sandhi’ Splitter, ‘Alankaar’ Finder and ‘Samaas’ Finder were considered. After a thorough literature review, it was found that fifteen techniques were used for POS tagging, nine techniques were used for ‘Sandhi’ splitting, one work is done for ‘Alankaar’ finder and absolutely no techniques are available for ‘Samaas’ finder for the Indo-Aryan as well as Dravidian languages. Analysis shows that Rule Based Approach (RBA) and Hidden Markov Model (HMM) are frequently used for POS tagging, RBA is most frequently used for ‘Sandhi’ Splitter, the general Human Intelligence (HI) is used for ‘Alankaar’ Finder and no ‘Samaas’ finder technique is available for any Indian language.
关键词:‘Alankaar’; ‘samaas’; ‘sandhi’; parts of speech tagger (POST)