Journal: International Journal of Innovative Research in Computer and Communication Engineering
Print ISSN: 2320-9798
Electronic ISSN: 2320-9801
Year of publication: 2016
Volume: 4
Issue: 3
Pages: 4344
DOI: 10.15680/IJIRCCE.2016.0403199
Publisher: S&S Publications
Abstract: Bilingual dictionaries have always been an important source of query translations in Cross Language Information Retrieval. Among other issues, dictionary-based translation suffers from the ambiguity problem. To resolve this issue, several recent works have recommended the use of term co-occurrence statistics. The same concept, with a major modification, is the focus of the work described here. Our work is based on the fact that not all terms in a query have the same discriminating power. To account for this, our algorithm gives more weight to discriminating terms in the query and treats co-occurrences of useful terms as more valuable than those of frequent terms. The paper also takes local context into account when formulating the co-occurrence statistics. In the experiments, our method achieved 85% of monolingual performance in terms of mean average precision (MAP). The results are quite encouraging compared to other methods used for cross language information retrieval for Indian languages.
Keywords: Cross Language Information Retrieval; Query Translation; Disambiguation; Co-occurrence
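
The abstract describes the approach only in outline, so the following is a minimal Python sketch of the general idea, not the paper's actual formula. It assumes a smoothed IDF as the measure of discriminating power, a fixed token window as the "local context", and a toy English corpus. The names `build_stats`, `idf`, `disambiguate`, `term_a`, and `term_b` are hypothetical and chosen for illustration.

```python
import math
from collections import Counter


def build_stats(docs, window=3):
    """Count document frequencies and windowed co-occurrences in a target-language corpus."""
    df = Counter()
    cooc = Counter()
    for tokens in docs:
        df.update(set(tokens))
        for i, left in enumerate(tokens):
            # Local context (assumption): pair only terms at most `window` positions apart.
            for right in tokens[i + 1 : i + 1 + window]:
                if left != right:
                    cooc[frozenset((left, right))] += 1
    return df, cooc


def idf(term, df, n_docs):
    # Smoothed inverse document frequency: rarer (more discriminating) terms score higher.
    return math.log((n_docs + 1) / (df.get(term, 0) + 1)) + 1.0


def disambiguate(candidates, df, cooc, n_docs):
    """Pick one translation per source term.

    candidates maps each source query term to its dictionary translations.
    Each option is scored by its co-occurrence with the candidates of the
    other query terms, weighted by the IDF of the co-occurring candidate
    (an assumed weighting, not taken from the paper).
    """
    chosen = {}
    for src, options in candidates.items():
        def score(option):
            total = 0.0
            for other_src, other_options in candidates.items():
                if other_src == src:
                    continue
                for other in other_options:
                    total += idf(other, df, n_docs) * cooc[frozenset((option, other))]
            return total

        chosen[src] = max(options, key=score)
    return chosen


# Toy target-language corpus and a hypothetical bilingual dictionary lookup.
docs = [
    ["bank", "approved", "loan"],
    ["loan", "from", "bank"],
    ["river", "shore", "walk"],
    ["the", "bank", "loan", "rate"],
]
df, cooc = build_stats(docs, window=3)
candidates = {"term_a": ["shore", "bank"], "term_b": ["loan"]}
chosen = disambiguate(candidates, df, cooc, n_docs=len(docs))
print(chosen)
```

In this toy setup the ambiguous source term `term_a` resolves to "bank", because "bank" co-occurs with "loan" inside the window while "shore" never does. A real system would estimate the statistics from a large target-language collection.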