期刊名称:Journal of Computing and Information Technology
印刷版ISSN:1330-1136
电子版ISSN:1846-3908
出版年度:1996
卷号:4
期号:1
页码:1-8
语种:English
出版社:SRCE - Sveučilišni računski centar
摘要:We present two concepts for systems with language identification in the context of multilingual information retrieval dialogues. The first one has an explicit module for language identification. It is based on training a codebook for each language, running the language specific vector quantizers in parallel and integrating over the output probability of the best alternative in each language. The system can decide for one language either after a predefined time interval or if the difference between the probabilities of the languages succeeds a certain threshold . T his approach allows to recognize languages that the system cannot process and give out a prerecorded message in that language. In the second approach, the trained recognizers of the languages to be recognized, the lexicons, and the language models are combined to one multilingual recognizer. Only allowing transitions between the words from one language, each hypothesized word chain contains only words from one language and language identification is an implicit byproduct of the speech recognizer. First results for the explicit language identification are presented.