期刊名称:International Journal on Computer Science and Engineering
印刷版ISSN:2229-5631
电子版ISSN:0975-3397
出版年度:2012
卷号:4
期号:05
页码:707-710
出版社:Engg Journals Publications
摘要:In this paper we have proposed an automatic speech recognition framework using agents. In this we have included both audio recognition and visual recognition. The audio and visual modalities are complementary to each other and the combination of the two can improve the accuracy in affective user models. The audio features extracted are processed by audition agent. The visual processing agent takes care of the lip and face detection. Finally both these agents assist audio visual fusion agent in fusion of these modalities for automatic speech recognition.
关键词:Agents; Audio-visual; Speech recognition; Face detection; Lip motion; Framework