期刊名称:International Journal of Computer Science, Engineering and Applications (IJCSEA)
印刷版ISSN:2231-0088
电子版ISSN:2230-9616
出版年度:2011
卷号:1
期号:3
出版社:Academy & Industry Research Collaboration Center (AIRCC)
摘要:Speech is the most natural way of information exchange. It provides an efficient means of means of manmachine communication using speech interfacing. Speech interfacing involves speech synthesis and speech recognition. Speech recognition allows a computer to identify the words that a person speaks to a microphone or telephone. The two main components, normally used in speech recognition, are signal processing component at front-end and pattern matching component at back-end. In this paper, a setup that uses Mel frequency cepstral coefficients at front-end and artificial neural networks at back-end has been developed to perform the experiments for analyzing the speech recognition performance. Various experiments have been performed by varying the number of layers and type of network transfer function, which helps in deciding the network architecture to be used for acoustic modelling at back end.