文章基本信息

标题：An Analysis of Speech Recognition Performance Based Upon Network Layers and Transfer Functions
本地全文：下载
作者：Kuldeep Kumar ; R. K. Aggarwal ; Ankita Jain 等
期刊名称：International Journal of Computer Science, Engineering and Applications (IJCSEA)
印刷版ISSN：2231-0088
电子版ISSN：2230-9616
出版年度：2011
卷号：1
期号：3
出版社：Academy & Industry Research Collaboration Center (AIRCC)
摘要：Speech is the most natural way of information exchange. It provides an efficient means of means of manmachine communication using speech interfacing. Speech interfacing involves speech synthesis and speech recognition. Speech recognition allows a computer to identify the words that a person speaks to a microphone or telephone. The two main components, normally used in speech recognition, are signal processing component at front-end and pattern matching component at back-end. In this paper, a setup that uses Mel frequency cepstral coefficients at front-end and artificial neural networks at back-end has been developed to perform the experiments for analyzing the speech recognition performance. Various experiments have been performed by varying the number of layers and type of network transfer function, which helps in deciding the network architecture to be used for acoustic modelling at back end.
关键词：Speech recognition; Mel frequency cepstral coefficients; Artificial neural networks; Network layer;Transfer function.