Basic Article Information

Title: A NOVEL APPROACH FOR SIMULTANEOUS GENDER AND HINDI VOWEL RECOGNITION USING A MULTIPLE-INPUT MULTIPLE-OUTPUT CO-ACTIVE NEURO-FUZZY INFERENCE SYSTEM
Authors: SACHIN LAKRA; T. V. PRASAD; G. RAMAKRISHNA et al.
Journal: Journal of Theoretical and Applied Information Technology
Print ISSN: 1992-8645
Electronic ISSN: 1817-3195
Year of publication: 2015
Volume: 79
Issue: 1
Publisher: Journal of Theoretical and Applied
Abstract: Human beings can simultaneously recognize the vowels in speech and the gender of a speaker in spite of high variability. However, machines have not been able to simultaneously overcome both gender variability and the vowel variability that exists in speech due to gender. This paper uses a Multiple-Input Multiple-Output Co-Active Neuro-Fuzzy Inference System to recognize both of these patterns in speech simultaneously. The features used as input for recognition are the pitch and the first three formant frequencies, extracted from speech samples recorded from 70 Indian speakers, 33 male and 37 female. Individual recognition of either gender or vowel has been achieved at a rate of 68% and 95%, respectively, whereas simultaneous recognition of both patterns has reached up to 66% for the training set. Thus, the combined approach is a novel, consolidated single-step method that can replace the two-step method in automatic speech recognition systems, where gender recognition is used as the first step of hierarchical decision-tree-based vowel recognition. This can prove significant in enhancing the performance of an automated speech recognition system by eliminating an additional step.
Keywords: Formant Frequency; Co-Active Neuro-Fuzzy Inference System; Gender Recognition; Vowel Recognition.