首页    期刊浏览 2024年12月03日 星期二
登录注册

文章基本信息

  • 标题:An Optimum Database for Isolated Word in Speech Recognition System
  • 本地全文:下载
  • 作者:Syifaun Nafisah ; Oyas Wahyunggoro ; Lukito Edi Nugroho
  • 期刊名称:TELKOMNIKA (Telecommunication Computing Electronics and Control)
  • 印刷版ISSN:2302-9293
  • 出版年度:2016
  • 卷号:14
  • 期号:2
  • 页码:588-597
  • DOI:10.12928/telkomnika.v14i2.2353
  • 语种:English
  • 出版社:Universitas Ahmad Dahlan
  • 摘要:Speech recognition system (ASR) is a technology that allows computers receive the input using the spoken words. This technology requires sample words in the pattern matching process that is stored in the database. There is no reference as the fundamental theory to develop database in ASR. So, the research of database development to optimize the performance of the system is required. Mel-scale frequency cepstral coefficients (MFCCs) is used to extract the characteristics of speech signal and backpropagation neural network in quantized vector is used to evaluate likelihood the maximum log values to the nearest pattern in the database. The results shows the robustness of ASR is optimum using 140 samples of data reference for each word with an average of accuracy is 99.95% and duration process is 27.4 msec. The investigation also reported the gender doesn’t have significantly influence to the accuracy. From these results it concluded that the performance of ASR can be increased by optimizing the database.
  • 其他摘要:Speech recognition system (ASR) is a technology that allows computers receive the input using the spoken words. This technology requires sample words in the pattern matching process that is stored in the database. There is no reference as the fundamental theory to develop database in ASR. So, the research of database development to optimize the performance of the system is required.  Mel-scale frequency cepstral coefficients (MFCCs) is used to extract the characteristics of speech signal and backpropagation neural network in quantized vector is used to evaluate likelihood the maximum log values to the nearest pattern in the database.  The results shows the robustness of ASR is optimum using 140 samples of data reference for each word with an average of accuracy is 99.95% and duration process is 27.4 msec.  The investigation also reported the gender doesn’t have significantly influence to the accuracy.  From these results it concluded that the performance of ASR can be increased by optimizing the database.
  • 关键词:Optimum; Database; ASR; Backpropagation; MFCCs
国家哲学社会科学文献中心版权所有