文章基本信息

标题：Speech Silicon: An FPGA Architecture for Real-Time Hidden Markov-Model-Based Speech Recognition
本地全文：下载
作者：Jeffrey Schuster ; Kshitij Gupta ; Raymond Hoare 等
期刊名称：EURASIP Journal on Embedded Systems
印刷版ISSN：1687-3955
电子版ISSN：1687-3963
出版年度：2006
卷号：2006
DOI：10.1155/ES/2006/48085
出版社：Hindawi Publishing Corporation
摘要：
This paper examines the design of an FPGA-based system-on-a-chip capable of performing continuous speech recognition on medium sized vocabularies in real time. Through the creation of three dedicated pipelines, one for each of the major operations in the system, we were able to maximize the throughput of the system while simultaneously minimizing the number of pipeline stalls in the system. Further, by implementing a token-passing scheme between the later stages of the system, the complexity of the control was greatly reduced and the amount of active data present in the system at any time was minimized. Additionally, through in-depth analysis of the SPHINX 3 large vocabulary continuous speech recognition engine, we were able to design models that could be efficiently benchmarked against a known software platform. These results, combined with the ability to reprogram the system for different recognition tasks, serve to create a system capable of performing real-time speech recognition in a vast array of environments.