Article Information

Title: Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities?
Authors: Héctor Delgado; Anna Matamala; Javier Serrano; et al.
Journal: Cadernos de Tradução
Print ISSN: 2175-7968
Publication year: 2015
Volume: 35
Issue: 2
Pages: 308-324
DOI: 10.5007/2175-7968.2015v35n2p308
Publisher: Universidade Federal de Santa Catarina
Abstract: This article presents an overview of the technological components used in the process of audio description, and suggests a new scenario in which speech recognition, machine translation, and text-to-speech, with the corresponding human revision, could be used to increase audio description provision. The article focuses on a process in which both speaker diarization and speech recognition are used in order to obtain a semi-automatic transcription of the audio description track. The technical process is presented and experimental results are summarized.
Keywords (Spanish): Audiodescripción; Accesibilidad; Diarización; Reconocimiento de Habla; Tecnología
Keywords (English): Audio Description; Accessibility; Speaker Diarization; Speech Recognition; Technology