首页    期刊浏览 2024年12月02日 星期一
登录注册

文章基本信息

  • 标题:A comparison of multi-style DNN-based TTS approaches using small datasets
  • 本地全文:下载
  • 作者:Siniša Suzić ; Tijana Delić ; Vladimir Jovanović
  • 期刊名称:MATEC Web of Conferences
  • 电子版ISSN:2261-236X
  • 出版年度:2018
  • 卷号:161
  • DOI:10.1051/matecconf/201816103005
  • 语种:English
  • 出版社:EDP Sciences
  • 摘要:Studies have shown that people already perceive the interaction with computers, robots and media in the same way as they perceive social communication with other people. For that reason it is critical for a high-quality text-to-speech system (TTS) to sound as human-like as possible. However, a major obstacle in creating expressive TTS voices is that the amount of style-specific speech needed for training such a system is often not sufficient. This paper presents a comparison between different approaches to multi-style TTS, with focus on cases when only a small dataset per style is available. The described approaches have been originally proposed for efficient modelling of multiple speakers with a limited amount of data per speaker. Among the suggested approaches the approach based on style codes has emerged as the best, regardless of the target speech style.
国家哲学社会科学文献中心版权所有