首页    期刊浏览 2024年11月30日 星期六
登录注册

文章基本信息

  • 标题:Nearest Neighbor-Based Indonesian G2P Conversion
  • 本地全文:下载
  • 作者:Suyanto Suyanto ; Agus Harjoko
  • 期刊名称:TELKOMNIKA (Telecommunication Computing Electronics and Control)
  • 印刷版ISSN:2302-9293
  • 出版年度:2014
  • 卷号:12
  • 期号:2
  • 页码:389-396
  • DOI:10.12928/telkomnika.v12i2.57
  • 语种:English
  • 出版社:Universitas Ahmad Dahlan
  • 摘要:Grapheme-to-phoneme conversion (G2P), also known as letter-to-sound conversion, is an important module in both speech synthesis and speech recognition. The methods of G2P give varying accuracies for different languages although they are designed to be language independent. This paper discusses a new model based on pseudo nearest neighbor rule (PNNR) for Indonesian G2P. In this model, partial orthogonal binary code for graphemes, contextual weighting, and neighborhood weighting are introduced. Testing to 9,604 unseen words shows that the model parameters are easy to be tuned to reach high accuracy. Testing to 123 sentences containing homographs shows that the model could disambiguate homographs if it uses long graphemic context. Compare to information gain tree, PNNR gives slightly higher phoneme error rate, but it could disambiguate homographs.
  • 关键词:grapheme-to-phoneme conversion, Indonesian language, pseudo nearest neighbor, partial orthogonal binary code, contextual weighting
国家哲学社会科学文献中心版权所有