首页    期刊浏览 2024年11月30日 星期六
登录注册

文章基本信息

  • 标题:Modified Grapheme Encoding and Phonemic Rule to Improve PNNR-Based Indonesian G2P
  • 本地全文:下载
  • 作者:Suyanto ; Sri Hartati ; Agus Harjoko
  • 期刊名称:International Journal of Advanced Computer Science and Applications(IJACSA)
  • 印刷版ISSN:2158-107X
  • 电子版ISSN:2156-5570
  • 出版年度:2016
  • 卷号:7
  • 期号:3
  • DOI:10.14569/IJACSA.2016.070358
  • 出版社:Science and Information Society (SAI)
  • 摘要:A grapheme-to-phoneme conversion (G2P) is very important in both speech recognition and synthesis. The existing Indonesian G2P based on pseudo nearest neighbour rule (PNNR) has two drawbacks: the grapheme encoding does not adapt all Indonesian phonemic rules and the PNNR should select a best phoneme from all possible conversions even though they can be filtered by some phonemic rules. In this paper, a modified partial orthogonal binary grapheme encoding and a phonemic-based rule are proposed to improve the performance of PNNR-based Indonesian G2P. Evaluating on 5-fold cross-validation, contain 40K words to develop the model and 10K words to evaluation each, shows that both proposed concepts reduce the relative phoneme error rate (PER) by 13.07%. A more detail analysis shows the most errors are from grapheme ?e? that can be dynamically converted into either /E/ or /??/ since four prefixes, ’ber’, ’me’, ’per’, and ’ter’, produce many ambiguous conversions with basic words and also from some similar compound words with both different pronunciations for the grapheme ?e?. A stemming procedure can be applied to reduce those errors.
  • 关键词:thesai; IJACSA; thesai.org; journal; IJACSA papers; Modified grapheme encoding; phonemic rule; In-donesian grapheme-to-phoneme conversion; pseudo nearest neigh-bour rule
国家哲学社会科学文献中心版权所有