首页    期刊浏览 2024年11月30日 星期六
登录注册

文章基本信息

  • 标题:RNA Pseudoknotted Structure Prediction Using Stochastic Multiple Context-Free Grammar
  • 本地全文:下载
  • 作者:Yuki Kato ; Hiroyuki Seki ; Tadao Kasami
  • 期刊名称:Information and Media Technologies
  • 电子版ISSN:1881-0896
  • 出版年度:2007
  • 卷号:2
  • 期号:1
  • 页码:79-88
  • DOI:10.11185/imt.2.79
  • 出版社:Information and Media Technologies Editorial Board
  • 摘要:Many attempts have so far been made at modeling RNA secondary structure by formal grammars. In a grammatical approach, secondary structure prediction can be viewed as parsing problem. However, there may be many different derivation trees for an input sequence. Thus, it is necessary to have a method of extracting biologically realistic derivation trees among them. One solution to this problem is to extend a grammar to a probabilistic model and find the most likely derivation tree, and another is to take free energy minimization into account. One simple formalism for describing RNA folding is context-free grammars(CFGs), but it is known that CFGs cannot represent pseudoknots. Therefore, several formal grammars have been proposed for modeling RNA pseudoknotted structure. In this paper, we focus on multiple context-free grammars (MCFGs), which are natural extension of CFGs and can represent pseudoknots, and extend MCFGs to a probabilistic model called stochastic MCFG (SMCFG). We present a polynomial time parsing algorithm for finding the most probable derivation tree, which is applicable to RNA secondary structure prediction including pseudoknots. Also, we propose a probability parameter estimation algorithm based on the EM (expectation maximization) algorithm. Finally, we show some experimental results on RNA pseudoknot prediction using the SMCFG parsing algorithm, which show good prediction accuracy.
国家哲学社会科学文献中心版权所有