TY - GEN
T1 - Parametrized stochastic grammars for RNA secondary structure prediction
AU - Maier, Robert S.
PY - 2007
Y1 - 2007
AB - We propose a two-level stochastic context-free grammar (SCFG) architecture for parametrized stochastic modeling of a family of RNA sequences, including their secondary structure. A stochastic model of this type can be used for maximum a posteriori estimation of the secondary structure of any new sequence in the family. The proposed SCFG architecture models RNA subsequences comprising paired bases as stochastically weighted Dyck-language words, i.e., as weighted balanced-parenthesis expressions. The length of each run of unpaired bases, forming a loop or a bulge, is taken to have a phase-type distribution: that of the hitting time in a finite-state Markov chain. Without loss of generality, each such Markov chain can be taken to have a bounded complexity. The scheme yields an overall family SCFG with a manageable number of parameters.
UR - http://www.scopus.com/inward/record.url?scp=48049112771&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=48049112771&partnerID=8YFLogxK
U2 - 10.1109/ITA.2007.4357589
DO - 10.1109/ITA.2007.4357589
M3 - Conference contribution
AN - SCOPUS:48049112771
SN - 9780615153148
T3 - 2007 Information Theory and Applications Workshop, Conference Proceedings, ITA
SP - 256
EP - 260
BT - 2007 Information Theory and Applications Workshop, Conference Proceedings, ITA
T2 - 2007 Information Theory and Applications Workshop, ITA
Y2 - 29 January 2007 through 2 February 2007
ER -