TY - GEN
T1 - Estimating the accuracy of multiple alignments and its use in parameter advising
AU - Deblasio, Dan F.
AU - Wheeler, Travis J.
AU - Kececioglu, John D.
N1 - Funding Information:
Research supported by US NSF Grant IIS-1050293 and DGE-0654435.
PY - 2012
Y1 - 2012
N2 - We develop a novel and general approach to estimating the accuracy of protein multiple sequence alignments without knowledge of a reference alignment, and use our approach to address a new problem that we call parameter advising. For protein alignments, we consider twelve independent features that contribute to a quality alignment. An accuracy estimator is learned that is a polynomial function of these features; its coefficients are determined by minimizing its error with respect to true accuracy using mathematical optimization. We evaluate this approach by applying it to the task of parameter advising: the problem of choosing alignment scoring parameters from a collection of parameter values to maximize the accuracy of a computed alignment. Our estimator, which we call Facet (for "feature-based accuracy estimator"), yields a parameter advisor that on the hardest benchmarks provides more than a 20% improvement in accuracy over the best default parameter choice, and outperforms the best prior approaches to selecting good alignments for parameter advising.
AB - We develop a novel and general approach to estimating the accuracy of protein multiple sequence alignments without knowledge of a reference alignment, and use our approach to address a new problem that we call parameter advising. For protein alignments, we consider twelve independent features that contribute to a quality alignment. An accuracy estimator is learned that is a polynomial function of these features; its coefficients are determined by minimizing its error with respect to true accuracy using mathematical optimization. We evaluate this approach by applying it to the task of parameter advising: the problem of choosing alignment scoring parameters from a collection of parameter values to maximize the accuracy of a computed alignment. Our estimator, which we call Facet (for "feature-based accuracy estimator"), yields a parameter advisor that on the hardest benchmarks provides more than a 20% improvement in accuracy over the best default parameter choice, and outperforms the best prior approaches to selecting good alignments for parameter advising.
UR - http://www.scopus.com/inward/record.url?scp=84860793205&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84860793205&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-29627-7_5
DO - 10.1007/978-3-642-29627-7_5
M3 - Conference contribution
AN - SCOPUS:84860793205
SN - 9783642296260
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 45
EP - 59
BT - Research in Computational Molecular Biology - 16th Annual International Conference, RECOMB 2012, Proceedings
T2 - 16th Annual International Conference on Research in Computational Molecular Biology, RECOMB 2012
Y2 - 21 April 2012 through 24 April 2012
ER -