Minimizing and learning energy functions for side-chain prediction.

作者: Chen Yanover , Ora Schueler-Furman , Yair Weiss

DOI: 10.1089/CMB.2007.0158

关键词: Function (mathematics)Energy functionalSimulated annealingConditional random fieldApproximate inferenceAlgorithmComputer scienceBenchmark (computing)Energy (signal processing)Belief propagation

摘要: Side-chain prediction is an important subproblem of the general protein folding problem. Despite much progress in side-chain prediction, performance far from satisfactory. As example, ROSETTA program that uses simulated annealing to select minimum energy conformations, correctly predicts first two angles for approximately 72% buried residues a standard data set. Is further improvement more likely come better search methods, or functions? Given exact minimization NP hard, it difficult get systematic answer this question. In paper, we present novel method and learning functions training are both based on Tree Reweighted Belief Propagation (TRBP). We find TRBP can obtain global optimum function few minutes computation 85% proteins benchmark also effectively bound partition which enables using Conditional Random Fields (CRF) framework learning. Interestingly, finding does not significantly improve ROSETTA's default terms (less than 0:1%), while new weights gives significant boost 78%. Using recently modified with softer Lennard-Jones repulsive term, accuracy 77% Here again, improves modeling even 80%. Finally, highest (82.6%) obtained extended rotamer library CRF learned weights. Our results suggest combining machine approximate inference state-of-the-art prediction.

参考文章(19)
Ronald W Peterson, P Leslie Dutton, A Joshua Wand, Improved side-chain prediction accuracy using an ab initio potential energy function and a very large rotamer library Protein Science. ,vol. 13, pp. 735- 751 ,(2004) , 10.1110/PS.03250104
Johan Desmet, Marc De Maeyer, Bart Hazes, Ignace Lasters, THE DEAD-END ELIMINATION THEOREM AND ITS USE IN PROTEIN SIDE-CHAIN POSITIONING Nature. ,vol. 356, pp. 539- 542 ,(1992) , 10.1038/356539A0
John Lafferty, Xiaojin Zhu, Yan Liu, Kernel conditional random fields: representation and clique selection international conference on machine learning. pp. 64- ,(2004) , 10.1145/1015330.1015337
ANDREW LEAVER-FAY, BRIAN KUHLMAN, JACK SNOEYINK, An adaptive dynamic programming algorithm for the side chain placement problem. pacific symposium on biocomputing. pp. 16- 27 ,(2004) , 10.1142/9789812702456_0003
Adrian A. Canutescu, Andrew A. Shelenkov, Roland L. Dunbrack, A graph-theory algorithm for rapid protein side-chain prediction Protein Science. ,vol. 12, pp. 2001- 2014 ,(2003) , 10.1110/PS.03154503
Y. Liu, B. Kuhlman, RosettaDesign server for protein design Nucleic Acids Research. ,vol. 34, pp. 235- 238 ,(2006) , 10.1093/NAR/GKL163
Roland L. Dunbrack, Martin Karplus, Backbone-dependent Rotamer Library for Proteins Application to Side-chain Prediction Journal of Molecular Biology. ,vol. 230, pp. 543- 574 ,(1993) , 10.1006/JMBI.1993.1170
C. L. Kingsford, B. Chazelle, M. Singh, Solving and analyzing side-chain positioning problems using linear and integer programming Bioinformatics. ,vol. 21, pp. 1028- 1039 ,(2005) , 10.1093/BIOINFORMATICS/BTI144
M.J. Wainwright, T.S. Jaakkola, A.S. Willsky, MAP estimation via agreement on trees: message-passing and linear programming IEEE Transactions on Information Theory. ,vol. 51, pp. 3697- 3717 ,(2005) , 10.1109/TIT.2005.856938