Accuracy of Genomic Prediction in Switchgrass (Panicum virgatum L.) Improved by Accounting for Linkage Disequilibrium

作者: Guillaume P. Ramstein , Joseph Evans , Shawn M. Kaeppler , Robert B. Mitchell , Kenneth P. Vogel

DOI: 10.1534/G3.115.024950

关键词:

摘要: Switchgrass is a relatively high-yielding and environmentally sustainable biomass crop, but further genetic gains in yield must be achieved to make it an economically viable bioenergy feedstock. Genomic selection (GS) attractive technology generate rapid switchgrass, meet the goals of substantial displacement petroleum use with biofuels near future. In this study, we empirically assessed prediction procedures for genomic two different populations, consisting 137 110 half-sib families tested locations United States three agronomic traits: dry matter yield, plant height, heading date. Marker data were produced families’ parents by exome capture sequencing, generating up 141,030 polymorphic markers available genomic-location annotation information. We evaluated that varied not only learning schemes models, also way preprocessed account redundancy marker More complex generally significantly more accurate than simplest procedure, likely due limited population sizes. Nevertheless, highly significant gain accuracy was transforming through correlation matrix. Our results suggest marker-data transformations and, generally, linkage disequilibrium among markers, offer valuable opportunities improving GS. Some accuracies should motivate implementation GS switchgrass breeding programs.

参考文章(74)
Matthew Wiener, Andy Liaw, Classification and Regression by randomForest ,(2007)
Dan He, Zhanyong Wang, Laxmi Parida, Data-driven encoding for quantitative genetic trait prediction. BMC Bioinformatics. ,vol. 16, pp. 1- 9 ,(2015) , 10.1186/1471-2105-16-S1-S10
G. Guo, M.S. Lund, Y. Zhang, G. Su, Comparison between genomic predictions using daughter yield deviation and conventional estimated breeding value as response variables Journal of Animal Breeding and Genetics. ,vol. 127, pp. 423- 432 ,(2010) , 10.1111/J.1439-0388.2010.00878.X
Remco R. Bouckaert, Eibe Frank, Evaluating the Replicability of Significance Tests for Comparing Learning Algorithms Advances in Knowledge Discovery and Data Mining. pp. 3- 12 ,(2004) , 10.1007/978-3-540-24775-3_3
N. Long, D. Gianola, G.J.M. Rosa, K.A. Weigel, Dimension reduction and variable selection for genomic selection: application to predicting milk yield in Holsteins. Journal of Animal Breeding and Genetics. ,vol. 128, pp. 247- 257 ,(2011) , 10.1111/J.1439-0388.2011.00917.X
T. H. E. Meuwissen, M. E. Goddard, B. J. Hayes, Prediction of Total Genetic Value Using Genome-Wide Dense Marker Maps Genetics. ,vol. 157, pp. 1819- 1829 ,(2001) , 10.1093/GENETICS/157.4.1819
J. D. Storey, R. Tibshirani, Statistical significance for genomewide studies Proceedings of the National Academy of Sciences of the United States of America. ,vol. 100, pp. 9440- 9445 ,(2003) , 10.1073/PNAS.1530509100
M.L. Makgahlela, E.A. Mäntysaari, I. Strandén, M. Koivula, U.S. Nielsen, M.J. Sillanpää, J. Juga, Across breed multi-trait random regression genomic predictions in the Nordic Red dairy cattle Journal of Animal Breeding and Genetics. ,vol. 130, pp. 10- 19 ,(2013) , 10.1111/J.1439-0388.2012.01017.X
Paulino Pérez, Gustavo de los Campos, Genome-Wide Regression and Prediction with the BGLR Statistical Package Genetics. ,vol. 198, pp. 483- 495 ,(2014) , 10.1534/GENETICS.114.164442