Genomic Selection in Plant Breeding: A Comparison of Models

  • Nicolas Heslot
    Dep. of Plant Breeding and Genetics Cornell Univ. Ithaca NY 14853
  • Hsiao‐Pei Yang
    USDA‐ARS, R.W. Holley Center for Agriculture and Health, Cornell Univ. Ithaca NY 14853
  • Mark E. Sorrells
    Dep. of Plant Breeding and Genetics Cornell Univ. Ithaca NY 14853
  • Jean‐Luc Jannink
    USDA‐ARS, R.W. Holley Center for Agriculture and Health, Cornell Univ. Ithaca NY 14853

抄録

<jats:title>ABSTRACT</jats:title><jats:p>Simulation and empirical studies of genomic selection (GS) show accuracies sufficient to generate rapid genetic gains. However, with the increased popularity of GS approaches, numerous models have been proposed and no comparative analysis is available to identify the most promising ones. Using eight wheat (<jats:italic>Triticum aestivum</jats:italic> L.), barley (<jats:italic>Hordeum vulgare</jats:italic> L.), <jats:italic>Arabidopsis thaliana</jats:italic> (L.) Heynh., and maize (<jats:italic>Zea mays</jats:italic> L.) datasets, the predictive ability of currently available GS models along with several machine learning methods was evaluated by comparing accuracies, the genomic estimated breeding values (GEBVs), and the marker effects for each model. While a similar level of accuracy was observed for many models, the level of overfitting varied widely as did the computation time and the distribution of marker effect estimates. Our comparisons suggested that GS in plant breeding programs could be based on a reduced set of models such as the Bayesian Lasso, weighted Bayesian shrinkage regression (wBSR, a fast version of BayesB), and random forest (RF) (a machine learning method that could capture nonadditive effects). Linear combinations of different models were tested as well as bagging and boosting methods, but they did not improve accuracy. This study also showed large differences in accuracy between subpopulations within a dataset that could not always be explained by differences in phenotypic variance and size. The broad diversity of empirical datasets tested here adds evidence that GS could increase genetic gain per unit of time and cost.</jats:p>

収録刊行物

被引用文献 (11)*注記

もっと見る

問題の指摘

ページトップへ