Abstract
The phenomenon of heterosis is widely used in hybrid breeding programmes, despite the fact that no satisfactory molecular explanation is available. Estimators of quantitative genetic components like GCA and SCA values are tools used by the plant breeder to identify superior parental individuals and to search for high heterosis combinations. Obtaining these estimators usually requires the creation of new parental combinations and testing their offspring in multi-environment field trials. In this study we explore the use of ɛ-insensitive Support Vector Machine Regression (ɛ-SVR) for the prediction of GCA and SCA values from the molecular marker scores of parental inbred lines as an alternative to these field trials. Prediction accuracies are obtained by means of cross-validation on a grain maize data set from the private breeding company RAGT R2n. Results indicate that the proposed method allows the routine screening of new inbred lines despite the fact that predicting the SCA value of an untested hybrid remains problematic with the available molecular marker information and standard kernel functions. The genotypical performance of a testcross hybrid, originating from a cross between an untested inbred line and a well-known complementary tester, can be predicted with moderate to high accuracy while this cannot be said for a cross between two untested inbred lines.
Similar content being viewed by others
References
Bernardo R (1994) Prediction of maize single-cross performance using RFLPs and information from related hybrids. Crop Sci 34:20–25
Burges C (1998) A tutorial on support vector machines for pattern recognition. Data Mining Knowl Discov 2(2):121–167
Chang C, Lin C (2001) LIBSVM: a library for support vector machines. Software available at http://www.csie.ntu.edu.tw/∼cjlin/libsvm
Drezner Z, Wesolowsky GO (1989) An approximation method for bivariate and multivariate normal equiprobability contours. Commun Stat Theory Methods 18(6):2331–2344
Emik L, Terrill C (1949) Systematic procedures for calculating inbreeding coefficients. J Hered 40:51–55
Gilmour A, Gogel B, Cullis B, Welham S, Thompson R (2002) ASREML User Guide Release 1.0. VSN International Ltd
Henderson C (1984) Applications of linear models in animal breeding. University of Guelph Press
Hsu C, Chang C, Lin C (2003) A practical guide to support vector classification. Department of Computer Science and Information Engineering National Taiwan University, http://www.csie.ntu.edu.tw/∼cjlin/papers/guide/guide.pdf
Shawe-Taylor J, Cristianini N (2004) Kernel methods for pattern analysis. Cambridge University Press
Stuber C, Cockerham C (1966) Gene effects and variances in hybrid populations. Genetics 54:1279–1286
Vapnik V (1995) The nature of statistical learning theory. Springer-Verlag, New York, USA
Vuylsteke M, Mank R, Antonise R, Bastiaans RE, Senior M, Stuber C, Melchinger A, Lübberstedt T, Xia X, Stam P, Zabeau M, Kuiper M (1999) Two high-density AFLP (R) linkage maps of Zea mays L.: analysis of distribution of AFLP markers. Theor Appl Genet 99(6):921–935
Acknowledgements
The authors would like to thank the people from RAGT R2n for their unreserved and open minded scientific contribution to this research. We are also very grateful to Stijn Vansteelandt, Jan De Riek and Peter Dawyndt for discussions on linear mixed modelling, genotyping by means of AFLP markers and cluster computing.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Maenhout, S., De Baets, B., Haesaert, G. et al. Marker-based screening of maize inbred lines using support vector machine regression. Euphytica 161, 123–131 (2008). https://doi.org/10.1007/s10681-007-9423-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10681-007-9423-5