Training Set Determination for Genomic Selection

Jen-Hsiang Ou and Chen-Tuo Liao

Key message

A new optimality criterion is proposed to determine a training set for genomic selection, which is derived from Pearson's correlation between GEBVs and phenotypic values of a test set. R functions are provided to generate the optimal training set.
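
The quantity behind the criterion, Pearson's correlation between GEBVs and the phenotypic values of a test set, can be illustrated with a minimal base-R sketch. Everything below is an assumption made for illustration: the data are simulated, the object names (X, y, train, test, lambda) are hypothetical, and a plain ridge regression stands in for the genomic prediction model. It does not use TSDFGS functions and is not the proposed criterion itself.

```r
# Illustrative sketch only: simulated data, not the rice data set,
# and no TSDFGS functions are called.
set.seed(1)
n <- 100  # number of individuals (assumed)
p <- 50   # number of markers (assumed)

# Simulated marker scores coded -1/0/1 and simulated phenotypes
X <- matrix(sample(c(-1, 0, 1), n * p, replace = TRUE), nrow = n, ncol = p)
beta <- rnorm(p, sd = 0.3)
y <- as.vector(X %*% beta + rnorm(n))

# Random split into a training set and a test set
train <- sample(n, 60)
test <- setdiff(seq_len(n), train)

# Ridge regression on the training set (lambda is an assumed shrinkage value)
lambda <- 1
Xt <- X[train, , drop = FALSE]
mu <- mean(y[train])
beta_hat <- solve(crossprod(Xt) + lambda * diag(p),
                  crossprod(Xt, y[train] - mu))

# Predicted breeding values for the test set and their Pearson correlation
# with the observed test-set phenotypes
gebv <- as.vector(mu + X[test, , drop = FALSE] %*% beta_hat)
r <- cor(gebv, y[test])
print(r)
```

A different choice of training set generally gives a different value of r, which is why the composition of the training set matters.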

R-package: TSDFGS


R> install.packages("TSDFGS")

** Windows users need to install Rtools first.


Example Data: 44k rice genome data

R> data("rice44kPCA")

The 44K rice genome data set was first published by Zhao et al. (2011). It provides 44,100 SNP variants across 413 diverse accessions of Oryza sativa. The original data set can be downloaded from the "Rice Diversity" website.



