Two-stage clustering in genotype-by-environment analyses with missing data

Godfrey AJ; Wood GR; Ganesalingam S; Nichols MA; Qiao CG

Two-stage clustering in genotype-by-environment analyses with missing data

Files

Godfrey2002.pdf(130.21 KB)

Date

2002

Authors

Godfrey AJ

Publisher

CAMBRIDGE UNIV PRESS

Abstract

Cluster analysis has been commonly used in genotype-by-environment (G × E) analyses, but current methods are inadequate when the data matrix is incomplete. This paper proposes a new method, referred to as two-stage clustering, which relies on a partitioning of squared Euclidean distance into two independent components, the G × E interaction and the genotype main effect. These components are used in the first and second stages of clustering respectively. Two-stage clustering forms the basis for imputing missing values in the G × E matrix, so that a more complete data array is available for other G × E analyses. Imputation for a given genotype uses information from genotypes with similar interaction profiles. This imputation method is shown to improve on an existing nearest cluster method that confounds the G × E interaction and the genotype main effect.

Citation

Godfrey, A. J. R.; Wood, G. R.; Ganesalingam, S.; Nichols, M. A.; Qiao, C. G. (2002). Two-stage clustering in genotype-by-environment analyses with missing data. Journal of Agricultural Science. Vol. 139, pp. 67-77.

URI

https://hdl.handle.net/10179/592

Collections

Journal Articles

Full item page