Research output: Contribution to journal › Article › Academic › peer-review
Correlating multiple SNPs and multiple disease phenotypes: Penalized nonlinear canonical correlation analysis. / Waaijenborg, Sandra; Zwinderman, Aeilko H.
In: Bioinformatics (Oxford, England), Vol. 25, No. 21, 2009, p. 2764-2771.Research output: Contribution to journal › Article › Academic › peer-review
}
TY - JOUR
T1 - Correlating multiple SNPs and multiple disease phenotypes: Penalized nonlinear canonical correlation analysis
AU - Waaijenborg, Sandra
AU - Zwinderman, Aeilko H.
PY - 2009
Y1 - 2009
N2 - MOTIVATION: Canonical correlation analysis (CCA) can be used to capture the underlying genetic background of a complex disease, by associating two datasets containing information about a patient's phenotypical and genetic details. Often the genetic information is measured on a qualitative scale, consequently ordinary CCA can not be applied to such data. Moreover, the size of the data in genetic studies can be enormous thereby making the results difficult to interpret. RESULTS: We developed a penalized nonlinear canonical correlation analysis approach that can deal with qualitative data by transforming each qualitative variable into a continuous variable via optimal scaling. Additionally sparse results were obtained by adapting softthresholding to this nonlinear version of the CCA. By means of simulation studies we show that our method is capable of extracting relevant variables out of high dimensional sets. We applied our method to a genetic dataset containing 144 patients with glial cancer. CONTACT: s.waaijenborg@amc.uva.nl
AB - MOTIVATION: Canonical correlation analysis (CCA) can be used to capture the underlying genetic background of a complex disease, by associating two datasets containing information about a patient's phenotypical and genetic details. Often the genetic information is measured on a qualitative scale, consequently ordinary CCA can not be applied to such data. Moreover, the size of the data in genetic studies can be enormous thereby making the results difficult to interpret. RESULTS: We developed a penalized nonlinear canonical correlation analysis approach that can deal with qualitative data by transforming each qualitative variable into a continuous variable via optimal scaling. Additionally sparse results were obtained by adapting softthresholding to this nonlinear version of the CCA. By means of simulation studies we show that our method is capable of extracting relevant variables out of high dimensional sets. We applied our method to a genetic dataset containing 144 patients with glial cancer. CONTACT: s.waaijenborg@amc.uva.nl
U2 - 10.1093/bioinformatics/btp491
DO - 10.1093/bioinformatics/btp491
M3 - Article
C2 - 19689958
VL - 25
SP - 2764
EP - 2771
JO - Bioinformatics (Oxford, England)
JF - Bioinformatics (Oxford, England)
SN - 1367-4803
IS - 21
ER -
ID: 933395