Skip to main navigation Skip to search Skip to main content

Partial least squares (PLS) applied to medical bioinformatics

  • Walker H. Land
  • , William Ford
  • , Jin Woo Park
  • , Ravi Mathur
  • , Nathan Hotchkiss
  • , John Heine
  • , Steven Eschrich
  • , Xingye Qiao
  • , Timothy Yeatman
  • State University of New York Binghamton University
  • Moffitt Cancer Center

Research output: Contribution to journalConference articlepeer-review

30 Scopus citations

Abstract

PLS initially creates uncorrelated latent variables which are linear combinations of the original input vectors Xi, where weights are used to determine linear combinations, which are proportional to the covariance. Secondly, a least squares regression is then performed on the subset of extracted latent variables that lead to a lower and biased variance on transformed data. This process, leads to a lower variance estimate of the regression coefficients when compared to the Ordinary Least Squares regression approach. Classical Principal Component Analysis (PCA), linear PLS and kernel ridge regression (KRR) techniques are well known shrinkage estimators designed to deal with multi-collinearity, which can be a serious problem. That is, multi-collinearity can dramatically influence the effectiveness of a regression model by changing the values and signs of estimated regression coefficients given different but similar data samples, thereby leading to a regression model which represents training data reasonably well, but generalizes poorly to validation and test data. We explain how to address these problems, which is followed by performing a PLS hypotheses driven preliminary research study and sensitivities analysis by not doing a combinatorial analysis as PLS will eliminate the unnecessary variables using a microarray colon cancer data set. Research studies as well as preliminary results are described in the results section.

Original languageEnglish
Pages (from-to)273-278
Number of pages6
JournalProcedia Computer Science
Volume6
DOIs
StatePublished - 2011
EventComplex Adaptive Systems - Chicago, IL, United States
Duration: Oct 30 2011Nov 2 2011

Keywords

  • Biomarker research
  • Colon cancer
  • Complex adaptive systems
  • Microarrays
  • Partial least squares
  • Statistical learning theory

Fingerprint

Dive into the research topics of 'Partial least squares (PLS) applied to medical bioinformatics'. Together they form a unique fingerprint.

Cite this