Computing and Mathematical Sciences Colloquium

Monday, October 20, 2014, 4:00 PM

Linear Genetics: (the Linear Algebra of) Genotype-Phenotype Association Studies

Speaker: Professor Lior Pachter, Department of Mathematics, University of California at Berkeley
Location: Annenberg 105

Principal component analysis (PCA) of genotype matrices has become a standard tool for studying population structure in genetics and has also been used to control for such structure in genome-wide association studies. I will discuss how to generalize the PCA framework to the study of more complex genotype-phenotype interactions via a probabilistic model that subsumes both probabilistic PCA and canonical correlation analysis (CCA). The model, which we term factored association analysis (FAA), also addresses the overfitting that arises when CCA is applied naively. Using FAA, I will demonstrate evidence for population structure in gene expression and show how the model can be used to analyze multiple diverse genomic datasets, in particular those from cancer genome projects. This is joint work with Nicolas Bray, Brielin Brown, and Shannon McCurdy.

Series: Computing and Mathematical Sciences Colloquium Series

Contact: Carmen Nemer-Sirois at (626) 395-4561 or carmens@caltech.edu