Gene‐ or region‐based analysis of genome‐wide...

Gene‐ or region‐based analysis of genome‐wide association studies

Abstract

With rapid advances in genotyping technologies in recent years and the growing number of available markers, genome-wide association studies are emerging as promising approaches for the study of complex diseases and traits. However, there are several challenges with analysis and interpretation of such data. First, there is a massive multiple testing problem, due to the large number of markers that need to be analyzed, leading to an increased risk of false positives and decreased ability for association studies to detect truly associated markers. In particular, the ability to detect modest genetic effects can be severely compromised. Second, a genetic association of a given single-nucleotide polymorphism as determined by univariate statistical analyses does not typically explain biologically interesting features, and often requires subsequent interpretation using a higher unit, such as a gene or region, for example, as defined by haplotype blocks. Third, missing genotypes in the data set and other data quality issues can pose challenges when comparisons across platforms and replications are planned. Finally, depending on the type of univariate analysis, computational burden can arise as the number of markers continues to grow into the millions. One way to deal with these and related challenges is to consider higher units for the analysis, such as genes or regions. This article summarizes analytical methods and strategies that have been proposed and applied by Group 16 to two genome-wide association data sets made available through the Genetic Analysis Workshop 16.

Authors

Beyene J; Tritchler D; Asimit JL; Hamid JS

Journal

Genetic Epidemiology, Vol. 33, No. S1, pp. s105–s110

Publisher

Wiley

Publication Date

December 10, 2009

DOI

10.1002/gepi.20481

ISSN

0741-0395

Associated Experts

Joseph Beyene

Professor, Faculty of Health Sciences

Visit profile

Labels