Optimizing the power of genome-wide association studies by using publicly available reference samples to expand the control group.

Abstract:

:Genome-wide association (GWA) studies have proved extremely successful in identifying novel genetic loci contributing effects to complex human diseases. In doing so, they have highlighted the fact that many potential loci of modest effect remain undetected, partly due to the need for samples consisting of many thousands of individuals. Large-scale international initiatives, such as the Wellcome Trust Case Control Consortium, the Genetic Association Information Network, and the database of genetic and phenotypic information, aim to facilitate discovery of modest-effect genes by making genome-wide data publicly available, allowing information to be combined for the purpose of pooled analysis. In principle, disease or control samples from these studies could be used to increase the power of any GWA study via judicious use as "genetically matched controls" for other traits. Here, we present the biological motivation for the problem and the theoretical potential for expanding the control group with publicly available disease or reference samples. We demonstrate that a naïve application of this strategy can greatly inflate the false-positive error rate in the presence of population structure. As a remedy, we make use of genome-wide data and model selection techniques to identify "axes" of genetic variation which are associated with disease. These axes are then included as covariates in association analysis to correct for population structure, which can result in increases in power over standard analysis of genetic information from the samples in the original GWA study.

journal_name

Genet Epidemiol

journal_title

Genetic epidemiology

authors

Zhuang JJ,Zondervan K,Nyberg F,Harbron C,Jawaid A,Cardon LR,Barratt BJ,Morris AP

doi

10.1002/gepi.20482

subject

Has Abstract

pub_date

2010-05-01 00:00:00

pages

319-26

issue

4

eissn

0741-0395

issn

1098-2272

journal_volume

34

pub_type

杂志文章
  • Maximum-likelihood estimation of haplotype frequencies in nuclear families.

    abstract::The importance of haplotype analysis in the context of association fine mapping of disease genes has grown steadily over the last years. Since experimental methods to determine haplotypes on a large scale are not available, phase has to be inferred statistically. For individual genotype data, several reconstruction te...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.10323

    authors: Becker T,Knapp M

    更新日期:2004-07-01 00:00:00

  • Sequencing and imputation in GWAS: Cost-effective strategies to increase power and genomic coverage across diverse populations.

    abstract::A key aim for current genome-wide association studies (GWAS) is to interrogate the full spectrum of genetic variation underlying human traits, including rare variants, across populations. Deep whole-genome sequencing is the gold standard to fully capture genetic variation, but remains prohibitively expensive for large...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.22326

    authors: Quick C,Anugu P,Musani S,Weiss ST,Burchard EG,White MJ,Keys KL,Cucca F,Sidore C,Boehnke M,Fuchsberger C

    更新日期:2020-09-01 00:00:00

  • Familial aggregation of breast cancer with early onset lung cancer.

    abstract::Site-specific familial aggregation and evidence supporting Mendelian codominant inheritance have been shown in lung cancer. In characterizing lung cancer families, a number of other cancers have been observed. The current study evaluates whether first-degree relatives of early onset lung cancer cases are at increased ...

    journal_title:Genetic epidemiology

    pub_type: 临床试验,杂志文章

    doi:10.1002/(SICI)1098-2272(199911)17:4<274::AID-GEPI3

    authors: Schwartz AG,Siegfried JM,Weiss L

    更新日期:1999-11-01 00:00:00

  • Pleiotropy and principal components of heritability combine to increase power for association analysis.

    abstract::When many correlated traits are measured the potential exists to discover the coordinated control of these traits via genotyped polymorphisms. A common statistical approach to this problem involves assessing the relationship between each phenotype and each single nucleotide polymorphism (SNP) individually (PHN); and t...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20257

    authors: Klei L,Luca D,Devlin B,Roeder K

    更新日期:2008-01-01 00:00:00

  • Genetic epidemiology of autosomal recessive spastic ataxia of Charlevoix-Saguenay in northeastern Quebec.

    abstract::Autosomal recessive spastic ataxia of Charlevoix-Saguenay (ARSACS) is a disorder that has an elevated frequency in Saguenay-Lac-St-Jean (SLSJ) and Charlevoix, two geographically isolated regions in the past of northeastern Quebec. The incidence at birth and the carrier rate in SLSJ were estimated at 1/1,932 liveborn i...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370100103

    authors: De Braekeleer M,Giasson F,Mathieu J,Roy M,Bouchard JP,Morgan K

    更新日期:1993-01-01 00:00:00

  • Equivalence of the mixed and regressive models for genetic analysis. I. Continuous traits.

    abstract::The mixed model of segregation analysis specifies major gene effects and partitions the residual variance into polygenic and environmental components. The model explains familial correlations essentially in terms of genetic causation. The regressive model, on the other hand, is constructed by successively conditioning...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370060505

    authors: Demenais FM,Bonney GE

    更新日期:1989-01-01 00:00:00

  • Integration of multiomic annotation data to prioritize and characterize inflammation and immune-related risk variants in squamous cell lung cancer.

    abstract::Clinical trial results have recently demonstrated that inhibiting inflammation by targeting the interleukin-1β pathway can offer a significant reduction in lung cancer incidence and mortality, highlighting a pressing and unmet need to understand the benefits of inflammation-focused lung cancer therapies at the genetic...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.22358

    authors: Sun R,Xu M,Li X,Gaynor S,Zhou H,Li Z,Bossé Y,Lam S,Tsao MS,Tardon A,Chen C,Doherty J,Goodman G,Bojesen SE,Landi MT,Johansson M,Field JK,Bickeböller H,Wichmann HE,Risch A,Rennert G,Arnold S,Wu X,Melander O,

    更新日期:2021-02-01 00:00:00

  • Generalization of the extended transmission disequilibrium test to two unlinked disease loci.

    abstract::The extended transmission disequilibrium test (ETDT) of Sham and Curtis [1995] is a powerful test of the null hypothesis of no linkage between a multi-allelic marker locus and a disease susceptibility locus of unknown location in the presence of association between alleles at the two loci. We propose a generalization ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.13701707108

    authors: Morris A,Whittaker J

    更新日期:1999-01-01 00:00:00

  • Linkage disequilibrium structure and its impact on the localization of a candidate functional mutation.

    abstract::We have used the unblinded MG1/Q1 Genetic Analysis Workshop 12 simulated data as a model system for investigating the use of linkage disequilibrium structure and simple genotype-phenotype associations to identify candidate functional mutations within a gene of interest. Analysis of the pattern of pairwise linkage dise...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.2001.21.s1.s620

    authors: Huang Q,Morrison AC,Boerwinkle E

    更新日期:2001-01-01 00:00:00

  • Meta-analysis by combining p-values: simulated linkage studies.

    abstract::Meta-analysis has been little explored to make an overall assessment of linkage from different studies. In practice, it is likely that published linkage studies will only report p-values. We compared the performance of the widely used Fisher method for combining p-values with that of pooling raw data. More loci were c...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章,meta分析

    doi:10.1002/gepi.1370170798

    authors: Guerra R,Etzel CJ,Goldstein DR,Sain SR

    更新日期:1999-01-01 00:00:00

  • Genome-wide family-based linkage analysis of exome chip variants and cardiometabolic risk.

    abstract::Linkage analysis of complex traits has had limited success in identifying trait-influencing loci. Recently, coding variants have been implicated as the basis for some biomedical associations. We tested whether coding variants are the basis for linkage peaks of complex traits in 42 African-American (n = 596) and 90 His...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21801

    authors: Hellwege JN,Palmer ND,Raffield LM,Ng MC,Hawkins GA,Long J,Lorenzo C,Norris JM,Ida Chen YD,Speliotes EK,Rotter JI,Langefeld CD,Wagenknecht LE,Bowden DW

    更新日期:2014-05-01 00:00:00

  • Gene-environment interaction tests for dichotomous traits in trios and sibships.

    abstract::When testing for genetic effects, failure to account for a gene-environment interaction can mask the true association effects of a genetic marker with disease. Family-based association tests are popular because they are completely robust to population substructure and model misspecification. However, when testing for ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20421

    authors: Hoffmann TJ,Lange C,Vansteelandt S,Laird NM

    更新日期:2009-12-01 00:00:00

  • Effect of linkage disequilibrium between markers in linkage and association analyses.

    abstract::Contributions to Group 17 of the Genetic Analysis Workshop 15 considered dense markers in linkage disequilibrium (LD) in the context of either linkage or association analysis. Three contributions reported on methods for modeling LD or selecting a subset of markers in linkage equilibrium to perform linkage analysis. Wh...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20291

    authors: Dupuis J,Albers K,Allen-Brady K,Cho K,Elston RC,Kappen HJ,Tang H,Thomas A,Thomson G,Tsung E,Yang Q,Zhang W,Zhao K,Zheng G,Ziegler JT

    更新日期:2007-01-01 00:00:00

  • Truncated tests for combining evidence of summary statistics.

    abstract::To date, thousands of genetic variants to be associated with numerous human traits and diseases have been identified by genome-wide association studies (GWASs). The GWASs focus on testing the association between single trait and genetic variants. However, the analysis of multiple traits and single nucleotide polymorph...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.22330

    authors: Bu D,Yang Q,Meng Z,Zhang S,Li Q

    更新日期:2020-10-01 00:00:00

  • Haplotype kernel association test as a powerful method to identify chromosomal regions harboring uncommon causal variants.

    abstract::For most complex diseases, the fraction of heritability that can be explained by the variants discovered from genome-wide association studies is minor. Although the so-called "rare variants" (minor allele frequency [MAF] < 1%) have attracted increasing attention, they are unlikely to account for much of the "missing h...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21740

    authors: Lin WY,Yi N,Lou XY,Zhi D,Zhang K,Gao G,Tiwari HK,Liu N

    更新日期:2013-09-01 00:00:00

  • Sample size calculations for linkage analysis using extreme sib pairs based on segregation analysis with the quantitative phenotype body weight as an example.

    abstract::One approach to establish linkage is based on allele-sharing methods for sib pairs. Recently, the use of extreme sib pairs (ESP) has been proposed to increase power for mapping quantitative traits in humans. Several approaches have been discussed. In this study, we calculate sample sizes for the various ESP approaches...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/(SICI)1098-2272(1998)15:6<577::AID-GEPI3>3

    authors: Ziegler A,Hebebrand J

    更新日期:1998-01-01 00:00:00

  • Investigation of a candidate gene, environment, and G x E interaction using case-control and case-parent study designs.

    abstract::We investigated the independent contributions of a candidate gene and an environmental factor, and the presence of gene x environment (G x E) interaction, in the etiology of a disease in the Genetic Analysis Workshop (GAW) 12 problem 2 simulated data using a two-stage approach utilizing both case-control and case-pare...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.2001.21.s1.s843

    authors: Norris JM,Selinger-Leneman H,Génin E

    更新日期:2001-01-01 00:00:00

  • Presidential address: Six open questions to genetic epidemiologists.

    abstract::Given the rapid pace with which genomics and other -omics disciplines are evolving, it is sometimes necessary to shift down a gear to consider more general scientific questions. In this line, in my presidential address I formulate six questions for genetic epidemiologists to ponder on. These cover the areas of reprodu...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.22191

    authors: König IR

    更新日期:2019-04-01 00:00:00

  • Genetic analysis of a complex disease in the presence of an environmental risk factor.

    abstract::The role of a gene in a disease may be hidden by the presence of another risk factor such as an environmental factor. In that case, stratifying the data according to this factor strengthens power to detect linkage or association. We followed this strategy on the simulated data provided by GAW11. The transmission/diseq...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370170788

    authors: Eichenbaum-Voline S,Baur MP,Knapp M

    更新日期:1999-01-01 00:00:00

  • A likelihood ratio-based Mann-Whitney approach finds novel replicable joint gene action for type 2 diabetes.

    abstract::The potential importance of the joint action of genes, whether modeled with or without a statistical interaction term, has long been recognized. However, identifying such action has been a great challenge, especially when millions of genetic markers are involved. We propose a likelihood ratio-based Mann-Whitney test t...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21651

    authors: Lu Q,Wei C,Ye C,Li M,Elston RC

    更新日期:2012-09-01 00:00:00

  • A two-locus model for familial Alzheimer's disease?

    abstract::The present findings for familial Alzheimer's disease suggest a possible linkage to gene(s) on chromosome 21 for the early onset form and to chromosome 19 for the late onset. Since these results are not unequivocal, possible alternative hypotheses include the effect of genetic heterogeneity or of an oligogenic model o...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370100618

    authors: Macciardi F,Cavallini MC

    更新日期:1993-01-01 00:00:00

  • Accounting for population stratification in DNA methylation studies.

    abstract::DNA methylation is an important epigenetic mechanism that has been linked to complex diseases and is of great interest to researchers as a potential link between genome, environment, and disease. As the scale of DNA methylation association studies approaches that of genome-wide association studies, issues such as popu...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21789

    authors: Barfield RT,Almli LM,Kilaru V,Smith AK,Mercer KB,Duncan R,Klengel T,Mehta D,Binder EB,Epstein MP,Ressler KJ,Conneely KN

    更新日期:2014-04-01 00:00:00

  • Gene-dropping vs. empirical variance estimation for allele-sharing linkage statistics.

    abstract::In this study, we compare the statistical properties of a number of methods for estimating P-values for allele-sharing statistics in non-parametric linkage analysis. Some of the methods are based on the normality assumption, using different variance estimation methods, and others use simulation (gene-dropping) to find...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20177

    authors: Jung J,Weeks DE,Feingold E

    更新日期:2006-12-01 00:00:00

  • Tag SNPs chosen from HapMap perform well in several population isolates.

    abstract::Population isolates may be particularly useful for association studies of complex traits. This utility, however, largely depends on the transferability of tag SNPs chosen from reference samples, such as HapMap, to samples from such populations. Factors that characterize population isolates, such as widespread genetic ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20201

    authors: Service S,International Collaborative Group on Isolated Populations.,Sabatti C,Freimer N

    更新日期:2007-04-01 00:00:00

  • A Bayesian toolkit for genetic association studies.

    abstract::We present a range of modelling components designed to facilitate Bayesian analysis of genetic-association-study data. A key feature of our approach is the ability to combine different submodels together, almost arbitrarily, for dealing with the complexities of real data. In particular, we propose various techniques f...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20140

    authors: Lunn DJ,Whittaker JC,Best N

    更新日期:2006-04-01 00:00:00

  • Genotyping errors, pedigree errors, and missing data.

    abstract::Our group studied the effects of genotyping errors, pedigree errors, and missing data on a wide range of techniques, with a focus on the role of single-nucleotide polymorphisms (SNPs). Half of our group used simulated data, and half of our group used data from the Collaborative Study on the Genetics of Alcoholism (COG...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20120

    authors: Hinrichs AL,Suarez BK

    更新日期:2005-01-01 00:00:00

  • Comparison of two linkage inference procedures for genes related to the P300 component of the event related potential.

    abstract::Our goal was to detect genes contributing to the P300 component of the event related potential (ERP). We found that all of the ERP traits were highly correlated. Most of them distinguished alcoholics from nonalcoholics. To have one summary variable for the ERP traits, we calculated the first principal component (PRIN1...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370170728

    authors: Goldin LR,Chase GA

    更新日期:1999-01-01 00:00:00

  • Availability of schizophrenic patients and their families for genetic linkage studies: findings from the Maryland epidemiology sample.

    abstract::It has been suggested that collections of affected sib pairs, or their nuclear families, may be an efficient method for screening for genetic linkages in schizophrenia. We present the data collected in five years from 15 hospitals in the state of Maryland in an effort to determine if such a collection scheme will be f...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370060604

    authors: Pulver AE,Bale SJ

    更新日期:1989-01-01 00:00:00

  • Phenotypic effects of apolipoprotein structural variation on lipid profiles: II. Apolipoprotein A-IV and quantitative lipid measures in the healthy women study.

    abstract::Apolipoprotein A-IV (APO A-IV) is a major protein component of mesenteric lymph chylomicrons and very-low-density lipoproteins. It is found in plasma predominantly unassociated with major lipoprotein fractions and in high density lipoproteins. APO A-IV exhibits structural heterogeneity owing to two codominant alleles,...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370060404

    authors: Eichner JE,Kuller LH,Ferrell RE,Kamboh MI

    更新日期:1989-01-01 00:00:00

  • Logistic transmission modeling for the simulated data of GAW10 problem 2.

    abstract::A recently developed nonparametric method is a generalization of the transmission disequilibrium test across all alleles of a locus. This approach has been applied to Problem 2 of GAW10 and has been extended to explore the combined contribution of neighboring loci for chromosomes 1, 5, and 8. When applied to the chrom...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/(SICI)1098-2272(1997)14:6<857::AID-GEPI49>

    authors: Neas BR,Moser KL,Harley JB

    更新日期:1997-01-01 00:00:00