Genetic background comparison using distance-based regression, with applications in population stratification evaluation and adjustment.

Abstract:

Population stratification (PS) can lead to an inflated rate of false-positive findings in genome-wide association studies (GWAS). The commonly used approach of adjusting for a fixed number of principal components (PCs) can reduce power when the selected PCs are equally distributed in cases and controls, or when covariates already included in the association analyses, such as self-identified ethnicity or recruitment center, correctly map to the major axes of genetic heterogeneity. We propose a computationally efficient procedure, PC-Finder, to identify a minimal set of PCs while permitting an effective correction for PS. A general pseudo F statistic, derived from a non-parametric multivariate regression model, can be used to assess whether PS exists or has been adequately corrected by a set of selected PCs. Empirical data from two GWAS conducted as part of the Cancer Genetic Markers of Susceptibility (CGEMS) project demonstrate the application of the procedure. Furthermore, simulation studies show the power advantage of the proposed procedure in GWAS over currently used PS correction strategies, particularly when the PCs with substantial genetic variation are distributed similarly in cases and controls and therefore do not induce PS.
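The abstract does not give the formula for the pseudo F statistic. As a rough illustration only, the sketch below computes a generic distance-based pseudo F from a pairwise distance matrix and a set of covariates, with a permutation p-value. It is not the authors' PC-Finder implementation: the function names (`gower_center`, `pseudo_f`, `permutation_p`), the Gower-centering formulation, and the use of permutations for significance are all assumptions made for this example.

```python
import numpy as np


def gower_center(dist):
    """Turn an n x n distance matrix into a Gower-centered inner-product matrix."""
    n = dist.shape[0]
    a = -0.5 * dist ** 2
    c = np.eye(n) - np.ones((n, n)) / n  # centering matrix
    return c @ a @ c


def pseudo_f(g, x):
    """Distance-based pseudo F for covariates x (1-D or 2-D), intercept added here."""
    n = g.shape[0]
    design = np.column_stack([np.ones(n), x])
    m = design.shape[1]
    hat = design @ np.linalg.pinv(design)  # projection onto the covariate space
    resid = np.eye(n) - hat
    explained = np.trace(hat @ g @ hat) / (m - 1)
    unexplained = np.trace(resid @ g @ resid) / (n - m)
    return explained / unexplained


def permutation_p(g, x, n_perm=999, seed=0):
    """Permutation p-value: shuffle subjects in g while holding x fixed."""
    rng = np.random.default_rng(seed)
    observed = pseudo_f(g, x)
    hits = 0
    for _ in range(n_perm):
        idx = rng.permutation(g.shape[0])
        if pseudo_f(g[np.ix_(idx, idx)], x) >= observed:
            hits += 1
    return observed, (hits + 1) / (n_perm + 1)
```

In a PS setting, `dist` would hold pairwise genetic distances between subjects and `x` would be case-control status (to ask whether stratification exists) or status after accounting for selected PCs; how the paper actually handles the latter is not stated in the abstract. With a single numeric trait and absolute differences as distances, `pseudo_f` reduces to the ordinary one-way ANOVA F, which gives a quick sanity check.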

journal_name

Genet Epidemiol

journal_title

Genetic epidemiology

authors

Li Q,Wacholder S,Hunter DJ,Hoover RN,Chanock S,Thomas G,Yu K

doi

10.1002/gepi.20396

subject

Has Abstract

pub_date

2009-07-01 00:00:00

pages

432-41

issue

5

eissn

1098-2272

issn

0741-0395

journal_volume

33

pub_type

Journal article
  • Quantitative allelic test--a fast test for very large association studies.

    abstract::Advances in high throughput technology have enabled the generation of unprecedented amounts of genomic data (e.g., next-generation sequence data, transcriptomics, metabolomics, and proteomics), which promises to unravel the genetic architecture of complex traits. These discoveries may lead to novel therapeutic targets...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.21768

    authors: Lee SM,Karrison TG,Cox NJ,Im HK

    update_date:2013-12-01 00:00:00

  • Biochemical intermediates in alpha 1-antitrypsin deficiency: residual family resemblance for total alpha 1-antitrypsin, oxidized alpha 1-antitrypsin, and immunoglobulin E after adjustment for the effect of the Pi locus.

    abstract::alpha 1-antitrypsin (alpha 1 AT) deficiency is variably associated with the development of pulmonary emphysema. To gain insight into the process which begins with the Z point mutation at the Protease Inhibitor (Pi) locus and results in the variable development of emphysema, three quantitative phenotypes, including total al...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.1370070204

    authors: Silverman EK,Province MA,Campbell EJ,Pierce JA,Rao DC

    update_date:1990-01-01 00:00:00

  • Combining family- and population-based imputation data for association analysis of rare and common variants in large pedigrees.

    abstract::In the last two decades, complex traits have become the main focus of genetic studies. The hypothesis that both rare and common variants are associated with complex traits is increasingly being discussed. Family-based association studies using relatively large pedigrees are suitable for both rare and common variant id...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.21844

    authors: Saad M,Wijsman EM

    update_date:2014-11-01 00:00:00

  • Using single nucleotide polymorphisms to investigate association between a candidate gene and disease.

    abstract::A range of study designs, using unrelated or family controls, were used to investigate the pattern of association with disease of single nucleotide polymorphisms (SNPs) within candidate gene 1 (simulated data). Strong evidence of disease association at the functional locus was detected using all study designs, and in ...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.2001.21.s1.s415

    authors: Saunders CL,Crockford GP,Bishop DT,Barrett JH

    update_date:2001-01-01 00:00:00

  • Evidence for further breast cancer susceptibility genes in addition to BRCA1 and BRCA2 in a population-based study.

    abstract::We used data from a population based series of breast cancer patients to investigate the genetic models that can best explain familial breast cancer not due to the BRCA1 and BRCA2 genes. The data set consisted of 1,484 women diagnosed with breast cancer under age 55 registered in the East Anglia Cancer registry betwee...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.1014

    authors: Antoniou AC,Pharoah PD,McMullan G,Day NE,Ponder BA,Easton D

    update_date:2001-07-01 00:00:00

  • Score tests for familial correlation in genotyped-proband designs.

    abstract::In the genotyped-proband design, a proband is selected based on an observed phenotype, the genotype of the proband is observed, and then the phenotypes of all first-degree relatives are obtained. The genotypes of these first-degree relatives are not observed. Gail et al. [(1999) Genet Epidemiol] discuss likelihood ana...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/(SICI)1098-2272(200004)18:4<293::AID-GEPI3

    authors: Carroll RJ,Gail MH,Benichou J,Pee D

    update_date:2000-04-01 00:00:00

  • The impact of improved microarray coverage and larger sample sizes on future genome-wide association studies.

    abstract::Genome-wide association studies (GWAS) have identified many single nucleotide polymorphisms (SNPs) associated with complex traits. However, the genetic heritability of most of these traits remains unexplained. To help guide future studies, we address the crucial question of whether future GWAS can detect new SNP assoc...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.21724

    authors: Lindquist KJ,Jorgenson E,Hoffmann TJ,Witte JS

    update_date:2013-05-01 00:00:00

  • Phenotype validation in electronic health records based genetic association studies.

    abstract::The linkage between electronic health records (EHRs) and genotype data makes it plausible to study the genetic susceptibility of a wide range of disease phenotypes. Despite that EHR-derived phenotype data are subjected to misclassification, it has been shown useful for discovering susceptible genes, particularly in th...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.22080

    authors: Wang L,Damrauer SM,Zhang H,Zhang AX,Xiao R,Moore JH,Chen J

    update_date:2017-12-01 00:00:00

  • Testing the utility of mod scores and sib-pair analysis to detect presence of disease susceptibility loci.

    abstract::Linkage analyses and association studies were employed to detect disease susceptibility loci leading to elevated Q1 levels in Problem 2B. Phenotypes were defined to be the dichotomous affection status, the quantitative value for Q1, and Q1 adjusted for covariates. The method of mod-scores (for the dichotomous phenotyp...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/(SICI)1098-2272(1997)14:6<1035::AID-GEPI79

    authors: Neuman RJ,Xian H

    update_date:1997-01-01 00:00:00

  • A sliding-window weighted linkage disequilibrium test.

    abstract::Multilocus linkage disequilibrium (LD) tests that consider inter-marker LD are more powerful than single-locus tests when disease etiology is contributed simultaneously by several linked and correlated loci. However, inclusion of redundant non-informative markers may result in reduced testing power and/or inflated f...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.20165

    authors: Yang HC,Lin CY,Fann CS

    update_date:2006-09-01 00:00:00

  • Genetic association with multiple traits in the presence of population stratification.

    abstract::Testing association between a genetic marker and multiple-dependent traits is a challenging task when both binary and quantitative traits are involved. The inverted regression model is a convenient method, in which the traits are treated as predictors although the genetic marker is an ordinal response. It is known tha...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.21738

    authors: Yan T,Li Q,Li Y,Li Z,Zheng G

    update_date:2013-09-01 00:00:00

  • Parental transmission and D18S37 allele sharing in bipolar affective disorder.

    abstract::We combined the five chromosome 18 bipolar affective disorder data sets provided by GAW10, totaling 185 families with 3,394 individuals, and performed analysis of differential parental transmission and chromosome 18 marker allele sharing in families with transmission through fathers vs those through mothers. Results i...

    journal_title:Genetic epidemiology

    pub_type: clinical trial, journal article

    doi:10.1002/(SICI)1098-2272(1997)14:6<665::AID-GEPI19>

    authors: Lin JP,Bale SJ

    update_date:1997-01-01 00:00:00

  • The power of iterated generalized least squares (GLS) method to detect direct relationships in the analysis of correlated quantitative traits.

    abstract::We examined the power of the stepwise iterated generalized least squares (GLS) method by modeling the relationship between quantitative traits and other variables using the simulated data for Problem 2A. The comparison between the generating model provided by the workshop and the results of the stepwise iterated GLS m...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/(SICI)1098-2272(1997)14:6<797::AID-GEPI39>

    authors: He Q,Nemesure BB,Mendell NR

    update_date:1997-01-01 00:00:00

  • Linkage analysis of asthma and atopy including models with genomic imprinting.

    abstract::Asthma and atopy are two closely related, common complex traits in which a number of genetic and environmental factors are suspected to play a role. We have performed parametric and nonparametric multi-marker linkage analysis for the Busselton data set, which is part of problem 1 of Genetic Analysis Workshop 12. In pa...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.2001.21.s1.s204

    authors: Strauch K,Bogdanow M,Fimmers R,Baur MP,Wienker TF

    update_date:2001-01-01 00:00:00

  • Analysis of two-locus traits under heterogeneity for recessive versus dominant inheritance.

    abstract::Complex traits have been modeled under various modes of two-locus inheritance. One example of a two-locus threshold model is the situation where an individual is susceptible to a disease trait if he or she carries three or more disease alleles. Under this model, if each locus is examined individually the inheritance a...

    journal_title:Genetic epidemiology

    pub_type: clinical trial, journal article, randomized controlled trial

    doi:10.1002/(SICI)1098-2272(1997)14:6<1097::AID-GEPI89

    authors: Leal SM,Ott J

    update_date:1997-01-01 00:00:00

  • SimPEL: Simulation-based power estimation for sequencing studies of low-prevalence conditions.

    abstract::Power estimations are important for optimizing genotype-phenotype association study designs. However, existing frameworks are designed for common disorders, and thus ill-suited for the inherent challenges of studies for low-prevalence conditions such as rare diseases and infrequent adverse drug reactions. These challe...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.22129

    authors: Mak L,Li M,Cao C,Gordon P,Tarailo-Graovac M,Bousman C,Wang P,Long Q

    update_date:2018-07-01 00:00:00

  • Detecting epistatic interactions contributing to quantitative traits.

    abstract::The restricted partition method (RPM) is a partitioning algorithm for examining multi-locus genotypes as (potentially non-additive) predictors of a quantitative trait. The motivating application was to develop a robust method to examine quantitative phenotypes for epistasis (gene-gene interactions), but the method can...

    journal_title:Genetic epidemiology

    pub_type: journal article, review

    doi:10.1002/gepi.20006

    authors: Culverhouse R,Klein T,Shannon W

    update_date:2004-09-01 00:00:00

  • A multimarker regression-based test of linkage for affected sib-pairs at two linked loci.

    abstract::We address the analytical problem of evaluating the evidence for linkage at a test locus while taking into account the effect of a known linked disease locus. The method we propose is a multimarker regression approach that models the identity-by-descent states for affected sib-pairs at a series of linked markers in te...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.20137

    authors: Barber MJ,Todd JA,Cordell HJ

    update_date:2006-04-01 00:00:00

  • Univariate analysis of dichotomous or ordinal data from twin pairs: a simulation study comparing structural equation modeling and logistic regression.

    abstract::The univariate analysis of categorical twin data can be performed using either structural equation modeling (SEM) or logistic regression. This paper presents a comparison between these two methods using a simulation study. Dichotomous and ordinal (three category) twin data are simulated under two different sample size...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/(SICI)1098-2272(1996)13:1<79::AID-GEPI7>3.

    authors: Ramakrishnan V,Meyer JM,Goldberg J,Henderson WG

    update_date:1996-01-01 00:00:00

  • Identification of gene-gene interactions in the presence of missing data using the multifactor dimensionality reduction method.

    abstract::Gene-gene interaction is believed to play an important role in understanding complex traits. Multifactor dimensionality reduction (MDR) was proposed by Ritchie et al. [2001. Am J Hum Genet 69:138-147] to identify multiple loci that simultaneously affect disease susceptibility. Although the MDR method has been widely u...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.20416

    authors: Namkung J,Elston RC,Yang JM,Park T

    update_date:2009-11-01 00:00:00

  • Genetic epidemiology of breast cancer: segregation analysis of 389 Icelandic pedigrees.

    abstract::A genetic epidemiologic investigation of breast cancer involving 389 breast cancer pedigrees including information on 14,721 individuals from the Icelandic population-based cancer registry is presented. Probands were women born in or after 1920 and reported to have breast cancer in the cancer registry. The average age...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/(SICI)1098-2272(200001)18:1<81::AID-GEPI6>

    authors: Baffoe-Bonnie AB,Beaty TH,Bailey-Wilson JE,Kiemeney LA,Sigvaldason H,Olafsdóttir G,Tryggvadóttir L,Tulinius H

    update_date:2000-01-01 00:00:00

  • Haplotype kernel association test as a powerful method to identify chromosomal regions harboring uncommon causal variants.

    abstract::For most complex diseases, the fraction of heritability that can be explained by the variants discovered from genome-wide association studies is minor. Although the so-called "rare variants" (minor allele frequency [MAF] < 1%) have attracted increasing attention, they are unlikely to account for much of the "missing h...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.21740

    authors: Lin WY,Yi N,Lou XY,Zhi D,Zhang K,Gao G,Tiwari HK,Liu N

    update_date:2013-09-01 00:00:00

  • Transcriptome-wide association study of breast cancer risk by estrogen-receptor status.

    abstract::Previous transcriptome-wide association studies (TWAS) have identified breast cancer risk genes by integrating data from expression quantitative loci and genome-wide association studies (GWAS), but analyses of breast cancer subtype-specific associations have been limited. In this study, we conducted a TWAS using gene ...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.22288

    authors: Feng H,Gusev A,Pasaniuc B,Wu L,Long J,Abu-Full Z,Aittomäki K,Andrulis IL,Anton-Culver H,Antoniou AC,Arason A,Arndt V,Aronson KJ,Arun BK,Asseryanis E,Auer PL,Azzollini J,Balmaña J,Barkardottir RB,Barnes DR,Barrowda

    update_date:2020-07-01 00:00:00

  • An ensemble learning approach jointly modeling main and interaction effects in genetic association studies.

    abstract::Complex diseases are presumed to be the results of interactions of several genes and environmental factors, with each gene only having a small effect on the disease. Thus, the methods that can account for gene-gene interactions to search for a set of marker loci in different genes or across genome and to analyze these...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.20304

    authors: Zhang Z,Zhang S,Wong MY,Wareham NJ,Sha Q

    update_date:2008-05-01 00:00:00

  • Genetic prediction in the Genetic Analysis Workshop 18 sequencing data.

    abstract::High-throughput sequencing data can be used to predict phenotypes from genotypes, and this corresponds to establishing a prognostic model. In extended pedigrees the relatedness of subjects provides additional information so that genetic values, fixed or random genetic components, and heritability can be estimated. At ...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.21826

    authors: Ziegler A,Bohossian N,Diego VP,Yao C

    update_date:2014-09-01 00:00:00

  • Multipoint analysis using affected sib pairs: incorporating linkage evidence from unlinked regions.

    abstract::In this paper, we proposed a multipoint method to assess evidence of linkage to one region by incorporating linkage evidence from another region. This approach uses affected sib pairs in which the number of alleles shared identical by descent (IBD) is the primary statistic. This generalized estimating equation (GEE) a...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.1021

    authors: Liang KY,Chiu YF,Beaty TH,Wjst M

    update_date:2001-09-01 00:00:00

  • Equivalence of the mixed and regressive models for genetic analysis. I. Continuous traits.

    abstract::The mixed model of segregation analysis specifies major gene effects and partitions the residual variance into polygenic and environmental components. The model explains familial correlations essentially in terms of genetic causation. The regressive model, on the other hand, is constructed by successively conditioning...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.1370060505

    authors: Demenais FM,Bonney GE

    update_date:1989-01-01 00:00:00

  • Innovative approach to identify multigenomic and environmental interactions associated with birth defects in family-based hybrid designs.

    abstract::Genes, including those with transgenerational effects, work in concert with behavioral, environmental, and social factors via complex biological networks to determine human health. Understanding complex relationships between causal factors underlying human health is an essential step towards deciphering biological mec...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.22363

    authors: Lou XY,Hou TT,Liu SY,Xu HM,Lin F,Tang X,MacLeod SL,Cleves MA,Hobbs CA

    update_date:2020-09-30 00:00:00

  • Genome-wide approaches for identifying interacting susceptibility regions for asthma.

    abstract::A genome-wide correlation analysis and cluster analysis were utilized to determine chromosomal regions that had similar nonparametric linkage scores across families in order to locate interacting susceptibility loci for asthma. Conditional analysis was performed to detect any increase in lod score over baseline. Eight...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.2001.21.s1.s266

    authors: Colilla S,Tsalenko A,Pluznikov A,Cox NJ

    update_date:2001-01-01 00:00:00

  • Major gene with sex-specific effects influences fat mass in Mexican Americans.

    abstract::Increased adiposity has repeatedly been identified as a major risk factor for a variety of chronic diseases. However, the question still remains whether the amount of adipose tissue itself is genetically mediated. To address this question, a segregation analysis, using maximum likelihood techniques as implemented in t...

    journal_title:Genetic epidemiology

    pub_type: journal article

    doi:10.1002/gepi.1370120505

    authors: Comuzzie AG,Blangero J,Mahaney MC,Mitchell BD,Hixson JE,Samollow PB,Stern MP,MacCluer JW

    update_date:1995-01-01 00:00:00