Genetic background comparison using distance-based regression, with applications in population stratification evaluation and adjustment.

Abstract:

:Population stratification (PS) can lead to an inflated rate of false-positive findings in genome-wide association studies (GWAS). The commonly used approach of adjustment for a fixed number of principal components (PCs) could have a deleterious impact on power when selected PCs are equally distributed in cases and controls, or the adjustment of certain covariates, such as self-identified ethnicity or recruitment center, already included in the association analyses, correctly maps to major axes of genetic heterogeneity. We propose a computationally efficient procedure, PC-Finder, to identify a minimal set of PCs while permitting an effective correction for PS. A general pseudo F statistic, derived from a non-parametric multivariate regression model, can be used to assess whether PS exists or has been adequately corrected by a set of selected PCs. Empirical data from two GWAS conducted as part of the Cancer Genetic Markers of Susceptibility (CGEMS) project demonstrate the application of the procedure. Furthermore, simulation studies show the power advantage of the proposed procedure in GWAS over currently used PS correction strategies, particularly when the PCs with substantial genetic variation are distributed similarly in cases and controls and therefore do not induce PS.

journal_name

Genet Epidemiol

journal_title

Genetic epidemiology

authors

Li Q,Wacholder S,Hunter DJ,Hoover RN,Chanock S,Thomas G,Yu K

doi

10.1002/gepi.20396

subject

Has Abstract

pub_date

2009-07-01 00:00:00

pages

432-41

issue

5

eissn

0741-0395

issn

1098-2272

journal_volume

33

pub_type

杂志文章
  • Projection regression models for multivariate imaging phenotype.

    abstract::This paper presents a projection regression model (PRM) to assess the relationship between a multivariate phenotype and a set of covariates, such as a genetic marker, age, and gender. In the existing literature, a standard statistical approach to this problem is to fit a multivariate linear model to the multivariate p...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21658

    authors: Lin JA,Zhu H,Knickmeyer R,Styner M,Gilmore J,Ibrahim JG

    更新日期:2012-09-01 00:00:00

  • Multipoint analysis using affected sib pairs: incorporating linkage evidence from unlinked regions.

    abstract::In this paper, we proposed a multipoint method to assess evidence of linkage to one region by incorporating linkage evidence from another region. This approach uses affected sib pairs in which the number of alleles shared identical by descent (IBD) is the primary statistic. This generalized estimating equation (GEE) a...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1021

    authors: Liang KY,Chiu YF,Beaty TH,Wjst M

    更新日期:2001-09-01 00:00:00

  • How can maximum likelihood methods reveal candidate gene effects on a quantitative trait?

    abstract::Different maximum likelihood approaches were used to explore the role of candidate genes in the variability of quantitative trait Q1 while accounting for the effects of age, Q2, and Q3. Segregation analysis, under the class D regressive model, provides evidence for a Mendelian gene effect on the adjusted trait Q1. Res...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370120643

    authors: Martinez M,Abel L,Demenais F

    更新日期:1995-01-01 00:00:00

  • Mortality differences by APOE genotype estimated from demographic synthesis.

    abstract::The 4 allele of apolipoprotein E (APOE) is associated with increased risk of two major causes of death in low-mortality populations: ischemic heart disease and Alzheimer's disease. It is less common among centenarians than at younger ages. Therefore, it is likely that it is associated with excess risk of death. This a...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.0164

    authors: Ewbank DC

    更新日期:2002-02-01 00:00:00

  • Family-based association tests for qualitative and quantitative traits using single-nucleotide polymorphism and microsatellite data.

    abstract::Using the Genetic Analysis Workshop 12 simulated data, we contrasted results for association tests in nuclear families and extended pedigrees using single-nucleotide polymorphism (SNP) data, and we compared results for different trait definitions, for outbred and isolate populations, and for SNP and microsatellite dat...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.2001.21.s1.s364

    authors: Wilk JB,Volcjak JS,Myers RH,Maher NE,Knowlton BA,Heard-Costa NL,Demissie S,Cupples LA,DeStefano AL

    更新日期:2001-01-01 00:00:00

  • Entropy-supported marker selection and Mantel statistics for haplotype sharing analysis.

    abstract::Haplotype sharing analysis is a well-established option for the investigation of the etiology of complex diseases. The statistical power of haplotype association methods depends strongly on how the information of unobserved haplotypes can be captured by multilocus genotypes. In this study we combine an entropy-based m...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20491

    authors: Schulz A,Fischer C,Chang-Claude J,Beckmann L

    更新日期:2010-05-01 00:00:00

  • Univariate analysis of dichotomous or ordinal data from twin pairs: a simulation study comparing structural equation modeling and logistic regression.

    abstract::The univariate analysis of categorical twin data can be performed using either structural equation modeling (SEM) or logistic regression. This paper presents a comparison between these two methods using a simulation study. Dichotomous and ordinal (three category) twin data are simulated under two different sample size...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/(SICI)1098-2272(1996)13:1<79::AID-GEPI7>3.

    authors: Ramakrishnan V,Meyer JM,Goldberg J,Henderson WG

    更新日期:1996-01-01 00:00:00

  • Extensions to sib-pair linkage tests applicable to disorders characterized by delayed onset.

    abstract::Extensions of the approach to sib-pair linkage tests developed by Haseman and Elston [Behav Genet 2:3-19, 1972] are proposed which incorporate information on age of onset and age at examination. Alternate sources for the age of onset corrections are described, including models for the estimation of parameters associat...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370070607

    authors: Dawson DV,Kaplan EB,Elston RC

    更新日期:1990-01-01 00:00:00

  • A likelihood ratio-based Mann-Whitney approach finds novel replicable joint gene action for type 2 diabetes.

    abstract::The potential importance of the joint action of genes, whether modeled with or without a statistical interaction term, has long been recognized. However, identifying such action has been a great challenge, especially when millions of genetic markers are involved. We propose a likelihood ratio-based Mann-Whitney test t...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21651

    authors: Lu Q,Wei C,Ye C,Li M,Elston RC

    更新日期:2012-09-01 00:00:00

  • Constructing meiotic maps with known error probability.

    abstract::We propose methods to construct meiotic gene maps while controlling the probability of a decision-error. First, a single step gene ordering procedure is presented whose decision-error probability is bounded above by a prespecified threshold. The bound for the error probability is valid under quite general circumstance...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/(SICI)1098-2272(1999)16:3<274::AID-GEPI4>3

    authors: Rogatko A,Babb J,Jordan H,Zacks S

    更新日期:1999-01-01 00:00:00

  • Pedigree disequilibrium tests for multilocus haplotypes.

    abstract::Association tests of multilocus haplotypes are of interest both in linkage disequilibrium mapping and in candidate gene studies. For case-parent trios, I discuss the extension of existing multilocus methods to include ambiguous haplotypes in tests of models which distinguish between the cis and trans phase. A likeliho...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.10252

    authors: Dudbridge F

    更新日期:2003-09-01 00:00:00

  • Segregation analysis of juvenile myoclonic epilepsy.

    abstract::We examined the inheritance of juvenile myoclonic epilepsy (JME). We looked at both the trait of "epilepsy" and the trait of "epilepsy-plus-EEG abnormalities," since EEG abnormalities are frequently found in the clinically unaffected sibs of JME patients. We tested several modes of inheritance including the fully pene...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370050204

    authors: Greenberg DA,Delgado-Escueta AV,Maldonado HM,Widelitz H

    更新日期:1988-01-01 00:00:00

  • Availability of schizophrenic patients and their families for genetic linkage studies: findings from the Maryland epidemiology sample.

    abstract::It has been suggested that collections of affected sib pairs, or their nuclear families, may be an efficient method for screening for genetic linkages in schizophrenia. We present the data collected in five years from 15 hospitals in the state of Maryland in an effort to determine if such a collection scheme will be f...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370060604

    authors: Pulver AE,Bale SJ

    更新日期:1989-01-01 00:00:00

  • Comparison of two linkage inference procedures for genes related to the P300 component of the event related potential.

    abstract::Our goal was to detect genes contributing to the P300 component of the event related potential (ERP). We found that all of the ERP traits were highly correlated. Most of them distinguished alcoholics from nonalcoholics. To have one summary variable for the ERP traits, we calculated the first principal component (PRIN1...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370170728

    authors: Goldin LR,Chase GA

    更新日期:1999-01-01 00:00:00

  • Genetic heterogeneity in Alzheimer's disease: a grade of membership analysis.

    abstract::Grade of membership analysis (GoM) may have particular relevance for genetic epidemiology. The method can flexibly relate genetic markers, clinical features, and environmental exposures to possible subtypes of disease termed pure types even when population allele frequencies and penetrance functions are not known. Hen...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370100628

    authors: Corder EH,Woodbury MA

    更新日期:1993-01-01 00:00:00

  • Using single nucleotide polymorphisms to investigate association between a candidate gene and disease.

    abstract::A range of study designs, using unrelated or family controls, were used to investigate the pattern of association with disease of single nucleotide polymorphisms (SNPs) within candidate gene 1 (simulated data). Strong evidence of disease association at the functional locus was detected using all study designs, and in ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.2001.21.s1.s415

    authors: Saunders CL,Crockford GP,Bishop DT,Barrett JH

    更新日期:2001-01-01 00:00:00

  • To type or not to type: the use of unaffected siblings in nonparametric linkage analysis.

    abstract::Unaffected individuals are often disregarded in nonparametric linkage analysis. Because of the presumed high complexity of genetic interactions and the resulting low penetrance of any single genetic effect, the statistical contribution of unaffected sib pairs is thought to be considerably lower than that of the affect...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.2001.21.s1.s522

    authors: Majewski J

    更新日期:2001-01-01 00:00:00

  • Testing Hardy-Weinberg equilibrium using mother-child case-control samples.

    abstract::Genetic association studies of obstetric complications may genotype case and control mothers, or their respective newborns, or both case-control mothers and their children. The relatively high prevalence of many obstetric complications and the availability of both maternal and offspring's genotype data have provided m...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20406

    authors: Chen J,Zheng H,Wilson ML,Kraft P

    更新日期:2009-09-01 00:00:00

  • Risk factors for atherosclerosis in twins.

    abstract::We performed multivariate genetic analyses of cardiovascular risk factors from two sets of data on US and Australian female twins. Similar models for body mass index (BMI), serum low density (LDL) and high density (HDL) lipoproteins, including age as a covariate, were fitted successfully to both groups. These suggeste...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370100638

    authors: Duffy DL,O'Connell DL,Heller RF,Martin NG

    更新日期:1993-01-01 00:00:00

  • A Bayesian toolkit for genetic association studies.

    abstract::We present a range of modelling components designed to facilitate Bayesian analysis of genetic-association-study data. A key feature of our approach is the ability to combine different submodels together, almost arbitrarily, for dealing with the complexities of real data. In particular, we propose various techniques f...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20140

    authors: Lunn DJ,Whittaker JC,Best N

    更新日期:2006-04-01 00:00:00

  • Analysis of two-locus traits under heterogeneity for recessive versus dominant inheritance.

    abstract::Complex traits have been modeled under various modes of two-locus inheritance. One example of a two-locus threshold model is the situation where an individual is susceptible to a disease trait if he or she carries three or more disease alleles. Under this model, if each locus is examined individually the inheritance a...

    journal_title:Genetic epidemiology

    pub_type: 临床试验,杂志文章,随机对照试验

    doi:10.1002/(SICI)1098-2272(1997)14:6<1097::AID-GEPI89

    authors: Leal SM,Ott J

    更新日期:1997-01-01 00:00:00

  • Increased risk for familial ovarian cancer among Jewish women: a population-based case-control study.

    abstract::Jewish women have been reported to have a higher risk for familial breast cancer than non-Jewish women and to be more likely to carry mutations in breast cancer genes such as BRCA1. Because BRCA1 mutations also increase women's risk for ovarian cancer, we asked whether Jewish women are at higher risk for familial ovar...

    journal_title:Genetic epidemiology

    pub_type: 临床试验,杂志文章,随机对照试验

    doi:10.1002/(SICI)1098-2272(1998)15:1<51::AID-GEPI4>3.

    authors: Steinberg KK,Pernarelli JM,Marcus M,Khoury MJ,Schildkraut JM,Marchbanks PA

    更新日期:1998-01-01 00:00:00

  • Haplotype kernel association test as a powerful method to identify chromosomal regions harboring uncommon causal variants.

    abstract::For most complex diseases, the fraction of heritability that can be explained by the variants discovered from genome-wide association studies is minor. Although the so-called "rare variants" (minor allele frequency [MAF] < 1%) have attracted increasing attention, they are unlikely to account for much of the "missing h...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21740

    authors: Lin WY,Yi N,Lou XY,Zhi D,Zhang K,Gao G,Tiwari HK,Liu N

    更新日期:2013-09-01 00:00:00

  • Segregation analysis of autosomal dominant polycystic kidney disease.

    abstract::The results of classical segregation analysis on 159 families with polycystic kidney disease (PKD) are presented. It had been previously estimated that about 95% of autosomal dominant PKD (ADPKD) families have PKD1, the gene localized to chromosome 16p. The main purpose of the study was to determine if PKD shows any s...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370100305

    authors: Dobin A,Kimberling WJ,Pettinger W,Bailey-Wilson JE,Shugart YY,Gabow P

    更新日期:1993-01-01 00:00:00

  • A two-locus model for familial Alzheimer's disease?

    abstract::The present findings for familial Alzheimer's disease suggest a possible linkage to gene(s) on chromosome 21 for the early onset form and to chromosome 19 for the late onset. Since these results are not unequivocal, possible alternative hypotheses include the effect of genetic heterogeneity or of an oligogenic model o...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370100618

    authors: Macciardi F,Cavallini MC

    更新日期:1993-01-01 00:00:00

  • Trends in prenatal diagnosis of Down syndrome and other autosomal trisomies in Scotland 1990 to 1994, with associated cytogenetic and epidemiological findings.

    abstract::The present report summarizes findings on 670 cases of autosomal trisomy diagnosed in Scotland, with actual or expected dates of delivery in 1990 to 1994 inclusive. Cases were notified by cytogenetic service laboratories. There were 277 prenatal and 369 postnatal diagnoses and 24 spontaneous losses. Excluding the latt...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/(SICI)1098-2272(1999)16:2<179::AID-GEPI5>3

    authors: Carothers AD,Boyd E,Lowther G,Ellis PM,Couzin DA,Faed MJ,Robb A

    更新日期:1999-01-01 00:00:00

  • Evaluation of path analysis through computer simulation: effect of incorrectly assuming independent distribution of familial correlations.

    abstract::Path analysis of family data has been widely applied to resolve genetic and environmental patterns of familial resemblance. A prevalent statistical approach in path analysis has been, first, to estimate the familial correlations and, second, by assuming these estimates to be independently distributed, define a likelih...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370010305

    authors: McGue M,Wette R,Rao DC

    更新日期:1984-01-01 00:00:00

  • Comparison of empirical strategies to maximize GENEHUNTER lod scores.

    abstract::We compare four strategies for finding the settings of genetic parameters that maximize the lod scores reported in GENEHUNTER 1.2. The four strategies are iterated complete factorial designs, iterated orthogonal Latin hypercubes, evolutionary operation, and numerical optimization. The genetic parameters that are set a...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370170718

    authors: Chen CH,Finch SJ,Mendell NR,Gordon D

    更新日期:1999-01-01 00:00:00

  • Regressive logistic modeling of familial aggregation for asthma in 7,394 population-based nuclear families.

    abstract::The aim of this population-based study was to determine whether asthma aggregates in families, and if so, whether aggregation was consistent with environmental and/or genetic etiologies. Data were from 7,394 nuclear families (41,506 individuals) from the 1968 Tasmanian Asthma Survey, in which all Tasmanian schoolchild...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/(SICI)1098-2272(1997)14:3<317::AID-GEPI9>3

    authors: Jenkins MA,Hopper JL,Giles GG

    更新日期:1997-01-01 00:00:00

  • The role of environmental heterogeneity in meta-analysis of gene-environment interactions with quantitative traits.

    abstract::With challenges in data harmonization and environmental heterogeneity across various data sources, meta-analysis of gene-environment interaction studies can often involve subtle statistical issues. In this paper, we study the effect of environmental covariate heterogeneity (within and between cohorts) on two approache...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21810

    authors: Li S,Mukherjee B,Taylor JM,Rice KM,Wen X,Rice JD,Stringham HM,Boehnke M

    更新日期:2014-07-01 00:00:00