Abstract:
:Population stratification (PS) can lead to an inflated rate of false-positive findings in genome-wide association studies (GWAS). The commonly used approach of adjustment for a fixed number of principal components (PCs) could have a deleterious impact on power when selected PCs are equally distributed in cases and controls, or the adjustment of certain covariates, such as self-identified ethnicity or recruitment center, already included in the association analyses, correctly maps to major axes of genetic heterogeneity. We propose a computationally efficient procedure, PC-Finder, to identify a minimal set of PCs while permitting an effective correction for PS. A general pseudo F statistic, derived from a non-parametric multivariate regression model, can be used to assess whether PS exists or has been adequately corrected by a set of selected PCs. Empirical data from two GWAS conducted as part of the Cancer Genetic Markers of Susceptibility (CGEMS) project demonstrate the application of the procedure. Furthermore, simulation studies show the power advantage of the proposed procedure in GWAS over currently used PS correction strategies, particularly when the PCs with substantial genetic variation are distributed similarly in cases and controls and therefore do not induce PS.
journal_name
Genet Epidemioljournal_title
Genetic epidemiologyauthors
Li Q,Wacholder S,Hunter DJ,Hoover RN,Chanock S,Thomas G,Yu Kdoi
10.1002/gepi.20396subject
Has Abstractpub_date
2009-07-01 00:00:00pages
432-41issue
5eissn
0741-0395issn
1098-2272journal_volume
33pub_type
杂志文章abstract::Advances in high throughput technology have enabled the generation of unprecedented amounts of genomic data (e.g., next-generation sequence data, transcriptomics, metabolomics, and proteomics), which promises to unravel the genetic architecture of complex traits. These discoveries may lead to novel therapeutic targets...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.21768
更新日期:2013-12-01 00:00:00
abstract::alpha 1-antitrypsin (alpha 1 AT) deficiency is variably associated with the development of pulmonary emphysema. To gain insight into the process which begins the Z point mutation at the Protease Inhibitor (Pi) locus and results in the variable development of emphysema, three quantitative phenotypes, including total al...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.1370070204
更新日期:1990-01-01 00:00:00
abstract::In the last two decades, complex traits have become the main focus of genetic studies. The hypothesis that both rare and common variants are associated with complex traits is increasingly being discussed. Family-based association studies using relatively large pedigrees are suitable for both rare and common variant id...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.21844
更新日期:2014-11-01 00:00:00
abstract::A range of study designs, using unrelated or family controls, were used to investigate the pattern of association with disease of single nucleotide polymorphisms (SNPs) within candidate gene 1 (simulated data). Strong evidence of disease association at the functional locus was detected using all study designs, and in ...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.2001.21.s1.s415
更新日期:2001-01-01 00:00:00
abstract::We used data from a population based series of breast cancer patients to investigate the genetic models that can best explain familial breast cancer not due to the BRCA1 and BRCA2 genes. The data set consisted of 1,484 women diagnosed with breast cancer under age 55 registered in the East Anglia Cancer registry betwee...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.1014
更新日期:2001-07-01 00:00:00
abstract::In the genotyped-proband design, a proband is selected based on an observed phenotype, the genotype of the proband is observed, and then the phenotypes of all first-degree relatives are obtained. The genotypes of these first-degree relatives are not observed. Gail et al. [(1999) Genet Epidemiol] discuss likelihood ana...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/(SICI)1098-2272(200004)18:4<293::AID-GEPI3
更新日期:2000-04-01 00:00:00
abstract::Genome-wide association studies (GWAS) have identified many single nucleotide polymorphisms (SNPs) associated with complex traits. However, the genetic heritability of most of these traits remains unexplained. To help guide future studies, we address the crucial question of whether future GWAS can detect new SNP assoc...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.21724
更新日期:2013-05-01 00:00:00
abstract::The linkage between electronic health records (EHRs) and genotype data makes it plausible to study the genetic susceptibility of a wide range of disease phenotypes. Despite that EHR-derived phenotype data are subjected to misclassification, it has been shown useful for discovering susceptible genes, particularly in th...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.22080
更新日期:2017-12-01 00:00:00
abstract::Linkage analyses and association studies were employed to detect disease susceptibility loci leading to elevated Q1 levels in Problem 2B. Phenotypes were defined to be the dichotomous affection status, the quantitative value for Q1, and Q1 adjusted for covariates. The method of mod-scores (for the dichotomous phenotyp...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/(SICI)1098-2272(1997)14:6<1035::AID-GEPI79
更新日期:1997-01-01 00:00:00
abstract::Multilocus linkage disequilibrium (LD) tests that consider inter-marker (LD) are more powerful than single-locus tests when disease etiology is contributed simultaneously by several linked and correlated loci. However, inclusion of redundant non-informative markers may result in reduced testing power and/or inflated f...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20165
更新日期:2006-09-01 00:00:00
abstract::Testing association between a genetic marker and multiple-dependent traits is a challenging task when both binary and quantitative traits are involved. The inverted regression model is a convenient method, in which the traits are treated as predictors although the genetic marker is an ordinal response. It is known tha...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.21738
更新日期:2013-09-01 00:00:00
abstract::We combined the five chromosome 18 bipolar affective disorder data sets provided by GAW10, totaling 185 families with 3,394 individuals, and performed analysis of differential parental transmission and chromosome 18 marker allele sharing in families with transmission through fathers vs those through mothers. Results i...
journal_title:Genetic epidemiology
pub_type: 临床试验,杂志文章
doi:10.1002/(SICI)1098-2272(1997)14:6<665::AID-GEPI19>
更新日期:1997-01-01 00:00:00
abstract::We examined the power of the stepwise iterated generalized least squares (GLS) method by modeling the relationship between quantitative traits and other variables using the simulated data for Problem 2A. The comparison between the generating model provided by the workshop and the results of the stepwise iterated GLS m...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/(SICI)1098-2272(1997)14:6<797::AID-GEPI39>
更新日期:1997-01-01 00:00:00
abstract::Asthma and atopy are two closely related, common complex traits in which a number of genetic and environmental factors are suspected to play a role. We have performed parametric and nonparametric multi-marker linkage analysis for the Busselton data set, which is part of problem 1 of Genetic Analysis Workshop 12. In pa...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.2001.21.s1.s204
更新日期:2001-01-01 00:00:00
abstract::Complex traits have been modeled under various modes of two-locus inheritance. One example of a two-locus threshold model is the situation where an individual is susceptible to a disease trait if he or she carries three or more disease alleles. Under this model, if each locus is examined individually the inheritance a...
journal_title:Genetic epidemiology
pub_type: 临床试验,杂志文章,随机对照试验
doi:10.1002/(SICI)1098-2272(1997)14:6<1097::AID-GEPI89
更新日期:1997-01-01 00:00:00
abstract::Power estimations are important for optimizing genotype-phenotype association study designs. However, existing frameworks are designed for common disorders, and thus ill-suited for the inherent challenges of studies for low-prevalence conditions such as rare diseases and infrequent adverse drug reactions. These challe...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.22129
更新日期:2018-07-01 00:00:00
abstract::The restricted partition method (RPM) is a partitioning algorithm for examining multi-locus genotypes as (potentially non-additive) predictors of a quantitative trait. The motivating application was to develop a robust method to examine quantitative phenotypes for epistasis (gene-gene interactions), but the method can...
journal_title:Genetic epidemiology
pub_type: 杂志文章,评审
doi:10.1002/gepi.20006
更新日期:2004-09-01 00:00:00
abstract::We address the analytical problem of evaluating the evidence for linkage at a test locus while taking into account the effect of a known linked disease locus. The method we propose is a multimarker regression approach that models the identity-by-descent states for affected sib-pairs at a series of linked markers in te...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20137
更新日期:2006-04-01 00:00:00
abstract::The univariate analysis of categorical twin data can be performed using either structural equation modeling (SEM) or logistic regression. This paper presents a comparison between these two methods using a simulation study. Dichotomous and ordinal (three category) twin data are simulated under two different sample size...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/(SICI)1098-2272(1996)13:1<79::AID-GEPI7>3.
更新日期:1996-01-01 00:00:00
abstract::Gene-gene interaction is believed to play an important role in understanding complex traits. Multifactor dimensionality reduction (MDR) was proposed by Ritchie et al. [2001. Am J Hum Genet 69:138-147] to identify multiple loci that simultaneously affect disease susceptibility. Although the MDR method has been widely u...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20416
更新日期:2009-11-01 00:00:00
abstract::A genetic epidemiologic investigation of breast cancer involving 389 breast cancer pedigrees including information on 14,721 individuals from the Icelandic population-based cancer registry is presented. Probands were women born in or after 1920 and reported to have breast cancer in the cancer registry. The average age...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/(SICI)1098-2272(200001)18:1<81::AID-GEPI6>
更新日期:2000-01-01 00:00:00
abstract::For most complex diseases, the fraction of heritability that can be explained by the variants discovered from genome-wide association studies is minor. Although the so-called "rare variants" (minor allele frequency [MAF] < 1%) have attracted increasing attention, they are unlikely to account for much of the "missing h...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.21740
更新日期:2013-09-01 00:00:00
abstract::Previous transcriptome-wide association studies (TWAS) have identified breast cancer risk genes by integrating data from expression quantitative loci and genome-wide association studies (GWAS), but analyses of breast cancer subtype-specific associations have been limited. In this study, we conducted a TWAS using gene ...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.22288
更新日期:2020-07-01 00:00:00
abstract::Complex diseases are presumed to be the results of interactions of several genes and environmental factors, with each gene only having a small effect on the disease. Thus, the methods that can account for gene-gene interactions to search for a set of marker loci in different genes or across genome and to analyze these...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20304
更新日期:2008-05-01 00:00:00
abstract::High-throughput sequencing data can be used to predict phenotypes from genotypes, and this corresponds to establishing a prognostic model. In extended pedigrees the relatedness of subjects provides additional information so that genetic values, fixed or random genetic components, and heritability can be estimated. At ...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.21826
更新日期:2014-09-01 00:00:00
abstract::In this paper, we proposed a multipoint method to assess evidence of linkage to one region by incorporating linkage evidence from another region. This approach uses affected sib pairs in which the number of alleles shared identical by descent (IBD) is the primary statistic. This generalized estimating equation (GEE) a...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.1021
更新日期:2001-09-01 00:00:00
abstract::The mixed model of segregation analysis specifies major gene effects and partitions the residual variance into polygenic and environmental components. The model explains familial correlations essentially in terms of genetic causation. The regressive model, on the other hand, is constructed by successively conditioning...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.1370060505
更新日期:1989-01-01 00:00:00
abstract::Genes, including those with transgenerational effects, work in concert with behavioral, environmental, and social factors via complex biological networks to determine human health. Understanding complex relationships between causal factors underlying human health is an essential step towards deciphering biological mec...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.22363
更新日期:2020-09-30 00:00:00
abstract::A genome-wide correlation analysis and cluster analysis were utilized to determine chromosomal regions that had similar nonparametric linkage scores across families in order to locate interacting susceptibility loci for asthma. Conditional analysis was performed to detect any increase in lod score over baseline. Eight...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.2001.21.s1.s266
更新日期:2001-01-01 00:00:00
abstract::Increased adiposity has repeatedly been identified as a major risk factor for a variety of chronic diseases. However, the question still remains whether the amount of adipose tissue itself is genetically mediated. To address this question, a segregation analysis, using maximum likelihood techniques as implemented in t...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.1370120505
更新日期:1995-01-01 00:00:00