Abstract:
:Association analysis has led to the identification of many genetic variants for complex diseases. While assessing the association between genes and a disease, other factors can play an important role. The consequence of not considering covariates (such as population stratification and environmental factors) is well-documented in genetic studies. We introduce a nonparametric test of association that adjusts for covariate effects. Specifically, the adjustment is realized through weights that are constructed from genomic propensity scores that summarize the contribution of all covariates. The benefit of our test is demonstrated through an important data set on bipolar disorder (BD) collected by the Wellcome Trust Case Control Consortium. When compared to other tests, our test identified an unreported region with three single nucleotide polymorphisms (SNPs) on chromosome 16 that show strong evidence of association (P-value <5 × 10(-7)). This region is near the RPGRIP1L gene known to be associated with BD. A haplotype block including these three SNPs was further discovered to be strongly associated with BD. It is also interesting to note that our nonparametric test did not reveal strong signals at two SNPs that were detected by a covariate-adjusted parametric test. This suggests that different methods of covariate adjustment can complement each other. Thus, we recommend using both parametric and nonparametric testing. Additionally, we performed simulation studies to compare our proposed test with the unadjusted test and an adjusted parametric test. Our finding underscores the importance of accommodating and controlling for covariate effects in discovering genetic variants associated with complex disorders.
journal_name
Genet Epidemioljournal_title
Genetic epidemiologyauthors
Jiang Y,Zhang Hdoi
10.1002/gepi.20558subject
Has Abstractpub_date
2011-02-01 00:00:00pages
125-32issue
2eissn
0741-0395issn
1098-2272journal_volume
35pub_type
杂志文章abstract::The recent successes of genome-wide association studies (GWAS) have renewed interest in genome environment wide interaction studies (GEWIS) to discover genetic factors that modulate penetrance of environmental exposures to human diseases. Indeed, gene-environment interactions (G × E), which have not been emphasized in...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.21890
更新日期:2015-07-01 00:00:00
abstract::Genetic Analysis Workshop 17 (GAW17) focused on the transition from genome-wide association study designs and methods to the study designs and statistical genetic methods that will be required for the analysis of next-generation sequence data including both common and rare sequence variants. In the 166 contributions t...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20659
更新日期:2011-01-01 00:00:00
abstract::Emerging evidence suggests that a genetic variant can affect multiple phenotypes, especially in complex human diseases. Therefore, joint analysis of multiple phenotypes may offer new insights into disease etiology. Recently, many statistical methods have been developed for joint analysis of multiple phenotypes, includ...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.22263
更新日期:2020-01-01 00:00:00
abstract::Haplotype-sharing was examined in sets of affected siblings in the Breast Cancer Linkage Consortium pedigrees [Easton et al., 1993], using both identity-by-descent and identity-by-state methods. Linkage of the disease susceptibility locus to markers on chromosome 17 was confirmed. Substantial genetic heterogeneity was...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.1370120653
更新日期:1995-01-01 00:00:00
abstract::Genes with imprinting (parent-of-origin) effects express differently when inheriting from the mother or from the father. Some genes for development and behavior in mammals are known to be imprinted. We developed parametric linkage analysis that accounts for imprinting effects for continuous traits, implementing it in ...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20321
更新日期:2008-07-01 00:00:00
abstract::The asymptotic distribution of [MOD] scores under the null hypothesis of no linkage is only known for affected sib pairs and other types of affected relative pairs. We have extended the GENEHUNTER-MODSCORE program to allow for simulations under the null hypothesis of no linkage to determine the empirical significance ...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20264
更新日期:2008-01-01 00:00:00
abstract::We compare four strategies for finding the settings of genetic parameters that maximize the lod scores reported in GENEHUNTER 1.2. The four strategies are iterated complete factorial designs, iterated orthogonal Latin hypercubes, evolutionary operation, and numerical optimization. The genetic parameters that are set a...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.1370170718
更新日期:1999-01-01 00:00:00
abstract::Complex diseases are presumed to be the results of interactions of several genes and environmental factors, with each gene only having a small effect on the disease. Thus, the methods that can account for gene-gene interactions to search for a set of marker loci in different genes or across genome and to analyze these...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20304
更新日期:2008-05-01 00:00:00
abstract::Three characteristics of genetic epidemiology that distinguish it from its parent disciplines are a focus on population-based research, a focus on the joint effects of genes and the environment, and the incorporation of the underlying biology of the disease into its conceptual models. These principles are illustrated ...
journal_title:Genetic epidemiology
pub_type:
doi:10.1002/1098-2272(200012)19:4<289::AID-GEPI2>3.0.C
更新日期:2000-12-01 00:00:00
abstract::In the analysis of gene expression data, dimension reduction techniques have been extensively adopted. The most popular one is perhaps the PCA (principal component analysis). To generate more reliable and more interpretable results, the SPCA (sparse PCA) technique has been developed. With the "small sample size, high ...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.22089
更新日期:2017-12-01 00:00:00
abstract::Population isolates may be particularly useful for association studies of complex traits. This utility, however, largely depends on the transferability of tag SNPs chosen from reference samples, such as HapMap, to samples from such populations. Factors that characterize population isolates, such as widespread genetic ...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20201
更新日期:2007-04-01 00:00:00
abstract::Construction of multifactorial disease models from epidemiological findings and their application to disease pedigrees for risk prediction is nontrivial for all but the simplest of cases. Multifactorial Disease Risk Calculator is a web tool facilitating this. It provides a user-friendly interface, extending a reported...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.22101
更新日期:2018-03-01 00:00:00
abstract::Mantel statistics provide an additional step to standard approaches in the analysis of gene expression and covariate data, allow the calculation of standard statistics such as correlation, partial correlation, and regression coefficients, and, with permutation tests, provide P values for these statistics to relate the...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.1115
更新日期:2002-06-01 00:00:00
abstract::We set out to apply conventional analytic methods to a GAW data set of nuclear families with an oligogenic disease that has a population prevalence of 0.023. We chose methods generally applied to disorders with at least one major gene. Our approaches included: 1) complex segregation analysis under two models of ascert...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.1370120613
更新日期:1995-01-01 00:00:00
abstract::A dense set of 5,000 SNPs on a 10-Mb region of human chromosome 20 has been typed on samples of African Americans, East Asians, and United Kingdom Caucasians. There are departures from Hardy-Weinberg equilibrium beyond the level at which markers are often discarded because of possible genotyping errors. The observatio...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20038
更新日期:2004-12-01 00:00:00
abstract::Methods for genetic risk prediction have been widely investigated in recent years. However, most available training data involves European samples, and it is currently unclear how to accurately predict disease risk in other populations. Previous studies have used either training data from European samples in large sam...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.22083
更新日期:2017-12-01 00:00:00
abstract::HLA-A, -B, -C, -DR, and -DQ typings of the Schmiedeleut Hutterites of South Dakota were collected as part of an ongoing genetic-epidemiologic study of HLA and fertility. A total of 1,082 individuals, including 852 married adults representative of the reproductive population of this isolate, were characterized for five...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.1370120106
更新日期:1995-01-01 00:00:00
abstract::Imputation is widely used for obtaining information about rare variants. However, one issue concerning imputation is the low accuracy of imputed rare variants as the inaccurate imputed rare variants may distort the results of region-based association tests. Therefore, we developed a pre-collapsing imputation method (P...
journal_title:Genetic epidemiology
pub_type: 杂志文章,多中心研究
doi:10.1002/gepi.22020
更新日期:2017-01-01 00:00:00
abstract::Results of studies for the association of BRCA1 genotypes and haplotypes with sporadic breast cancer have been inconsistent. Therefore, a candidate single nucleotide polymorphism (SNP) approach was used in a breast cancer case-control study to explore genotypes and haplotypes that have the potential to affect protein ...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.21730
更新日期:2013-07-01 00:00:00
abstract::For many clinical studies in cancer, germline DNA is prospectively collected for the purpose of discovering or validating single-nucleotide polymorphisms (SNPs) associated with clinical outcomes. The primary clinical endpoint for many of these studies are time-to-event outcomes such as time of death or disease progres...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.21645
更新日期:2012-09-01 00:00:00
abstract::In this paper we investigate the power to identify gene x gene interactions in genome-wide association studies. In our analysis we focus on two-stage analyses: analyses in which we only test for interactions between single nucleotide polymorphisms that show some marginal effect. We give two algorithms to compute signi...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20300
更新日期:2008-04-01 00:00:00
abstract::Sub-Saharan Africa has been identified as the part of the world with the greatest human genetic diversity. This high level of diversity causes difficulties for genome-wide association (GWA) studies in African populations-for example, by reducing the accuracy of genotype imputation in African populations compared to no...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20626
更新日期:2011-12-01 00:00:00
abstract::We address the analytical problem of evaluating the evidence for linkage at a test locus while taking into account the effect of a known linked disease locus. The method we propose is a multimarker regression approach that models the identity-by-descent states for affected sib-pairs at a series of linked markers in te...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20137
更新日期:2006-04-01 00:00:00
abstract::Using a recently developed semiparametric method for combined linkage/linkage-disequilibrium analysis, we analyzed the Collaborative Study on the Genetics of Alcoholism data subset developed for Genetic Analysis Workshop 11 (GAW11). This semiparametric approach estimates recombination fractions for linkage, marker log...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.1370170708
更新日期:1999-01-01 00:00:00
abstract::Multifactor dimensionality reduction (MDR) was developed as a nonparametric and model-free data mining method for detecting, characterizing, and interpreting epistasis in the absence of significant main effects in genetic and epidemiologic studies of complex traits such as disease susceptibility. The goal of MDR is to...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20360
更新日期:2009-01-01 00:00:00
abstract::Testing for association between two random vectors is a common and important task in many fields, however, existing tests, such as Escoufier's RV test, are suitable only for low-dimensional data, not for high-dimensional data. In moderate to high dimensions, it is necessary to consider sparse signals, which are often ...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.22059
更新日期:2017-11-01 00:00:00
abstract::Methods to account for population structure (PS) in genome-wide association studies have been well developed in samples of unrelated individuals, but when a sample is composed of families, the task of finding and accounting for PS is not as straight forward. Family-based tests that condition on parental genotypes or t...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20590
更新日期:2011-09-01 00:00:00
abstract::Human apolipoprotein A-IV (APO A-IV) exhibits a common protein polymorphism detectable by isoelectric focusing (IEF) due to a single base substitution at codon 360 which replaces the frequently occurring glutamine residue (allele 1) with histidine (allele 2). Recently, sequence analysis of the APO A-IV coding region h...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.1370090503
更新日期:1992-01-01 00:00:00
abstract::Next-generation sequencing (NGS) has led to the study of rare genetic variants, which possibly explain the missing heritability for complex diseases. Most existing methods for rare variant (RV) association detection do not account for the common presence of sequencing errors in NGS data. The errors can largely affect ...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.21871
更新日期:2015-02-01 00:00:00
abstract::Standard linear regression is commonly used for genetic association studies of quantitative traits. This approach may not be appropriate if the trait, on its original or transformed scales, does not follow a normal distribution. A rank-based nonparametric approach that does not rely on any distributional assumptions c...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.21723
更新日期:2013-05-01 00:00:00