A flexible and parallelizable approach to genome-wide polygenic risk scores.

Abstract:

:The heritability of most complex traits is driven by variants throughout the genome. Consequently, polygenic risk scores, which combine information on multiple variants genome-wide, have demonstrated improved accuracy in genetic risk prediction. We present a new two-step approach to constructing genome-wide polygenic risk scores from meta-GWAS summary statistics. Local linkage disequilibrium (LD) is adjusted for in Step 1, followed by, uniquely, long-range LD in Step 2. Our algorithm is highly parallelizable since block-wise analyses in Step 1 can be distributed across a high-performance computing cluster, and flexible, since sparsity and heritability are estimated within each block. Inference is obtained through a formal Bayesian variable selection framework, meaning final risk predictions are averaged over competing models. We compared our method to two alternative approaches: LDPred and lassosum using all seven traits in the Welcome Trust Case Control Consortium as well as meta-GWAS summaries for type 1 diabetes (T1D), coronary artery disease, and schizophrenia. Performance was generally similar across methods, although our framework provided more accurate predictions for T1D, for which there are multiple heterogeneous signals in regions of both short- and long-range LD. With sufficient compute resources, our method also allows the fastest runtimes.

journal_name

Genet Epidemiol

journal_title

Genetic epidemiology

authors

Newcombe PJ,Nelson CP,Samani NJ,Dudbridge F

doi

10.1002/gepi.22245

subject

Has Abstract

pub_date

2019-10-01 00:00:00

pages

730-741

issue

7

eissn

0741-0395

issn

1098-2272

journal_volume

43

pub_type

杂志文章
  • Analysis of bipolar disorder using affected relatives.

    abstract::We have analyzed the GAW10 data from several studies of bipolar affective disorder (BPAD) using the software packages SimIBD and SIMWALK2. SimIBD implements a simulation-based affected-pedigree-member (APM) statistic, called SimAPM, as well as an APM-like statistic, also called SimIBD, that measures identical-by-desce...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/(SICI)1098-2272(1997)14:6<605::AID-GEPI9>3

    authors: Davis S,Sobel E,Marinov M,Weeks DE

    更新日期:1997-01-01 00:00:00

  • Explorative two-locus linkage analysis suggests a multiplicative interaction between the 7q32 and 16p13 myoclonic seizures-related photosensitivity loci.

    abstract::In traits suspected to be governed by at least two loci, linkage analysis incorporating the joint action of both loci may improve the power to detect linkage, increase the precision of estimating locus positions and provide insight into the underlying etiological mechanism. Recently, we mapped two susceptibility loci ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20190

    authors: Pinto D,Kasteleijn-Nolst Trenité DG,Cordell HJ,Mattheisen M,Strauch K,Lindhout D,Koeleman BP

    更新日期:2007-01-01 00:00:00

  • Comparison of the QTDT analysis for IgE in the CSGA data set.

    abstract::Over the past few years at least 13 transmission/disequilibrium test (TDT)-based tests have been developed for quantitative (Q) traits for the assessment of association or linkage in the presence of the other. A total of six of these QTDT methods were used to analyze log10IgE in the Collaborative Study on the Genetics...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.2001.21.s1.s312

    authors: Page GP,Wilcox MA,Occhiuto J,Adak S,Neuberg D,Bajorunaite R,George V

    更新日期:2001-01-01 00:00:00

  • Gene-dropping vs. empirical variance estimation for allele-sharing linkage statistics.

    abstract::In this study, we compare the statistical properties of a number of methods for estimating P-values for allele-sharing statistics in non-parametric linkage analysis. Some of the methods are based on the normality assumption, using different variance estimation methods, and others use simulation (gene-dropping) to find...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20177

    authors: Jung J,Weeks DE,Feingold E

    更新日期:2006-12-01 00:00:00

  • Comparison of two linkage inference procedures for genes related to the P300 component of the event related potential.

    abstract::Our goal was to detect genes contributing to the P300 component of the event related potential (ERP). We found that all of the ERP traits were highly correlated. Most of them distinguished alcoholics from nonalcoholics. To have one summary variable for the ERP traits, we calculated the first principal component (PRIN1...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370170728

    authors: Goldin LR,Chase GA

    更新日期:1999-01-01 00:00:00

  • Increasing the power of identifying gene x gene interactions in genome-wide association studies.

    abstract::In this paper we investigate the power to identify gene x gene interactions in genome-wide association studies. In our analysis we focus on two-stage analyses: analyses in which we only test for interactions between single nucleotide polymorphisms that show some marginal effect. We give two algorithms to compute signi...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20300

    authors: Kooperberg C,Leblanc M

    更新日期:2008-04-01 00:00:00

  • Genetic and environmental causes of variation in renal tubular handling of sodium and potassium: a twin study.

    abstract::We have conducted a study of renal sodium and potassium reabsorption in 205 pairs of twins on freely chosen diets; 89 of the subjects were studied on more than one occasion. Renal tubular sodium and potassium handling, as measured by the fractional excretions FENa and FEK, show repeatable differences between individua...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370020103

    authors: Whitfield JB,Martin NG

    更新日期:1985-01-01 00:00:00

  • Evaluation of genetic and environmental effects using GEE and APM methods.

    abstract::Two analytic methods were used in the Problem 2 data set. First, generalized estimating equations (GEE) modelling was developed to adjust for familial correlation in regressions evaluating candidate genes and an environmental factor. Second, the affected-pedigree-member (APM) method was used to identify chromosomal re...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370120633

    authors: Bull SB,Chapman NH,Greenwood CM,Darlington GA

    更新日期:1995-01-01 00:00:00

  • An efficient study design to test parent-of-origin effects in family trios.

    abstract::Increasing evidence has shown that genes may cause prenatal, neonatal, and pediatric diseases depending on their parental origins. Statistical models that incorporate parent-of-origin effects (POEs) can improve the power of detecting disease-associated genes and help explain the missing heritability of diseases. In ma...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.22060

    authors: Yu X,Chen G,Feng R

    更新日期:2017-11-01 00:00:00

  • Analysis of multiple phenotypes.

    abstract::The complex etiology of common diseases like cardiovascular disease, diabetes, hypertension, and rheumatoid arthritis has led investigators to focus on the genetics of correlated phenotypes and risk factors. Joint analysis of multiple disease-related phenotypes may reveal genes of pleiotropic effect and increase analy...

    journal_title:Genetic epidemiology

    pub_type:

    doi:10.1002/gepi.20470

    authors: Kent JW Jr

    更新日期:2009-01-01 00:00:00

  • Accounting for population stratification in DNA methylation studies.

    abstract::DNA methylation is an important epigenetic mechanism that has been linked to complex diseases and is of great interest to researchers as a potential link between genome, environment, and disease. As the scale of DNA methylation association studies approaches that of genome-wide association studies, issues such as popu...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21789

    authors: Barfield RT,Almli LM,Kilaru V,Smith AK,Mercer KB,Duncan R,Klengel T,Mehta D,Binder EB,Epstein MP,Ressler KJ,Conneely KN

    更新日期:2014-04-01 00:00:00

  • Linear trend tests for case-control genetic association that incorporate random phenotype and genotype misclassification error.

    abstract::The purpose of this work is the development of linear trend tests that allow for error (LTT ae), specifically incorporating double-sampling information on phenotypes and/or genotypes. We use a likelihood framework. Misclassification errors are estimated via double sampling. Unbiased estimates of penetrances and genoty...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20246

    authors: Gordon D,Haynes C,Yang Y,Kramer PL,Finch SJ

    更新日期:2007-12-01 00:00:00

  • Increased risk for familial ovarian cancer among Jewish women: a population-based case-control study.

    abstract::Jewish women have been reported to have a higher risk for familial breast cancer than non-Jewish women and to be more likely to carry mutations in breast cancer genes such as BRCA1. Because BRCA1 mutations also increase women's risk for ovarian cancer, we asked whether Jewish women are at higher risk for familial ovar...

    journal_title:Genetic epidemiology

    pub_type: 临床试验,杂志文章,随机对照试验

    doi:10.1002/(SICI)1098-2272(1998)15:1<51::AID-GEPI4>3.

    authors: Steinberg KK,Pernarelli JM,Marcus M,Khoury MJ,Schildkraut JM,Marchbanks PA

    更新日期:1998-01-01 00:00:00

  • Mantel statistics to correlate gene expression levels from microarrays with clinical covariates.

    abstract::Mantel statistics provide an additional step to standard approaches in the analysis of gene expression and covariate data, allow the calculation of standard statistics such as correlation, partial correlation, and regression coefficients, and, with permutation tests, provide P values for these statistics to relate the...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1115

    authors: Shannon WD,Watson MA,Perry A,Rich K

    更新日期:2002-06-01 00:00:00

  • A multipoint method for meta-analysis of genetic association studies.

    abstract::Meta-analyses of genetic association studies are usually performed using a single polymorphism at a time, even though in many cases the individual studies report results from partially overlapping sets of polymorphisms. We present here a multipoint (or multilocus) method for multivariate meta-analysis of published pop...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20531

    authors: Bagos PG,Liakopoulos TD

    更新日期:2010-11-01 00:00:00

  • Tag SNPs chosen from HapMap perform well in several population isolates.

    abstract::Population isolates may be particularly useful for association studies of complex traits. This utility, however, largely depends on the transferability of tag SNPs chosen from reference samples, such as HapMap, to samples from such populations. Factors that characterize population isolates, such as widespread genetic ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20201

    authors: Service S,International Collaborative Group on Isolated Populations.,Sabatti C,Freimer N

    更新日期:2007-04-01 00:00:00

  • Efficient computation of patterned covariance matrix mixed models in quantitative segregation analysis.

    abstract::The use of patterned covariance matrices in forming pedigree-based mixed models for quantitative traits is discussed. It is suggested that patterned covariance matrix models provide intuitive, theoretically appealing, and flexible genetic modeling devices for pedigree data. It is suggested further that the very great ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370080104

    authors: Schork N

    更新日期:1991-01-01 00:00:00

  • Meta-Analysis of Rare Variant Association Tests in Multiethnic Populations.

    abstract::Several methods have been proposed to increase power in rare variant association testing by aggregating information from individual rare variants (MAF < 0.005). However, how to best combine rare variants across multiple ethnicities and the relative performance of designs using different ethnic sampling fractions remai...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21939

    authors: Mensah-Ablorh A,Lindstrom S,Haiman CA,Henderson BE,Marchand LL,Lee S,Stram DO,Eliassen AH,Price A,Kraft P

    更新日期:2016-01-01 00:00:00

  • SNP selection in genome-wide and candidate gene studies via penalized logistic regression.

    abstract::Penalized regression methods offer an attractive alternative to single marker testing in genetic association analysis. Penalized regression methods shrink down to zero the coefficient of markers that have little apparent effect on the trait of interest, resulting in a parsimonious subset of what we hope are true perti...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20543

    authors: Ayers KL,Cordell HJ

    更新日期:2010-12-01 00:00:00

  • Identifying SNPs predictive of phenotype using random forests.

    abstract::There has been a great interest and a few successes in the identification of complex disease susceptibility genes in recent years. Association studies, where a large number of single-nucleotide polymorphisms (SNPs) are typed in a sample of cases and controls to determine which genes are associated with a specific dise...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20041

    authors: Bureau A,Dupuis J,Falls K,Lunetta KL,Hayward B,Keith TP,Van Eerdewegh P

    更新日期:2005-02-01 00:00:00

  • Genome-wide approaches for identifying interacting susceptibility regions for asthma.

    abstract::A genome-wide correlation analysis and cluster analysis were utilized to determine chromosomal regions that had similar nonparametric linkage scores across families in order to locate interacting susceptibility loci for asthma. Conditional analysis was performed to detect any increase in lod score over baseline. Eight...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.2001.21.s1.s266

    authors: Colilla S,Tsalenko A,Pluznikov A,Cox NJ

    更新日期:2001-01-01 00:00:00

  • Demonstration of a common major gene with pleiotropic effects on immunoglobulin E levels and allergy.

    abstract::Atopic disease is generally recognized to be familial, although specific genetic components have yet to be identified. High levels of a unique class of immunoglobulins, immunoglobulin E (IgE), have been shown to be associated with allergies. Several investigators have reported evidence indicating a recessive regulator...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370020402

    authors: Borecki IB,Rao DC,Lalouel JM,McGue M,Gerrard JW

    更新日期:1985-01-01 00:00:00

  • Combining family- and population-based imputation data for association analysis of rare and common variants in large pedigrees.

    abstract::In the last two decades, complex traits have become the main focus of genetic studies. The hypothesis that both rare and common variants are associated with complex traits is increasingly being discussed. Family-based association studies using relatively large pedigrees are suitable for both rare and common variant id...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21844

    authors: Saad M,Wijsman EM

    更新日期:2014-11-01 00:00:00

  • Power and sample size calculations for SNP association studies with censored time-to-event outcomes.

    abstract::For many clinical studies in cancer, germline DNA is prospectively collected for the purpose of discovering or validating single-nucleotide polymorphisms (SNPs) associated with clinical outcomes. The primary clinical endpoint for many of these studies are time-to-event outcomes such as time of death or disease progres...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21645

    authors: Owzar K,Li Z,Cox N,Jung SH

    更新日期:2012-09-01 00:00:00

  • Ordered multinomial regression for genetic association analysis of ordinal phenotypes at Biobank scale.

    abstract::Logistic regression is the primary analysis tool for binary traits in genome-wide association studies (GWAS). Multinomial regression extends logistic regression to multiple categories. However, many phenotypes more naturally take ordered, discrete values. Examples include (a) subtypes defined from multiple sources of ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.22276

    authors: German CA,Sinsheimer JS,Klimentidis YC,Zhou H,Zhou JJ

    更新日期:2020-04-01 00:00:00

  • Propensity score-based nonparametric test revealing genetic variants underlying bipolar disorder.

    abstract::Association analysis has led to the identification of many genetic variants for complex diseases. While assessing the association between genes and a disease, other factors can play an important role. The consequence of not considering covariates (such as population stratification and environmental factors) is well-do...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20558

    authors: Jiang Y,Zhang H

    更新日期:2011-02-01 00:00:00

  • The impact of improved microarray coverage and larger sample sizes on future genome-wide association studies.

    abstract::Genome-wide association studies (GWAS) have identified many single nucleotide polymorphisms (SNPs) associated with complex traits. However, the genetic heritability of most of these traits remains unexplained. To help guide future studies, we address the crucial question of whether future GWAS can detect new SNP assoc...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21724

    authors: Lindquist KJ,Jorgenson E,Hoffmann TJ,Witte JS

    更新日期:2013-05-01 00:00:00

  • Genome-wide linkage analysis using genetic variance components of alcohol dependency-associated censored and continuous traits.

    abstract::We used variance-components analysis to investigate the additive genetic effects regulating some of the phenotypes included in the GAW11 data set. Variance-components models were fitted using Gibbs sampling methods in BUGS v 0.6. Linkage analyses for both multivariate normal (MvN) traits and right censored survival ti...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370170748

    authors: Palmer LJ,Tiller KJ,Burton PR

    更新日期:1999-01-01 00:00:00

  • Effect of linkage disequilibrium between markers in linkage and association analyses.

    abstract::Contributions to Group 17 of the Genetic Analysis Workshop 15 considered dense markers in linkage disequilibrium (LD) in the context of either linkage or association analysis. Three contributions reported on methods for modeling LD or selecting a subset of markers in linkage equilibrium to perform linkage analysis. Wh...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20291

    authors: Dupuis J,Albers K,Allen-Brady K,Cho K,Elston RC,Kappen HJ,Tang H,Thomas A,Thomson G,Tsung E,Yang Q,Zhang W,Zhao K,Zheng G,Ziegler JT

    更新日期:2007-01-01 00:00:00

  • Multifactorial disease risk calculator: Risk prediction for multifactorial disease pedigrees.

    abstract::Construction of multifactorial disease models from epidemiological findings and their application to disease pedigrees for risk prediction is nontrivial for all but the simplest of cases. Multifactorial Disease Risk Calculator is a web tool facilitating this. It provides a user-friendly interface, extending a reported...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.22101

    authors: Campbell DD,Li Y,Sham PC

    更新日期:2018-03-01 00:00:00