Pleiotropy informed adaptive association test of multiple traits using genome-wide association study summary data.

Abstract:

:Genetic variants associated with disease outcomes can be used to develop personalized treatment. To reach this precision medicine goal, hundreds of large-scale genome-wide association studies (GWAS) have been conducted in the past decade to search for promising genetic variants associated with various traits. They have successfully identified tens of thousands of disease-related variants. However, in total these identified variants explain only part of the variation for most complex traits. There remain many genetic variants with small effect sizes to be discovered, which calls for the development of (a) GWAS with more samples and more comprehensively genotyped variants, for example, the NHLBI Trans-Omics for Precision Medicine (TOPMed) Program is planning to conduct whole genome sequencing on over 100 000 individuals; and (b) novel and more powerful statistical analysis methods. The current dominating GWAS analysis approach is the "single trait" association test, despite the fact that many GWAS are conducted in deeply phenotyped cohorts including many correlated and well-characterized outcomes, which can help improve the power to detect novel variants if properly analyzed, as suggested by increasing evidence that pleiotropy, where a genetic variant affects multiple traits, is the norm in genome-phenome associations. We aim to develop pleiotropy informed powerful association test methods across multiple traits for GWAS. Since it is generally very hard to access individual-level GWAS phenotype and genotype data for those existing GWAS, due to privacy concerns and various logistical considerations, we develop rigorous statistical methods for pleiotropy informed adaptive multitrait association test methods that need only summary association statistics publicly available from most GWAS. We first develop a pleiotropy test, which has powerful performance for truly pleiotropic variants but is sensitive to the pleiotropy assumption. We then develop a pleiotropy informed adaptive test that has robust and powerful performance under various genetic models. We develop accurate and efficient numerical algorithms to compute the analytical P-value for the proposed adaptive test without the need of resampling or permutation. We illustrate the performance of proposed methods through application to joint association test of GWAS meta-analysis summary data for several glycemic traits. Our proposed adaptive test identified several novel loci missed by individual trait based GWAS meta-analysis. All the proposed methods are implemented in a publicly available R package.

journal_name

Biometrics

journal_title

Biometrics

authors

Masotti M,Guo B,Wu B

doi

10.1111/biom.13076

subject

Has Abstract

pub_date

2019-12-01 00:00:00

pages

1076-1085

issue

4

eissn

0006-341X

issn

1541-0420

journal_volume

75

pub_type

杂志文章
  • Sequential equivalence testing and repeated confidence intervals, with applications to normal and binary responses.

    abstract::We propose group sequential tests of the equivalence of two treatments based on ideas related to repeated confidence intervals. These tests adapt readily to unpredictable group sizes, to the possibility of continuing even though a boundary has been crossed, and to nonnormal observations. In comparing two binomial dist...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Jennison C,Turnbull BW

    更新日期:1993-03-01 00:00:00

  • Statistical analysis of multilocus recombination.

    abstract::A general formula for the frequency of different recombinant gamete types, in terms of the underlying distribution of crossovers, is derived. This formula may be applied to any theoretical model of recombination in which it is assumed that there is no chromatid interference. Multiple-locus recombination data may be ev...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Risch N,Lange K

    更新日期:1983-12-01 00:00:00

  • Likelihood-ratio tests for hidden Markov models.

    abstract::We consider hidden Markov models as a versatile class of models for weakly dependent random phenomena. The topic of the present paper is likelihood-ratio testing for hidden Markov models, and we show that, under appropriate conditions, the standard asymptotic theory of likelihood-ratio tests is valid. Such tests are c...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2000.00742.x

    authors: Giudici P,Rydén T,Vandekerkhove P

    更新日期:2000-09-01 00:00:00

  • A network-based analysis of the 1861 Hagelloch measles data.

    abstract::In this article, we demonstrate a statistical method for fitting the parameters of a sophisticated network and epidemic model to disease data. The pattern of contacts between hosts is described by a class of dyadic independence exponential-family random graph models (ERGMs), whereas the transmission process that runs ...

    journal_title:Biometrics

    pub_type: 历史文章,杂志文章

    doi:10.1111/j.1541-0420.2012.01748.x

    authors: Groendyke C,Welch D,Hunter DR

    更新日期:2012-09-01 00:00:00

  • Numerical discretization-based estimation methods for ordinary differential equation models via penalized spline smoothing with applications in biomedical research.

    abstract::Differential equations are extensively used for modeling dynamics of physical processes in many scientific fields such as engineering, physics, and biomedical sciences. Parameter estimation of differential equation models is a challenging problem because of high computational cost and high-dimensional parameter space....

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2012.01752.x

    authors: Wu H,Xue H,Kumar A

    更新日期:2012-06-01 00:00:00

  • One-sided sequential stopping boundaries for clinical trials: a decision-theoretic approach.

    abstract::We address one-sided stopping rules for clinical trials, or more generally, drug development programs, from a decision-theoretic point of view. If efficacy results are sufficiently negative then the trial will be stopped. But regardless of how positive the efficacy results are, the trial will continue in order to demo...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Berry DA,Ho CH

    更新日期:1988-03-01 00:00:00

  • Interval estimation of the kappa coefficient with binary classification and an equal marginal probability model.

    abstract::We derive a likelihood score method for interval estimation of the intraclass version of the kappa coefficient of agreement with binary classification using a general theory of Bartlett (1953, Biometrika 40, 306-317). By exact evaluation, we investigate statistical properties of the score method, the chi-square goodne...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2000.00583.x

    authors: Nam JM

    更新日期:2000-06-01 00:00:00

  • A new criterion for confounder selection.

    abstract::We propose a new criterion for confounder selection when the underlying causal structure is unknown and only limited knowledge is available. We assume all covariates being considered are pretreatment variables and that for each covariate it is known (i) whether the covariate is a cause of treatment, and (ii) whether t...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2011.01619.x

    authors: VanderWeele TJ,Shpitser I

    更新日期:2011-12-01 00:00:00

  • A stochastic model for the analysis of bivariate longitudinal AIDS data.

    abstract::We present a model for multivariate repeated measures that incorporates random effects, correlated stochastic processes, and measurement errors. The model is a multivariate generalization of the model for univariate longitudinal data given by Taylor, Cumberland, and Sy (1994, Journal of the American Statistical Associ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Sy JP,Taylor JM,Cumberland WG

    更新日期:1997-06-01 00:00:00

  • Testing for cubic smoothing splines under dependent data.

    abstract::In most research on smoothing splines the focus has been on estimation, while inference, especially hypothesis testing, has received less attention. By defining design matrices for fixed and random effects and the structure of the covariance matrices of random errors in an appropriate way, the cubic smoothing spline a...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2010.01537.x

    authors: Nummi T,Pan J,Siren T,Liu K

    更新日期:2011-09-01 00:00:00

  • Nonparametric comparison of two survival-time distributions in the presence of dependent censoring.

    abstract::When testing the null hypothesis that treatment arm-specific survival-time distributions are equal, the log-rank test is asymptotically valid when the distribution of time to censoring is conditionally independent of randomized treatment group given survival time. We introduce a test of the null hypothesis for use whe...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/1541-0420.00059

    authors: DiRienzo AG

    更新日期:2003-09-01 00:00:00

  • On symmetric semiparametric two-sample problem.

    abstract::We consider a two-sample problem where data come from symmetric distributions. Usual two-sample data with only magnitudes recorded, arising from case-control studies or logistic discriminant analyses, may constitute a symmetric two-sample problem. We propose a semiparametric model such that, in addition to symmetry, t...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.13233

    authors: Li M,Diao G,Qin J

    更新日期:2020-12-01 00:00:00

  • Unbiased and locally efficient estimation of genetic effect on quantitative trait in the presence of population admixture.

    abstract::Population admixture can be a confounding factor in genetic association studies. Family-based methods (Rabinowitz and Larid, 2000, Human Heredity 50, 211-223) have been proposed in both testing and estimation settings to adjust for this confounding, especially in case-only association studies. The family-based methods...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2010.01454.x

    authors: Wang Y,Yang Q,Rabinowitz D

    更新日期:2011-06-01 00:00:00

  • An alternative approach to confidence interval estimation for the win ratio statistic.

    abstract::Pocock et al. (2012, European Heart Journal 33, 176-182) proposed a win ratio approach to analyzing composite endpoints comprised of outcomes with different clinical priorities. In this article, we establish a statistical framework for this approach. We derive the null hypothesis and propose a closed-form variance est...

    journal_title:Biometrics

    pub_type: 杂志文章,随机对照试验

    doi:10.1111/biom.12225

    authors: Luo X,Tian H,Mohanty S,Tsai WY

    更新日期:2015-03-01 00:00:00

  • Nonparametric estimation of relative mortality from nested case-control studies.

    abstract::Andersen et al. (1985, Biometrics 41, 921-932) gave an estimator of the cumulative relative mortality comparing rates of death in an epidemiologic cohort to an external population as a function of time when covariate information is available on all cohort members. We present an analogous estimator when covariate infor...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Borgan O,Langholz B

    更新日期:1993-06-01 00:00:00

  • Confidence intervals following group sequential tests in clinical trials.

    abstract::Tsiatis, Rosner, and Mehta (1984, Biometrics 40, 797-803) proposed a procedure for constructing confidence intervals following group sequential tests of a normal mean. This method is first extended for group sequential tests for which the sample sizes between interim analyses are not identical or the times are not equ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Kim K,DeMets DL

    更新日期:1987-12-01 00:00:00

  • Use of historical marker data for assessing treatment effects in phase I/II trials when subject selection is determined by baseline marker level.

    abstract::Although the primary focus of Phase I clinical trials is to assess clinical pharmacology and possible toxicities, any information on the potential effect of treatment would be useful in helping to determine priorities between treatments for further study. We consider the scenario where data are routinely collected on ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Lin HM,Hughes MD

    更新日期:1995-09-01 00:00:00

  • Quantifying and comparing dynamic predictive accuracy of joint models for longitudinal marker and time-to-event in presence of censoring and competing risks.

    abstract::Thanks to the growing interest in personalized medicine, joint modeling of longitudinal marker and time-to-event data has recently started to be used to derive dynamic individual risk predictions. Individual predictions are called dynamic because they are updated when information on the subject's health profile grows ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12232

    authors: Blanche P,Proust-Lima C,Loubère L,Berr C,Dartigues JF,Jacqmin-Gadda H

    更新日期:2015-03-01 00:00:00

  • Sensitivity analysis for nonrandom dropout: a local influence approach.

    abstract::Diggle and Kenward (1994, Applied Statistics 43, 49-93) proposed a selection model for continuous longitudinal data subject to nonrandom dropout. It has provoked a large debate about the role for such models. The original enthusiasm was followed by skepticism about the strong but untestable assumptions on which this t...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2001.00007.x

    authors: Verbeke G,Molenberghs G,Thijs H,Lesaffre E,Kenward MG

    更新日期:2001-03-01 00:00:00

  • Biased and unbiased estimation in longitudinal studies with informative visit processes.

    abstract::The availability of data in longitudinal studies is often driven by features of the characteristics being studied. For example, clinical databases are increasingly being used for research to address longitudinal questions. Because visit times in such data are often driven by patient characteristics that may be related...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12501

    authors: McCulloch CE,Neuhaus JM,Olin RL

    更新日期:2016-12-01 00:00:00

  • Semiparametric methods for mapping quantitative trait loci with censored data.

    abstract::Statistical methods for the detection of genes influencing quantitative traits with the aid of genetic markers are well developed for normally distributed, fully observed phenotypes. Many experiments are concerned with failure-time phenotypes, which have skewed distributions and which are usually subject to censoring ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2005.00346.x

    authors: Diao G,Lin DY

    更新日期:2005-09-01 00:00:00

  • Estimating acute air pollution health effects from cohort study data.

    abstract::Traditional studies of short-term air pollution health effects use time series data, while cohort studies generally focus on long-term effects. There is increasing interest in exploiting individual level cohort data to assess short-term health effects in order to understand the mechanisms and time scales of action. We...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12125

    authors: Szpiro AA,Sheppard L,Adar SD,Kaufman JD

    更新日期:2014-03-01 00:00:00

  • Nonparametric functional mapping of quantitative trait loci.

    abstract::Functional mapping is a useful tool for mapping quantitative trait loci (QTL) that control dynamic traits. It incorporates mathematical aspects of biological processes into the mixture model-based likelihood setting for QTL mapping, thus increasing the power of QTL detection and the precision of parameter estimation. ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2008.01063.x

    authors: Yang J,Wu R,Casella G

    更新日期:2009-03-01 00:00:00

  • Semiparametric modelling and estimation of covariate-adjusted dependence between bivariate recurrent events.

    abstract::A time-dependent measure, termed the rate ratio, was proposed to assess the local dependence between two types of recurrent event processes in one-sample settings. However, the one-sample work does not consider modeling the dependence by covariates such as subject characteristics and treatments received. The focus of ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.13229

    authors: Ning J,Cai C,Chen Y,Huang X,Wang MC

    更新日期:2020-12-01 00:00:00

  • Maximum likelihood estimation in dynamical models of HIV.

    abstract::The study of dynamical models of HIV infection, based on a system of nonlinear ordinary differential equations (ODE), has considerably improved the knowledge of its pathogenesis. While the first models used simplified ODE systems and analyzed each patient separately, recent works dealt with inference in non-simplified...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2007.00812.x

    authors: Guedj J,Thiébaut R,Commenges D

    更新日期:2007-12-01 00:00:00

  • Asynchronous distance between homologous DNA sequences.

    abstract::The distance between homologous DNA sequences of two species is proposed to be -1/4 ln[det(P)], where P is the conditional probability matrix specifying the proportions of the various nucleotides in the second sequence, corresponding to each of the four nucleotides in the first sequence. A probability model is describ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Barry D,Hartigan JA

    更新日期:1987-06-01 00:00:00

  • Two-stage designs for gene-disease association studies with sample size constraints.

    abstract::Gene-disease association studies based on case-control designs may often be used to identify candidate polymorphisms (markers) conferring disease risk. If a large number of markers are studied, genotyping all markers on all samples is inefficient in resource utilization. Here, we propose an alternative two-stage metho...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341X.2004.00207.x

    authors: Satagopan JM,Venkatraman ES,Begg CB

    更新日期:2004-09-01 00:00:00

  • On pooling across strata when frequency matching has been followed in a cohort study.

    abstract::In a study designed to assess the relationship between a dichotomous exposure and the eventual occurrence of a dichotomous outcome, frequency matching has been proposed as a way to balance the exposure cohorts with respect to the sampling distribution of potential confounding factors. This paper discusses the pooled e...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Weinberg CR

    更新日期:1985-03-01 00:00:00

  • Correcting for the effect of misclassification bias in a case-control study using data from two different questionnaires.

    abstract::In an epidemiological study of risk factors in breast cancer, data are available on confirmed cases from a diagnostic clinic and on controls from a screening clinic that sampled the general population. Relative risk estimation is complicated by differences in the interviewing environment and in the wording and order o...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Elton RA,Duffy SW

    更新日期:1983-09-01 00:00:00

  • An improved method for measuring heart-rate variability: assessment of cardiac autonomic function.

    abstract::Heart rate oscillates in synchrony with respiration. Several methods have been employed to assess this 'sinus arrhythmia', as an index of autonomic nervous system function. This paper proposes a new, easily computed measure, R, which is relatively resistant to the major nonrespiratory sources of variation, including p...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Weinberg CR,Pfeifer MA

    更新日期:1984-09-01 00:00:00