A penalized robust semiparametric approach for gene-environment interactions.

Abstract:

:In genetic and genomic studies, gene-environment (G×E) interactions have important implications. Some of the existing G×E interaction methods are limited by analyzing a small number of G factors at a time, by assuming linear effects of E factors, by assuming no data contamination, and by adopting ineffective selection techniques. In this study, we propose a new approach for identifying important G×E interactions. It jointly models the effects of all E and G factors and their interactions. A partially linear varying coefficient model is adopted to accommodate possible nonlinear effects of E factors. A rank-based loss function is used to accommodate possible data contamination. Penalization, which has been extensively used with high-dimensional data, is adopted for selection. The proposed penalized estimation approach can automatically determine if a G factor has an interaction with an E factor, main effect but not interaction, or no effect at all. The proposed approach can be effectively realized using a coordinate descent algorithm. Simulation shows that it has satisfactory performance and outperforms several competing alternatives. The proposed approach is used to analyze a lung cancer study with gene expression measurements and clinical variables. Copyright © 2015 John Wiley & Sons, Ltd.

journal_name

Stat Med

journal_title

Statistics in medicine

authors

Wu C,Shi X,Cui Y,Ma S

doi

10.1002/sim.6609

subject

Has Abstract

pub_date

2015-12-30 00:00:00

pages

4016-30

issue

30

eissn

0277-6715

issn

1097-0258

journal_volume

34

pub_type

杂志文章
  • Flexible design clinical trial methodology in regulatory applications.

    abstract::Adaptive designs or flexible designs in a broader sense have increasingly been considered in planning pivotal registration clinical trials. Sample size reassessment design and adaptive selection design are two of such designs that appear in regulatory applications. At the design stage, consideration of sample size rea...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4021

    authors: Hung HM,Wang SJ,O'Neill R

    更新日期:2011-06-15 00:00:00

  • Rank-based principal stratum sensitivity analyses.

    abstract::We describe rank-based approaches to assess principal stratification treatment effects in studies where the outcome of interest is only well-defined in a subgroup selected after randomization. Our methods are sensitivity analyses, in that estimands are identified by fixing a parameter and then we investigate the sensi...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.5849

    authors: Lu X,Mehrotra DV,Shepherd BE

    更新日期:2013-11-20 00:00:00

  • Predicting analysis times in randomized clinical trials.

    abstract::Randomized clinical trial designs commonly include one or more planned interim analyses. At these times an external monitoring committee reviews the accumulated data and determines whether it is scientifically and ethically appropriate for the study to continue. With failure-time endpoints, it is common to schedule an...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.843

    authors: Bagiella E,Heitjan DF

    更新日期:2001-07-30 00:00:00

  • Modeling health survey data with excessive zero and K responses.

    abstract::Zero-inflated Poisson regression is a popular tool used to analyze data with excessive zeros. Although much work has already been performed to fit zero-inflated data, most models heavily depend on special features of the individual data. To be specific, this means that there is a sizable group of respondents who endor...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.5650

    authors: Lin TH,Tsai MH

    更新日期:2013-04-30 00:00:00

  • Statistical and practical issues in the design of a national probability sample of births for the Vanguard Study of the National Children's Study.

    abstract::The National Children's Study is a national household probability sample designed to identify 100,000 children at birth and follow the sampled children for 21 years. Data from the study will support examining numerous hypotheses concerning genetic and environmental effects on the health and development of children. Th...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3891

    authors: Montaquila JM,Brick JM,Curtin LR

    更新日期:2010-06-15 00:00:00

  • Joint estimation of multiple disease-specific sensitivities and specificities via crossed random effects models for correlated reader-based diagnostic data: application of data cloning.

    abstract::We present a model for describing correlated binocular data from reader-based diagnostic studies, where the same group of readers evaluates the presence or absence of certain diseases on binocular organs (e.g., fellow eyes) of patients. Multiple random effects are incorporated to meaningfully delineate various associa...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6584

    authors: Withanage N,de Leon AR,Rudnisky CJ

    更新日期:2015-12-20 00:00:00

  • Modeling and validating Bayesian accrual models on clinical data and simulations using adaptive priors.

    abstract::Slow recruitment in clinical trials leads to increased costs and resource utilization, which includes both the clinic staff and patient volunteers. Careful planning and monitoring of the accrual process can prevent the unnecessary loss of these resources. We propose two hierarchical extensions to the existing Bayesian...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6359

    authors: Jiang Y,Simon S,Mayo MS,Gajewski BJ

    更新日期:2015-02-20 00:00:00

  • Combining classification trees using MLE.

    abstract::We propose a probability distribution for an equivalence class of classification trees (that is, those that ignore the value of the cutpoints but retain tree structure). This distribution is parameterized by a central tree structure representing the true model, and a precision or concentration coefficient representing...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(19990330)18:6<727::aid-sim

    authors: Shannon WD,Banks D

    更新日期:1999-03-30 00:00:00

  • Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond.

    abstract::Identification of key factors associated with the risk of developing cardiovascular disease and quantification of this risk using multivariable prediction algorithms are among the major advances made in preventive cardiology and cardiovascular epidemiology in the 20th century. The ongoing discovery of new risk markers...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2929

    authors: Pencina MJ,D'Agostino RB Sr,D'Agostino RB Jr,Vasan RS

    更新日期:2008-01-30 00:00:00

  • Sample size planning for survival prediction with focus on high-dimensional data.

    abstract::Sample size planning should reflect the primary objective of a trial. If the primary objective is prediction, the sample size determination should focus on prediction accuracy instead of power. We present formulas for the determination of training set sample size for survival prediction. Sample size is chosen to contr...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.5550

    authors: Götte H,Zwiener I

    更新日期:2013-02-28 00:00:00

  • Doubly robust generalized estimating equations for longitudinal data.

    abstract::A popular method for analysing repeated-measures data is generalized estimating equations (GEE). When response data are missing at random (MAR), two modifications of GEE use inverse-probability weighting and imputation. The weighted GEE (WGEE) method involves weighting observations by their inverse probability of bein...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3520

    authors: Seaman S,Copas A

    更新日期:2009-03-15 00:00:00

  • A restricted mixture model for dietary pattern analysis in small samples.

    abstract::Multivariate finite mixture models have been applied to the identification of dietary patterns. These models are known to have many parameters, and consequently large samples are usually required. We present a special case of a multivariate mixture model that reduces the number of parameters to be estimated and seems ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.5336

    authors: Rita Gaio A,Costa JP,Santos AC,Ramos E,Lopes C

    更新日期:2012-08-30 00:00:00

  • A comparison of arm-based and contrast-based models for network meta-analysis.

    abstract::Differences between arm-based (AB) and contrast-based (CB) models for network meta-analysis (NMA) are controversial. We compare the CB model of Lu and Ades (2006), the AB model of Hong et al(2016), and two intermediate models, using hypothetical data and a selected real data set. Differences between models arise prima...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8360

    authors: White IR,Turner RM,Karahalios A,Salanti G

    更新日期:2019-11-30 00:00:00

  • Analysis of the ratio of marginal probabilities in a matched-pair setting.

    abstract::Statistical methods for testing and interval estimation of the ratio of marginal probabilities in the matched-pair setting are considered in this paper. We are especially interested in the situation where the null value is not one, as in one-sided equivalence trials. We propose a Fieller-type statistic based on constr...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1017

    authors: Nam JM,Blackwelder WC

    更新日期:2002-03-15 00:00:00

  • British 1990 growth reference centiles for weight, height, body mass index and head circumference fitted by maximum penalized likelihood.

    abstract::To update the British growth reference, anthropometric data for weight, height, body mass index (weight/height2) and head circumference from 17 distinct surveys representative of England, Scotland and Wales (37,700 children, age range 23 weeks gestation to 23 years) were analysed by maximum penalized likelihood using ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:

    authors: Cole TJ,Freeman JV,Preece MA

    更新日期:1998-02-28 00:00:00

  • Assessing goodness-of-fit of parametric regression models for lifetime data-graphical methods.

    abstract::Graphical methods are often used to check goodness-of-fit of models to data. It is common to plot residuals against a reference distribution so that when the model fits the data, the configuration should be close to a straight line. Since the resemblance to a straight line is often unclear, it has been suggested to ad...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780141607

    authors: Cohen A,Barnett O

    更新日期:1995-08-30 00:00:00

  • Small clinical trials: are they all bad?

    abstract::Statisticians have long argued that randomized controlled trials should be sufficiently large to achieve their purpose, and for common diseases with major public health implications this has brought many benefits. However, there are many instances where it is unrealistic to expect clinicians to provide the information...

    journal_title:Statistics in medicine

    pub_type: 杂志文章,评审

    doi:10.1002/sim.4780140204

    authors: Matthews JN

    更新日期:1995-01-30 00:00:00

  • Beta-binomial/Poisson regression models for repeated bivariate counts.

    abstract::We analyze data obtained from a study designed to evaluate training effects on the performance of certain motor activities of Parkinson's disease patients. Maximum likelihood methods were used to fit beta-binomial/Poisson regression models tailored to evaluate the effects of training on the numbers of attempted and su...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3303

    authors: Lora MI,Singer JM

    更新日期:2008-07-30 00:00:00

  • The effect of salvage therapy on survival in a longitudinal study with treatment by indication.

    abstract::We consider using observational data to estimate the effect of a treatment on disease recurrence, when the decision to initiate treatment is based on longitudinal factors associated with the risk of recurrence. The effect of salvage androgen deprivation therapy (SADT) on the risk of recurrence of prostate cancer is in...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4017

    authors: Kennedy EH,Taylor JM,Schaubel DE,Williams S

    更新日期:2010-11-10 00:00:00

  • Bayesian bivariate meta-analysis of diagnostic test studies using integrated nested Laplace approximations.

    abstract::For bivariate meta-analysis of diagnostic studies, likelihood approaches are very popular. However, they often run into numerical problems with possible non-convergence. In addition, the construction of confidence intervals is controversial. Bayesian methods based on Markov chain Monte Carlo (MCMC) sampling could be u...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3858

    authors: Paul M,Riebler A,Bachmann LM,Rue H,Held L

    更新日期:2010-05-30 00:00:00

  • A scan statistic with a variable window.

    abstract::Given N points or events occurring according to some probability distribution in the unit interval (0, 1), the simple scan statistic is defined to be the maximum number of points in any sub-interval of length d. In many areas, as in epidemiology, it is used to test the null hypothesis that the events are random, again...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(19960415)15:7/9<845::aid-s

    authors: Nagarwalla N

    更新日期:1996-04-15 00:00:00

  • Analysis of ordered composite endpoints.

    abstract::Composite endpoints are frequently used in clinical trials, but simple approaches, such as the time to first event, do not reflect any ordering among the endpoints. However, some endpoints, such as mortality, are worse than others. A variety of procedures have been proposed to reflect the severity of the individual en...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8431

    authors: Follmann D,Fay MP,Hamasaki T,Evans S

    更新日期:2020-02-28 00:00:00

  • Exact group-sequential designs for clinical trials with randomized play-the-winner allocation.

    abstract::The use of both sequential designs and adaptive treatment allocation are effective in reducing the number of patients receiving an inferior treatment in a clinical trial. In large samples, when the asymptotic normality of test statistics can be utilized, a standard sequential design can be combined with adaptive alloc...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.998

    authors: Stallard N,Rosenberger WF

    更新日期:2002-02-28 00:00:00

  • Some considerations in the analysis of rates of change in longitudinal studies.

    abstract::This paper discusses and compares several estimators of mean rate of change in unbalanced longitudinal data based on a model with randomly distributed regression coefficients across individuals. The estimators are unweighted and weighted means of these coefficients. The paper also evaluates commonly used variance esti...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780060509

    authors: Palta M,Cook T

    更新日期:1987-07-01 00:00:00

  • Multilocus linkage disequilibrium mapping of epistatic quantitative trait loci that regulate HIV dynamics: a simulation approach.

    abstract::The time-dependent change of HIV particle load, i.e. HIV dynamics, is likely to be controlled by a multitude of quantitative trait loci (QTL) that interact with each other as well as with various developmental and environmental factors in a coordinated manner. In this article, we have derived a new statistical model f...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2489

    authors: Wu S,Yang J,Wu R

    更新日期:2006-11-30 00:00:00

  • Statistical methods for multivariate interval-censored recurrent events.

    abstract::Multi-type recurrent event data arise when two or more different kinds of events may occur repeatedly over a period of observation. The scientific objectives in such settings are often to describe features of the marginal processes and to study the association between the different types of events. Interval-censored m...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1936

    authors: Chen BE,Cook RJ,Lawless JF,Zhan M

    更新日期:2005-03-15 00:00:00

  • CoPlot: a tool for visualizing multivariate data in medicine.

    abstract::Many critical questions in medicine require the analysis of complex multivariate data, often from large data sets describing numerous variables for numerous subjects. In this paper, we describe CoPlot, a tool for visualizing multivariate data in medicine. CoPlot is an adaptation of multidimensional scaling (MDS) that ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3078

    authors: Bravata DM,Shojania KG,Olkin I,Raveh A

    更新日期:2008-05-30 00:00:00

  • Variable selection in covariate dependent random partition models: an application to urinary tract infection.

    abstract::Lower urinary tract symptoms can indicate the presence of urinary tract infection (UTI), a condition that if it becomes chronic requires expensive and time consuming care as well as leading to reduced quality of life. Detecting the presence and gravity of an infection from the earliest symptoms is then highly valuable...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6786

    authors: Barcella W,Iorio MD,Baio G,Malone-Lee J

    更新日期:2016-04-15 00:00:00

  • Additive and multiplicative covariate regression models for relative survival incorporating fractional polynomials for time-dependent effects.

    abstract::Relative survival is used to estimate patient survival excluding causes of death not related to the disease of interest. Rather than using cause of death information from death certificates, which is often poorly recorded, relative survival compares the observed survival to that expected in a matched group from the ge...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2399

    authors: Lambert PC,Smith LK,Jones DR,Botha JL

    更新日期:2005-12-30 00:00:00

  • Estimating the mean hazard ratio parameters for clustered survival data with random clusters.

    abstract::We consider a latent variable hazard model for clustered survival data where clusters are a random sample from an underlying population. We allow interactions between the random cluster effect and covariates. We use a maximum pseudo-likelihood estimator to estimate the mean hazard ratio parameters. We propose a bootst...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(19970915)16:17<2009::aid-s

    authors: Cai J,Zhou H,Davis CE

    更新日期:1997-09-15 00:00:00