Direct effects testing: a two-stage procedure to test for effect size and variable importance for correlated binary predictors and a binary response.

Abstract:

:In applications such as medical statistics and genetics, we encounter situations where a large number of highly correlated predictors explain a response. For example, the response may be a disease indicator and the predictors may be treatment indicators or single nucleotide polymorphisms (SNPs). Constructing a good predictive model in such cases is well studied. Less well understood is how to recover the 'true sparsity pattern', that is finding which predictors have direct effects on the response, and indicating the statistical significance of the results. Restricting attention to binary predictors and response, we study the recovery of the true sparsity pattern using a two-stage method that separates establishing the presence of effects from inferring their exact relationship with the predictors. Simulations and a real data application demonstrate that the method discriminates well between associations and direct effects. Comparisons with lasso-based methods demonstrate favourable performance of the proposed method.

journal_name

Stat Med

journal_title

Statistics in medicine

authors

Sperrin M,Jaki T

doi

10.1002/sim.4014

subject

Has Abstract

pub_date

2010-10-30 00:00:00

pages

2544-56

issue

24

eissn

0277-6715

issn

1097-0258

journal_volume

29

pub_type

杂志文章
  • Semiparametric ROC surfaces for continuous diagnostic tests based on two test measurements.

    abstract::We propose a semiparametric method for estimating ROC surfaces for continuous diagnostic tests based on two test measurements. Such a three-class diagnostic problem based on two test measurements arises naturally from some DNA amplification-related diagnostic scenarios. Simulation results show that our proposed semipa...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3625

    authors: Wan S,Zhang B

    更新日期:2009-08-15 00:00:00

  • Testing for publication bias in diagnostic meta-analysis: a simulation study.

    abstract::The present study investigates the performance of several statistical tests to detect publication bias in diagnostic meta-analysis by means of simulation. While bivariate models should be used to pool data from primary studies in diagnostic meta-analysis, univariate measures of diagnostic accuracy are preferable for t...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6177

    authors: Bürkner PC,Doebler P

    更新日期:2014-08-15 00:00:00

  • Statistical and practical issues in the design of a national probability sample of births for the Vanguard Study of the National Children's Study.

    abstract::The National Children's Study is a national household probability sample designed to identify 100,000 children at birth and follow the sampled children for 21 years. Data from the study will support examining numerous hypotheses concerning genetic and environmental effects on the health and development of children. Th...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3891

    authors: Montaquila JM,Brick JM,Curtin LR

    更新日期:2010-06-15 00:00:00

  • Application of the parallel line assay to assessment of biosimilar products based on binary endpoints.

    abstract::Biological drug products are therapeutic moieties manufactured by a living system or organisms. These are important life-saving drug products for patients with unmet medical needs. Because of expensive cost, only a few patients have access to life-saving biological products. Most of the early biological products will ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.5565

    authors: Lin JR,Chow SC,Chang CH,Lin YC,Liu JP

    更新日期:2013-02-10 00:00:00

  • Case-control analysis with a continuous outcome variable.

    abstract::It is not uncommon for a continuous outcome variable Y to be dichotomized and analysed using logistic regression. Moser and Coombs (Statist. Med. 2004; 23:1843-1860) provide a method for converting the output from a standard linear regression analysis using the original continuous outcome Y to give much more efficient...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3474

    authors: Jiang Y,Scott A,Wild CJ

    更新日期:2009-01-30 00:00:00

  • Aggregation of existing geographic regions to diminish spurious variability of disease rates.

    abstract::The availability of large data sets together with the growth in power and storage capabilities of computers have made the analysis of the spatial distribution of disease rates an increasingly important tool in public health research. Use of existing geographic divisions or groupings tends to result either in unstable ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780121916

    authors: Morris RD,Munasinghe RL

    更新日期:1993-10-01 00:00:00

  • A sensitivity analysis for subverting randomization in controlled trials.

    abstract::In some randomized controlled trials, subjects with a better prognosis may be diverted into the treatment group. This subverting of randomization results in an unobserved non-compliance with the originally intended treatment assignment. Consequently, the estimate of treatment effect from these trials may be biased. Th...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.715

    authors: Marcus SM

    更新日期:2001-02-28 00:00:00

  • Using follow-up data to avoid omitted variable bias: an application to cardiovascular epidemiology.

    abstract::Omitted variable bias is discussed in the context of linear models. It is shown that the effect of omitted variables can be controlled in linear models for metric dependent variables by using data from follow-up studies. Two different models for analysing such data are proposed. In the first model the omitted variable...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780110906

    authors: Rehm J,Arminger G,Kohlmeier L

    更新日期:1992-06-30 00:00:00

  • Prior event rate ratio adjustment for hidden confounding in observational studies of treatment effectiveness: a pairwise Cox likelihood approach.

    abstract::Observational studies provide a rich source of information for assessing effectiveness of treatment interventions in many situations where it is not ethical or practical to perform randomized controlled trials. However, such studies are prone to bias from hidden (unmeasured) confounding. A promising approach to identi...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.7051

    authors: Lin NX,Henley WE

    更新日期:2016-12-10 00:00:00

  • Assessing the effect of interventions in the context of mixture distributions with detection limits.

    abstract::Many quantitative assay measurements of metabolites of environmental toxicants in clinical investigations are subject to left censoring due to values falling below assay detection limits. Moreover, when observations occur in both unexposed individuals and exposed individuals who reflect a mixture of two distributions ...

    journal_title:Statistics in medicine

    pub_type: 临床试验,杂志文章,随机对照试验

    doi:10.1002/sim.2079

    authors: Chu H,Kensler TW,Muñoz A

    更新日期:2005-07-15 00:00:00

  • Hypothesis testing in an errors-in-variables model with heteroscedastic measurement errors.

    abstract::In many epidemiological studies it is common to resort to regression models relating incidence of a disease and its risk factors. The main goal of this paper is to consider inference on such models with error-prone observations and variances of the measurement errors changing across observations. We suppose that the o...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3343

    authors: de Castro M,Galea M,Bolfarine H

    更新日期:2008-11-10 00:00:00

  • On the relationship between association and surrogacy when both the surrogate and true endpoint are binary outcomes.

    abstract::The relationship between association and surrogacy has been the focus of much debate in the surrogate marker literature. Recently, the individual causal association (ICA) has been introduced as a metric of surrogacy in the causal inference framework, when both the surrogate and the true endpoint are normally distribut...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8698

    authors: Meyvisch P,Alonso A,Van der Elst W,Molenberghs G

    更新日期:2020-11-20 00:00:00

  • Discriminant analysis when all variables are ordered.

    abstract::Determination of the equation that relates an ordered dependent variable to ordered independent variables is sought. One solution, non-parametric discriminant analysis (NPD), involves obtaining the best monotonic step function by means of a computer search procedure. Although one can use alternative selection criteria...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780110804

    authors: Johnston B,Seshia SS

    更新日期:1992-06-15 00:00:00

  • Construction, validation and updating of a prognostic model for kidney graft survival.

    abstract::The construction, validation and updating of a prognostic model for kidney graft survival is reported using data from the Eurotransplant database. First, a model is constructed for data from transplantations in the period 1984 to 1987. The model is later updated for the 1988 1990 data. The first data set was randomly ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780141806

    authors: Van Houwelingen HC,Thorogood J

    更新日期:1995-09-30 00:00:00

  • Modelling the relationship between continuous covariates and clinical events using isotonic regression.

    abstract::In a medical study we are often interested in graphically displaying the relationship between continuous variables and clinical events indicating disease progression. Often, it is reasonable to make the minimal assumption that the risk of progression is an arbitrary monotone function of the continuous variable. Someti...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1561

    authors: Ancukiewicz M,Finkelstein DM,Schoenfeld DA

    更新日期:2003-10-30 00:00:00

  • Partitioned GMM logistic regression models for longitudinal data.

    abstract::Correlation is inherent in longitudinal studies due to the repeated measurements on subjects, as well as due to time-dependent covariates in the study. In the National Longitudinal Study of Adolescent to Adult Health (Add Health), data were repeatedly collected on children in grades 7-12 across four waves. Thus, obser...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8099

    authors: Irimata KM,Broatch J,Wilson JR

    更新日期:2019-05-30 00:00:00

  • Network-based regularization for matched case-control analysis of high-dimensional DNA methylation data.

    abstract::The matched case-control designs are commonly used to control for potential confounding factors in genetic epidemiology studies especially epigenetic studies with DNA methylation. Compared with unmatched case-control studies with high-dimensional genomic or epigenetic data, there have been few variable selection metho...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.5694

    authors: Sun H,Wang S

    更新日期:2013-05-30 00:00:00

  • Estimation of the mediation effect with a binary mediator.

    abstract::A mediator acts as a third variable in the causal pathway between a risk factor and an outcome. In this paper, we consider the estimation of the mediation effect when the mediator is a binary variable. We give a precise definition of the mediation effect and examine asymptotic properties of five different estimators o...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2730

    authors: Li Y,Schneider JA,Bennett DA

    更新日期:2007-08-15 00:00:00

  • Cause-specific quantile regression on inactivity time.

    abstract::In time-to-event analysis, the traditional summary measures have been based on the hazard function, survival function, quantile event time, restricted mean event time, and residual lifetime. Under competing risks, furthermore, typical summary measures have been the cause-specific hazard function and cumulative inciden...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8871

    authors: Jia Y,Jeong JH

    更新日期:2021-01-06 00:00:00

  • Baseline risk as predictor of treatment benefit: three clinical meta-re-analyses.

    abstract::A relationship between baseline risk and treatment effect is increasingly investigated as a possible explanation of between-study heterogeneity in clinical trial meta-analysis. An approach that is still often applied in the medical literature is to plot the estimated treatment effects against the estimated measures of...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/1097-0258(20001230)19:24<3497::aid-sim830>

    authors: Arends LR,Hoes AW,Lubsen J,Grobbee DE,Stijnen T

    更新日期:2000-12-30 00:00:00

  • Cutoff designs for community-based intervention studies.

    abstract::Public health interventions are often designed to target communities defined either geographically (e.g. cities, counties) or socially (e.g. schools or workplaces). The group randomized trial (GRT) is regarded as the gold standard for evaluating these interventions. However, community leaders may object to randomizati...

    journal_title:Statistics in medicine

    pub_type: 杂志文章,随机对照试验

    doi:10.1002/sim.4237

    authors: Pennell ML,Hade EM,Murray DM,Rhoda DA

    更新日期:2011-07-10 00:00:00

  • A functional-model-adjusted spatial scan statistic.

    abstract::This paper introduces a new spatial scan statistic designed to adjust cluster detection for longitudinal confounding factors indexed in space. The functional-model-adjusted statistic was developed using generalized functional linear models in which longitudinal confounding factors were considered to be functional cova...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8459

    authors: Ahmed MS,Genin M

    更新日期:2020-04-15 00:00:00

  • Corrections for exposure measurement error in logistic regression models with an application to nutritional data.

    abstract::Two correction methods are considered for multiple logistic regression models with some covariates measured with error. Both methods are based on approximating the complicated regression model between the response and the observed covariates with simpler models. The first model is the logistic approximation proposed b...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780131105

    authors: Kuha J

    更新日期:1994-06-15 00:00:00

  • Causal inference in survival analysis using pseudo-observations.

    abstract::Causal inference for non-censored response variables, such as binary or quantitative outcomes, is often based on either (1) direct standardization ('G-formula') or (2) inverse probability of treatment assignment weights ('propensity score'). To do causal inference in survival analysis, one needs to address right-censo...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.7297

    authors: Andersen PK,Syriopoulou E,Parner ET

    更新日期:2017-07-30 00:00:00

  • Randomization tests for multiarmed randomized clinical trials.

    abstract::We examine the use of randomization-based inference for analyzing multiarmed randomized clinical trials, including the application of conditional randomization tests to multiple comparisons. The view is taken that the linkage of the statistical test to the experimental design (randomization procedure) should be recogn...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8418

    authors: Wang Y,Rosenberger WF,Uschner D

    更新日期:2020-02-20 00:00:00

  • A prediction-based test for multiple endpoints.

    abstract::This article introduces a global hypothesis test intended for studies with multiple endpoints. Our test makes use of a priori predictions about the direction of the result of each endpoint and we weight these predictions using the sample correlation matrix. The global alternative hypothesis concerns a parameter, ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8724

    authors: Montgomery RN,Mahnken JD

    更新日期:2020-12-10 00:00:00

  • A robust goodness-of-fit test statistic with application to ordinal regression models.

    abstract::We propose a goodness-of-fit test statistic for linear regression with heterogeneous variance, which is asymptotically chi-square if the given model is correct. The test statistic is computed as a quadratic form of observed minus predicted responses. We apply the method to a linear regression for an ordinal categorica...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780130205

    authors: Lipsitz SR,Buoncristiani JF

    更新日期:1994-01-30 00:00:00

  • Binary regression with continuous outcomes.

    abstract::Clinical research often involves continuous outcome measures, such as blood cholesterol, that are amenable to statistical techniques of analysis based on the mean, such as the t-test or multiple linear regression. Clinical interest, however, frequently focuses on the proportion of subjects who fall below or above a cl...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780140303

    authors: Suissa S,Blais L

    更新日期:1995-02-15 00:00:00

  • On the association between variables with lower detection limits.

    abstract::In this paper, we define a modified version τ(b) of Kendall's tau to measure the association in a pair (X,Y) of random variables subject to fixed left censoring due to known lower detection limits. We provide a nonparametric estimator of τ(b) and investigate its asymptotic properties. We then assume an Archimedean cop...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4319

    authors: Romdhani H,Lakhal-Chaieb L

    更新日期:2011-11-20 00:00:00

  • Bayesian methods for meta-analysis of causal relationships estimated using genetic instrumental variables.

    abstract::Genetic markers can be used as instrumental variables, in an analogous way to randomization in a clinical trial, to estimate the causal relationship between a phenotype and an outcome variable. Our purpose is to extend the existing methods for such Mendelian randomization studies to the context of multiple genetic mar...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3843

    authors: Burgess S,Thompson SG,CRP CHD Genetics Collaboration.,Burgess S,Thompson SG,Andrews G,Samani NJ,Hall A,Whincup P,Morris R,Lawlor DA,Davey Smith G,Timpson N,Ebrahim S,Ben-Shlomo Y,Davey Smith G,Timpson N,Brown M,Ricket

    更新日期:2010-05-30 00:00:00