Multiple imputation analysis of case-cohort studies.


The usual methods for analyzing case-cohort studies rely on weighted estimators that are sometimes not fully efficient. Multiple imputation might be a good alternative because it uses all the data available and approximates the maximum partial likelihood estimator. This method is based on the generation of several plausible complete data sets, taking into account uncertainty about missing values. When the imputation model is correctly defined, the multiple imputation estimator is asymptotically unbiased and its variance is correctly estimated. We show that a correct imputation model must be estimated from the fully observed data (cases and controls), including the case status among the explanatory variables. To validate the approach, we analyzed case-cohort studies first with completely simulated data and then with case-cohort data sampled from two real cohorts. The analyses of simulated data showed that, when the imputation model was correct, the multiple imputation estimator was unbiased and efficient. The observed gain in precision ranged from 8 to 37 per cent for phase-1 variables and from 5 to 19 per cent for the phase-2 variable. When the imputation model was misspecified, the multiple imputation estimator was still more efficient than the weighted estimators but it was also slightly biased. The analyses of case-cohort data sampled from complete cohorts showed that even when no strong predictor of the phase-2 variable was available, the multiple imputation estimator was unbiased, as precise as the weighted estimator for the phase-2 variable and slightly more precise than the weighted estimators for the phase-1 variables. However, the multiple imputation estimator was found to be biased when, because of interaction terms, some coefficients of the imputation model had to be estimated from small samples. Multiple imputation is an efficient technique for analyzing case-cohort data.
In practice, we suggest building the analysis model using only the case-cohort data and weighted estimators. Multiple imputation can then be used to reanalyze the data with the selected model in order to improve the precision of the results.
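
The abstract does not spell out how the estimates from the several completed data sets are combined. A minimal sketch of the standard combining step (Rubin's rules) is given here on the assumption that this is what is meant; the function name `pool_rubin` and the numeric values are hypothetical and are not taken from the paper.

```python
from statistics import mean, variance


def pool_rubin(estimates, variances):
    """Pool m completed-data estimates and their variances with Rubin's rules."""
    m = len(estimates)
    if m < 2 or m != len(variances):
        raise ValueError("need at least two imputations and matching lengths")
    q_bar = mean(estimates)    # pooled point estimate
    w = mean(variances)        # within-imputation variance
    b = variance(estimates)    # between-imputation variance (m - 1 denominator)
    t = w + (1 + 1 / m) * b    # total variance of the pooled estimate
    return q_bar, t


# Hypothetical log hazard ratios and their variances from m = 5 imputed data sets
estimates = [0.40, 0.50, 0.45, 0.55, 0.60]
variances = [0.010, 0.012, 0.011, 0.009, 0.008]
pooled, total_var = pool_rubin(estimates, variances)
```

The between-imputation term is what carries the uncertainty about the missing phase-2 values into the final variance; each completed-data estimate would come from fitting the analysis model (for example a Cox model) to one imputed full cohort.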


Stat Med


Statistics in medicine


Marti H,Chavance M






2011-06-15 00:00:00












  • Estimating adjusted risk difference (RD) and number needed to treat (NNT) measures in the Cox regression model.

    abstract::In medical research, risk difference (RD) and number needed to treat (NNT) measures for survival times have been mainly proposed without consideration of covariates. In this paper, we develop adjusted RD and NNT measures for use in observational studies with survival time outcomes within the framework of the Cox propo...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Laubender RP,Bender R

    update_date: 2010-03-30 00:00:00

  • Assurance calculations for planning clinical trials with time-to-event outcomes.

    abstract::We consider the use of the assurance method in clinical trial planning. In the assurance method, which is an alternative to a power calculation, we calculate the probability of a clinical trial resulting in a successful outcome, via eliciting a prior probability distribution about the relevant treatment effect. This i...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Ren S,Oakley JE

    update_date: 2014-01-15 00:00:00

  • Model-checking techniques for stratified case-control studies.

    abstract::We present graphical and numerical methods for assessing the adequacy of the logistic regression model for stratified case-control data. The proposed methods are derived from the cumulative sum of residuals over the covariate or linear predictor. Under the assumed model, the cumulative residual process converges weakl...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Arbogast PG,Lin DY

    update_date: 2005-01-30 00:00:00

  • Analysis of in vitro fertilization data with multiple outcomes using discrete time-to-event analysis.

    abstract::In vitro fertilization (IVF) is an increasingly common method of assisted reproductive technology. Because of the careful observation and follow-up required as part of the procedure, IVF studies provide an ideal opportunity to identify and assess clinical and demographic factors along with environmental exposures that...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Maity A,Williams PL,Ryan L,Missmer SA,Coull BA,Hauser R

    update_date: 2014-05-10 00:00:00

  • Driving in search of analyses.

    abstract::Although transportation safety has greatly improved over the past 2 decades, motor vehicle crash injuries remain a leading cause of morbidity and mortality, particularly among young drivers. Driver errors and behaviors such as speeding and distraction contribute disproportionately to crashes among inexperienced novice...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Simons-Morton B

    update_date: 2017-10-30 00:00:00

  • Model-based phase I designs incorporating toxicity and efficacy for single and dual agent drug combinations: methods and challenges.

    abstract::Novel therapies are challenging the standards of drug development. Agents with specific biologic targets, unknown dose-efficacy curves, and limited toxicity mandate novel designs to identify biologically optimal doses. We review two model-based designs that utilize either a proportional odds model or a continuation ra...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Mandrekar SJ,Qin R,Sargent DJ

    update_date: 2010-05-10 00:00:00

  • Analysis of cluster randomized cross-over trial data: a comparison of methods.

    abstract::In a cluster randomized cross-over trial, all participating clusters receive both intervention and control treatments consecutively, in separate time periods. Patients recruited by each cluster within the same time period receive the same intervention, and randomization determines order of treatment within a cluster. ...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Turner RM,White IR,Croudace T,PIP Study Group.

    update_date: 2007-01-30 00:00:00

  • Quantifying the bias due to observed individual confounders in causal treatment effect estimates.

    abstract::It is often of interest to use observational data to estimate the causal effect of a target exposure or treatment on an outcome. When estimating the treatment effect, it is essential to appropriately adjust for selection bias due to observed confounders using, for example, propensity score weighting. Selection bias du...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Parast L,Griffin BA

    update_date: 2020-08-15 00:00:00

  • Performance of weighted estimating equations for longitudinal binary data with drop-outs missing at random.

    abstract::The generalized estimating equations (GEE) approach is commonly used to model incomplete longitudinal binary data. When drop-outs are missing at random through dependence on observed responses (MAR), GEE may give biased parameter estimates in the model for the marginal means. A weighted estimating equations approach g...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Preisser JS,Lohman KK,Rathouz PJ

    update_date: 2002-10-30 00:00:00

  • Surveillance of clustering near point sources.

    abstract::Health authorities are often alerted to suspected cancer clusters near the vicinity of potential point sources by members of the public. A surveillance system, where administrative regions around the potential point sources are regularly monitored for high disease rates, would allow for responses which are easier to o...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Le ND,Petkau AJ,Rosychuk R

    update_date: 1996-04-15 00:00:00

  • Modelling frailty in area mortality.

    abstract::This paper investigates the impact on area life tables of the specification of unobserved frailty. Frailty specification may affect both the regression effects of area and individual level covariates, and lead to changes in the value of summary mortality parameters, such as life expectancy. The paper also investigates...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Congdon P

    update_date: 1995-09-15 00:00:00

  • Modelling the association between patient characteristics and the change over time in a disease measure using observational cohort data.

    abstract::In observational cohort studies we may wish to examine the associations between fixed patient characteristics and the longitudinal changes from baseline in a repeated outcome measure. Many biological and other outcome measures are known to be subject to measurement error and biological variation. In an initial analysi...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Harrison L,Dunn DT,Green H,Copas AJ

    update_date: 2009-11-20 00:00:00

  • Confidence intervals for the standardized effect arising in the comparison of two normal populations.

    abstract::Confidence intervals for a standardized effect are derived after stabilizing the variance of the Welch t-statistic. Simulation studies demonstrate the viability of the resulting intervals for a wide range of parameter values and sample sizes as small as five. The methodology is extended to the combination of results f...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Kulinskaya E,Staudte RG

    update_date: 2007-06-30 00:00:00

  • A model-based conditional power assessment for decision making in randomized controlled trial studies.

    abstract::Conditional power based on summary statistic by comparing outcomes (such as the sample mean) directly between 2 groups is a convenient tool for decision making in randomized controlled trial studies. In this paper, we extend the traditional summary statistic-based conditional power with a general model-based assessmen...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Zou B,Cai J,Koch GG,Zhou H,Zou F

    update_date: 2017-12-30 00:00:00

  • Tests for individual and population bioequivalence based on generalized p-values.

    abstract::The U.S. Food and Drug Administration (FDA) has proposed new regulations that address the 'prescribability' and 'switchability' of new formulations of already-approved drugs. These new criteria are known, respectively, as population and individual bioequivalence. Two methods have been proposed in the bioequivalence li...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: McNally RJ,Iyer H,Mathew T

    update_date: 2003-01-15 00:00:00

  • Efficient semiparametric inference for two-phase studies with outcome and covariate measurement errors.

    abstract::In modern observational studies using electronic health records or other routinely collected data, both the outcome and covariates of interest can be error-prone and their errors often correlated. A cost-effective solution is the two-phase design, under which the error-prone outcome and covariates are observed for all...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Tao R,Lotspeich SC,Amorim G,Shaw PA,Shepherd BE

    update_date: 2021-02-10 00:00:00

  • Considerations for analysis of time-to-event outcomes measured with error: Bias and correction with SIMEX.

    abstract::For time-to-event outcomes, a rich literature exists on the bias introduced by covariate measurement error in regression models, such as the Cox model, and methods of analysis to address this bias. By comparison, less attention has been given to understanding the impact or addressing errors in the failure time outcome...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Oh EJ,Shepherd BE,Lumley T,Shaw PA

    update_date: 2018-04-15 00:00:00

  • Biomarker discovery using targeted maximum-likelihood estimation: application to the treatment of antiretroviral-resistant HIV infection.

    abstract::Researchers in clinical science and bioinformatics frequently aim to learn which of a set of candidate biomarkers is important in determining a given outcome, and to rank the contributions of the candidates accordingly. This article introduces a new approach to research questions of this type, based on targeted maximu...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Bembom O,Petersen ML,Rhee SY,Fessel WJ,Sinisi SE,Shafer RW,van der Laan MJ

    update_date: 2009-01-15 00:00:00

  • Elasticity as a measure for online determination of remission points in ongoing epidemics.

    abstract::The correct identification of change-points during ongoing outbreak investigations of infectious diseases is a matter of paramount importance in epidemiology, with major implications for the management of health care resources, public health and, as the COVID-19 pandemic has shown, social life. Onsets, peaks, and infl...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Veres-Ferrer EJ,Pavía JM

    update_date: 2021-02-20 00:00:00

  • Stochastically curtailed phase II clinical trials.

    abstract::Phase II trials often test the null hypothesis H(0): p ≤ p(0) versus H(1): p ≥ p(1), where p is the true unknown proportion responding to the new treatment, p(0) is the greatest response proportion which is deemed clinically ineffective, and p(1) is the smallest response proportion which is deemed clinically effe...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Ayanlowo AO,Redden DT

    update_date: 2007-03-30 00:00:00

  • An immigration-death model to estimate the duration of malaria infection when detectability of the parasite is imperfect.

    abstract::Immigration-death models are proposed to analyse the infection dynamics in longitudinal studies of panels of heavily parasitized human hosts where parasites have been typed at regular intervals by PCR. Immigration refers to the acquisition of a new parasitic genotype, occurring at rate lambda, and death refers to the ...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Sama W,Owusu-Agyei S,Felger I,Vounatsou P,Smith T

    update_date: 2005-11-15 00:00:00

  • Modelling covariate adjusted mortality relative to a standard population.

    abstract::A study of long term survival of 1487 patients given an allogeneic bone marrow transplant for acute myelogenous leukaemia and 729 patients given a transplant for severe aplastic anaemia was conducted by the International Bone Marrow Transplant Registry. One aim of this study is to determine if the mortality rates of t...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Andersen PK,Horowitz MM,Klein JP,Socie G,Stone JV,Zhang MJ

    update_date: 1999-06-30 00:00:00

  • Correcting for regression in assessing the response to treatment in a selected population.

    abstract::Previous work on the consequences of regression to the mean for the interpretation of responses to treatment is extended to the situation where the response measured is the proportional change in some variable. Methods for correcting for the bias are discussed. ...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Curnow RN

    update_date: 1987-03-01 00:00:00

  • Design and estimation in clinical trials with subpopulation selection.

    abstract::Population heterogeneity is frequently observed among patients' treatment responses in clinical trials because of various factors such as clinical background, environmental, and genetic factors. Different subpopulations defined by those baseline factors can lead to differences in the benefit or safety profile of a the...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Chiu YD,Koenig F,Posch M,Jaki T

    update_date: 2018-12-20 00:00:00

  • Smooth centile curves for skew and kurtotic data modelled using the Box-Cox power exponential distribution.

    abstract::The Box-Cox power exponential (BCPE) distribution, developed in this paper, provides a model for a dependent variable Y exhibiting both skewness and kurtosis (leptokurtosis or platykurtosis). The distribution is defined by a power transformation Y(nu) having a shifted and scaled (truncated) standard power exponential ...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Rigby RA,Stasinopoulos DM

    update_date: 2004-10-15 00:00:00

  • There is no impact of exposure measurement error on latency estimation in linear models.

    abstract::Identification of the latency period for the effect of a time-varying exposure is key when assessing many environmental, nutritional, and behavioral risk factors. A pre-specified exposure metric involving an unknown latency parameter is often used in the statistical model for the exposure-disease relationship. Likelih...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Peskoe SB,Spiegelman D,Wang M

    update_date: 2019-03-30 00:00:00

  • Emerging and recurrent issues in drug development.

    abstract::This paper reviews several emerging and recurrent issues relating to the drug development process. These emerging issues include changes to the FDA regulatory environment, internationalization of drug development, advances in computer technology and visualization tools, and efforts to incorporate meta-analysis methodo...

    journal_title:Statistics in medicine

    pub_type: Journal Article, Review


    authors: Anello C

    update_date: 1999-09-15 00:00:00

  • Cluster detection diagnostics for small area health data: with reference to evaluation of local likelihood models.

    abstract::The focus of this paper is the development of a range of cluster detection diagnostics that can be used to assess the degree to which a clustering method recovers the true clustering behaviour of small area data. The diagnostics proposed range from individual region specific diagnostics to neighbourhood diagnostics, a...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Hossain MM,Lawson AB

    update_date: 2006-03-15 00:00:00

  • A random forest approach for competing risks based on pseudo-values.

    abstract::Random forest is a supervised learning method that combines many classification or regression trees for prediction. Here we describe an extension of the random forest method for building event risk prediction models in survival analysis with competing risks. In case of right-censored data, the event status at the pred...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Mogensen UB,Gerds TA

    update_date: 2013-08-15 00:00:00

  • Incorporating data from various trial designs into a mixed treatment comparison model.

    abstract::Estimates of relative efficacy between alternative treatments are crucial for decision making in health care. Bayesian mixed treatment comparison models provide a powerful methodology to obtain such estimates when head-to-head evidence is not available or insufficient. In recent years, this methodology has become wide...

    journal_title:Statistics in medicine

    pub_type: Journal Article


    authors: Schmitz S,Adams R,Walsh C

    update_date: 2013-07-30 00:00:00