Multiple imputation using chained equations: Issues and guidance for practice.

Abstract:

:Multiple imputation by chained equations is a flexible and practical approach to handling missing data. We describe the principles of the method and show how to impute categorical and quantitative variables, including skewed variables. We give guidance on how to specify the imputation model and how many imputations are needed. We describe the practical analysis of multiply imputed data, including model building and model checking. We stress the limitations of the method and discuss the possible pitfalls. We illustrate the ideas using a data set in mental health, giving Stata code fragments.

journal_name

Stat Med

journal_title

Statistics in medicine

authors

White IR,Royston P,Wood AM

doi

10.1002/sim.4067

subject

Has Abstract

pub_date

2011-02-20 00:00:00

pages

377-99

issue

4

eissn

0277-6715

issn

1097-0258

journal_volume

30

pub_type

杂志文章
  • A numerical strategy to evaluate performance of predictive scores via a copula-based approach.

    abstract::Assessing and comparing the performance of correlated predictive scores are of current interest in precision medicine. Given the limitations of available theoretical approaches for assessing and comparing the predictive accuracy, numerical methods are highly desired which, however, have not been systematically develop...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8566

    authors: Zhang Y,Shao Y

    更新日期:2020-09-10 00:00:00

  • Nonparametric meta-analysis for diagnostic accuracy studies.

    abstract::Summarizing the information of many studies using a meta-analysis becomes more and more important, also in the field of diagnostic studies. The special challenge in meta-analysis of diagnostic accuracy studies is that in general sensitivity and specificity are co-primary endpoints. Across the studies both endpoints ar...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6583

    authors: Zapf A,Hoyer A,Kramer K,Kuss O

    更新日期:2015-12-20 00:00:00

  • Meta-analysis combining parallel and cross-over clinical trials. II: Binary outcomes.

    abstract::We examine different methods to pool binary outcomes used both in parallel and cross-over trials. Odds ratio (OR) estimators obtained from joint conditional probabilities in cross-over trials, such as the Mantel-Haenszel and Peto methods, are compared to an OR estimator using marginal results of cross-over trials. Whe...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1206

    authors: Curtin F,Elbourne D,Altman DG

    更新日期:2002-08-15 00:00:00

  • Group sequential testing of the predictive accuracy of a continuous biomarker with unknown prevalence.

    abstract::Group sequential testing procedures have been proposed as an approach to conserving resources in biomarker validation studies. Previously, we derived the asymptotic properties of the sequential empirical positive predictive value (PPV) and negative predictive value (NPV) curves, which summarize the predictive accuracy...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6790

    authors: Koopmeiners JS,Feng Z

    更新日期:2016-04-15 00:00:00

  • Multi-state models for colon cancer recurrence and death with a cured fraction.

    abstract::In cancer clinical trials, patients often experience a recurrence of disease prior to the outcome of interest, overall survival. Additionally, for many cancers, there is a cured fraction of the population who will never experience a recurrence. There is often interest in how different covariates affect the probability...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6056

    authors: Conlon AS,Taylor JM,Sargent DJ

    更新日期:2014-05-10 00:00:00

  • Estimation of infection prevalence and sensitivity in a stratified two-stage sampling design employing highly specific diagnostic tests when there is no gold standard.

    abstract::In this work, we describe a two-stage sampling design to estimate the infection prevalence in a population. In the first stage, an imperfect diagnostic test was performed on a random sample of the population. In the second stage, a different imperfect test was performed in a stratified random sample of the first sampl...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6545

    authors: Miller E,Huppert A,Novikov I,Warburg A,Hailu A,Abbasi I,Freedman LS

    更新日期:2015-11-10 00:00:00

  • Logistic discrimination of mixtures of M. tuberculosis and non-specific tuberculin reactions.

    abstract::Interpretation of the Mantoux test for tuberculous infection can be complicated by cross-reactions caused by infection with non-specific mycobacteria. Thus, the distribution of positive indurations is a mixture of two distributions. To estimate tuberculous infection prevalence, the marginal distribution of indurations...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.745

    authors: Nagelkerke NJ,Borgdorff MW,Kim SJ

    更新日期:2001-04-15 00:00:00

  • The impact of heterogeneity on the comparison of survival times.

    abstract::We consider several sources of heterogeneity in a clinical trial with patients' survival time as the main response criterion: differences in prognosis which can be attributed to a latent or ignored prognostic factor; differences in treatment efficacy in subgroups of patients, and differences in treatment combinations ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780060708

    authors: Schumacher M,Olschewski M,Schmoor C

    更新日期:1987-10-01 00:00:00

  • A new sequential procedure for surveillance of Down's syndrome.

    abstract::A new method is proposed for the surveillance of Down's syndrome among newborn. Despite the strong dependence of overall risk of Down's syndrome on maternal age, it has been suggested that an environmentally induced increase in risk may be additive over all maternal ages. The surveillance method introduced here is spe...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780120104

    authors: Lie RT,Heuch I,Irgens LM

    更新日期:1993-01-15 00:00:00

  • On identifying a positive dose-response surface for combination agents.

    abstract::This article concerns construction of a confidence surface for tangential slopes of the dose-response surface of a combination therapy to identify where response increases as a function of drug dosage. This approach extends to the assessment of the effectiveness of the combination therapy. ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780110513

    authors: Hung HM

    更新日期:1992-03-01 00:00:00

  • Analysis of incomplete multivariate data using linear models with structured covariance matrices.

    abstract::Incomplete and unbalanced multivariate data often arise in longitudinal studies due to missing or unequally-timed repeated measurements and/or the presence of time-varying covariates. A general approach to analysing such data is through maximum likelihood analysis using a linear model for the expected responses, and s...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780070132

    authors: Schluchter MD

    更新日期:1988-01-01 00:00:00

  • Analyzing longitudinal data with patients in different disease states during follow-up and death as final state.

    abstract::This paper considers the analysis of longitudinal data complicated by the fact that during follow-up patients can be in different disease states, such as remission, relapse or death. If both the response of interest (for example, quality of life (QOL)) and the amount of missing data depend on this disease state, ignor...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3755

    authors: le Cessie S,de Vries EG,Buijs C,Post WJ

    更新日期:2009-12-30 00:00:00

  • Nonparametric collective spectral density estimation with an application to clustering the brain signals.

    abstract::In this paper, we develop a method for the simultaneous estimation of spectral density functions (SDFs) for a collection of stationary time series that share some common features. Due to the similarities among the SDFs, the log-SDF can be represented using a common set of basis functions. The basis shared by the colle...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.7972

    authors: Maadooliat M,Sun Y,Chen T

    更新日期:2018-12-30 00:00:00

  • Assessing surrogates as trial endpoints using mixed models.

    abstract::Having a surrogate for a definitive endpoint in a clinical trial can sometimes be useful when it is impractical, invasive or very time consuming to obtain the definitive endpoint. This paper discusses methods for assessing whether the surrogate-endpoint results of a trial can be used in place of definitive-endpoint re...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1779

    authors: Korn EL,Albert PS,McShane LM

    更新日期:2005-01-30 00:00:00

  • The use of an extended baseline period in the evaluation of treatment in a longitudinal Duchenne muscular dystrophy trial.

    abstract::A trial of Duchenne muscular dystrophy involved tracking boys of all ages through a one-year baseline period, followed by a one-year trial of leucine versus placebo treatment. In this paper we develop a model for a total-muscle-strength score that uses the data of the extended baseline period in the evaluation of the ...

    journal_title:Statistics in medicine

    pub_type: 临床试验,杂志文章,随机对照试验

    doi:10.1002/sim.4780050304

    authors: Madsen KS,Miller JP,Province MA

    更新日期:1986-05-01 00:00:00

  • A regression model for multivariate random length data.

    abstract::Multivariate random length data occur when we observe multiple measurements of a quantitative variable and the variable number of these measurements is also an observed outcome for each experimental unit. For example, for a patient with coronary artery disease, we may observe a number of lesions in that patient's coro...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(19990130)18:2<199::aid-sim

    authors: Barnhart HX,Kosinski AS,Sampson AR

    更新日期:1999-01-30 00:00:00

  • A note on sample size calculations for cluster randomised crossover trials with a fixed number of clusters.

    abstract::Girardeau, Ravaud and Donner in 2008 presented a formula for sample size calculations for cluster randomised crossover trials, when the intracluster correlation coefficient, interperiod correlation coefficient and mean cluster size are specified in advance. However, in many randomised trials, the number of clusters is...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8191

    authors: Kelly TL,Pratt N

    更新日期:2019-08-15 00:00:00

  • Hypothesis testing in the polychotomous logistic model with an application to detecting gastrointestinal cancer.

    abstract::We discuss the use of the trichotomous logistic model to discriminate between patients with gastrointestinal (GI) cancer, patients with benign GI disease and 'normal' subjects, using symptoms and the concentrations of some serum proteins that are potentially indicative of malignancy as covariates. A parsimonious model...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780040313

    authors: Marshall RJ,Chisholm EM

    更新日期:1985-07-01 00:00:00

  • Case-control analysis with a continuous outcome variable.

    abstract::It is not uncommon for a continuous outcome variable Y to be dichotomized and analysed using logistic regression. Moser and Coombs (Statist. Med. 2004; 23:1843-1860) provide a method for converting the output from a standard linear regression analysis using the original continuous outcome Y to give much more efficient...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3474

    authors: Jiang Y,Scott A,Wild CJ

    更新日期:2009-01-30 00:00:00

  • A robust method for proportional hazards regression.

    abstract::In this paper we give an informal introduction to a robust method for survival analysis which is based on a modification of the usual partial likelihood estimator (PLE). Large sample results lead us to expect reduced bias for this robust estimator compared with the PLE whenever there are even slight violations of the ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(SICI)1097-0258(19960530)15:10<1033::AID-S

    authors: Minder CE,Bednarski T

    更新日期:1996-05-30 00:00:00

  • Statistical methods for multivariate interval-censored recurrent events.

    abstract::Multi-type recurrent event data arise when two or more different kinds of events may occur repeatedly over a period of observation. The scientific objectives in such settings are often to describe features of the marginal processes and to study the association between the different types of events. Interval-censored m...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1936

    authors: Chen BE,Cook RJ,Lawless JF,Zhan M

    更新日期:2005-03-15 00:00:00

  • Methods for epidemiologic analyses of multiple exposures: a review and comparative study of maximum-likelihood, preliminary-testing, and empirical-Bayes regression.

    abstract::Many epidemiologic investigations are designed to study the effects of multiple exposures. Most of these studies are analysed either by fitting a risk-regression model with all exposures forced in the model, or by using a preliminary-testing algorithm, such as stepwise regression, to produce a smaller model. Research ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章,评审

    doi:10.1002/sim.4780120802

    authors: Greenland S

    更新日期:1993-04-30 00:00:00

  • Utility-based optimization of phase II/III programs.

    abstract::Phase II and phase III trials play a crucial role in drug development programs. They are costly and time consuming and, because of high failure rates in late development stages, at the same time risky investments. Commonly, sample size calculation of phase III is based on the treatment effect observed in phase II. The...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6624

    authors: Kirchner M,Kieser M,Götte H,Schüler A

    更新日期:2016-01-30 00:00:00

  • Comparisons of the performance of different statistical tests for time-to-event analysis with confounding factors: practical illustrations in kidney transplantation.

    abstract::Confounding factors are commonly encountered in observational studies. Several confounder-adjusted tests to compare survival between differently exposed subjects were proposed. However, only few studies have compared their performances regarding type I error rates, and no study exists evaluating their type II error ra...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6777

    authors: Le Borgne F,Giraudeau B,Querard AH,Giral M,Foucher Y

    更新日期:2016-03-30 00:00:00

  • Adjusting for drop-out in clinical trials with repeated measures: design and analysis issues.

    abstract::Recently, Wu and Follmann developed summary measures to adjust for informative drop-out in longitudinal studies where drop-out depends on the underlying true value of the response. In this paper we evaluate these procedures in the common situation where drop-out depends on the observed responses. We also discuss vario...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/1097-0258(20010115)20:1<93::aid-sim655>3.0

    authors: Wu MC,Albert PS,Wu BU

    更新日期:2001-01-15 00:00:00

  • A measure of similarity for response curves based on ranks.

    abstract::A measure of similarity for response curves is presented and its potential use is discussed. The distribution of a suitable test statistic for testing the independence of the course of two curves is derived. The method proposed is compared with other proposals in the literature for the analysis of paired response curv...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780081112

    authors: Schulgen G

    更新日期:1989-11-01 00:00:00

  • Prior event rate ratio adjustment for hidden confounding in observational studies of treatment effectiveness: a pairwise Cox likelihood approach.

    abstract::Observational studies provide a rich source of information for assessing effectiveness of treatment interventions in many situations where it is not ethical or practical to perform randomized controlled trials. However, such studies are prone to bias from hidden (unmeasured) confounding. A promising approach to identi...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.7051

    authors: Lin NX,Henley WE

    更新日期:2016-12-10 00:00:00

  • Combining classification trees using MLE.

    abstract::We propose a probability distribution for an equivalence class of classification trees (that is, those that ignore the value of the cutpoints but retain tree structure). This distribution is parameterized by a central tree structure representing the true model, and a precision or concentration coefficient representing...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(19990330)18:6<727::aid-sim

    authors: Shannon WD,Banks D

    更新日期:1999-03-30 00:00:00

  • Construction, validation and updating of a prognostic model for kidney graft survival.

    abstract::The construction, validation and updating of a prognostic model for kidney graft survival is reported using data from the Eurotransplant database. First, a model is constructed for data from transplantations in the period 1984 to 1987. The model is later updated for the 1988 1990 data. The first data set was randomly ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780141806

    authors: Van Houwelingen HC,Thorogood J

    更新日期:1995-09-30 00:00:00

  • A geometric confidence ellipse approach to the estimation of the ratio of two variables.

    abstract::One is often interested in the ratio of two variables, for example in genetics, assessing drug effectiveness, and in health economics. In this paper, we derive an explicit geometric solution to the general problem of identifying the two tangents from an arbitrary external point to an ellipse. This solution permits num...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3398

    authors: Walter SD,Gafni A,Birch S

    更新日期:2008-12-10 00:00:00