Abstract:
:Multiple imputation by chained equations is a flexible and practical approach to handling missing data. We describe the principles of the method and show how to impute categorical and quantitative variables, including skewed variables. We give guidance on how to specify the imputation model and how many imputations are needed. We describe the practical analysis of multiply imputed data, including model building and model checking. We stress the limitations of the method and discuss the possible pitfalls. We illustrate the ideas using a data set in mental health, giving Stata code fragments.
journal_name
Stat Medjournal_title
Statistics in medicineauthors
White IR,Royston P,Wood AMdoi
10.1002/sim.4067subject
Has Abstractpub_date
2011-02-20 00:00:00pages
377-99issue
4eissn
0277-6715issn
1097-0258journal_volume
30pub_type
杂志文章abstract::Assessing and comparing the performance of correlated predictive scores are of current interest in precision medicine. Given the limitations of available theoretical approaches for assessing and comparing the predictive accuracy, numerical methods are highly desired which, however, have not been systematically develop...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.8566
更新日期:2020-09-10 00:00:00
abstract::Summarizing the information of many studies using a meta-analysis becomes more and more important, also in the field of diagnostic studies. The special challenge in meta-analysis of diagnostic accuracy studies is that in general sensitivity and specificity are co-primary endpoints. Across the studies both endpoints ar...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.6583
更新日期:2015-12-20 00:00:00
abstract::We examine different methods to pool binary outcomes used both in parallel and cross-over trials. Odds ratio (OR) estimators obtained from joint conditional probabilities in cross-over trials, such as the Mantel-Haenszel and Peto methods, are compared to an OR estimator using marginal results of cross-over trials. Whe...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.1206
更新日期:2002-08-15 00:00:00
abstract::Group sequential testing procedures have been proposed as an approach to conserving resources in biomarker validation studies. Previously, we derived the asymptotic properties of the sequential empirical positive predictive value (PPV) and negative predictive value (NPV) curves, which summarize the predictive accuracy...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.6790
更新日期:2016-04-15 00:00:00
abstract::In cancer clinical trials, patients often experience a recurrence of disease prior to the outcome of interest, overall survival. Additionally, for many cancers, there is a cured fraction of the population who will never experience a recurrence. There is often interest in how different covariates affect the probability...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.6056
更新日期:2014-05-10 00:00:00
abstract::In this work, we describe a two-stage sampling design to estimate the infection prevalence in a population. In the first stage, an imperfect diagnostic test was performed on a random sample of the population. In the second stage, a different imperfect test was performed in a stratified random sample of the first sampl...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.6545
更新日期:2015-11-10 00:00:00
abstract::Interpretation of the Mantoux test for tuberculous infection can be complicated by cross-reactions caused by infection with non-specific mycobacteria. Thus, the distribution of positive indurations is a mixture of two distributions. To estimate tuberculous infection prevalence, the marginal distribution of indurations...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.745
更新日期:2001-04-15 00:00:00
abstract::We consider several sources of heterogeneity in a clinical trial with patients' survival time as the main response criterion: differences in prognosis which can be attributed to a latent or ignored prognostic factor; differences in treatment efficacy in subgroups of patients, and differences in treatment combinations ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780060708
更新日期:1987-10-01 00:00:00
abstract::A new method is proposed for the surveillance of Down's syndrome among newborn. Despite the strong dependence of overall risk of Down's syndrome on maternal age, it has been suggested that an environmentally induced increase in risk may be additive over all maternal ages. The surveillance method introduced here is spe...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780120104
更新日期:1993-01-15 00:00:00
abstract::This article concerns construction of a confidence surface for tangential slopes of the dose-response surface of a combination therapy to identify where response increases as a function of drug dosage. This approach extends to the assessment of the effectiveness of the combination therapy. ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780110513
更新日期:1992-03-01 00:00:00
abstract::Incomplete and unbalanced multivariate data often arise in longitudinal studies due to missing or unequally-timed repeated measurements and/or the presence of time-varying covariates. A general approach to analysing such data is through maximum likelihood analysis using a linear model for the expected responses, and s...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780070132
更新日期:1988-01-01 00:00:00
abstract::This paper considers the analysis of longitudinal data complicated by the fact that during follow-up patients can be in different disease states, such as remission, relapse or death. If both the response of interest (for example, quality of life (QOL)) and the amount of missing data depend on this disease state, ignor...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3755
更新日期:2009-12-30 00:00:00
abstract::In this paper, we develop a method for the simultaneous estimation of spectral density functions (SDFs) for a collection of stationary time series that share some common features. Due to the similarities among the SDFs, the log-SDF can be represented using a common set of basis functions. The basis shared by the colle...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.7972
更新日期:2018-12-30 00:00:00
abstract::Having a surrogate for a definitive endpoint in a clinical trial can sometimes be useful when it is impractical, invasive or very time consuming to obtain the definitive endpoint. This paper discusses methods for assessing whether the surrogate-endpoint results of a trial can be used in place of definitive-endpoint re...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.1779
更新日期:2005-01-30 00:00:00
abstract::A trial of Duchenne muscular dystrophy involved tracking boys of all ages through a one-year baseline period, followed by a one-year trial of leucine versus placebo treatment. In this paper we develop a model for a total-muscle-strength score that uses the data of the extended baseline period in the evaluation of the ...
journal_title:Statistics in medicine
pub_type: 临床试验,杂志文章,随机对照试验
doi:10.1002/sim.4780050304
更新日期:1986-05-01 00:00:00
abstract::Multivariate random length data occur when we observe multiple measurements of a quantitative variable and the variable number of these measurements is also an observed outcome for each experimental unit. For example, for a patient with coronary artery disease, we may observe a number of lesions in that patient's coro...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/(sici)1097-0258(19990130)18:2<199::aid-sim
更新日期:1999-01-30 00:00:00
abstract::Girardeau, Ravaud and Donner in 2008 presented a formula for sample size calculations for cluster randomised crossover trials, when the intracluster correlation coefficient, interperiod correlation coefficient and mean cluster size are specified in advance. However, in many randomised trials, the number of clusters is...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.8191
更新日期:2019-08-15 00:00:00
abstract::We discuss the use of the trichotomous logistic model to discriminate between patients with gastrointestinal (GI) cancer, patients with benign GI disease and 'normal' subjects, using symptoms and the concentrations of some serum proteins that are potentially indicative of malignancy as covariates. A parsimonious model...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780040313
更新日期:1985-07-01 00:00:00
abstract::It is not uncommon for a continuous outcome variable Y to be dichotomized and analysed using logistic regression. Moser and Coombs (Statist. Med. 2004; 23:1843-1860) provide a method for converting the output from a standard linear regression analysis using the original continuous outcome Y to give much more efficient...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3474
更新日期:2009-01-30 00:00:00
abstract::In this paper we give an informal introduction to a robust method for survival analysis which is based on a modification of the usual partial likelihood estimator (PLE). Large sample results lead us to expect reduced bias for this robust estimator compared with the PLE whenever there are even slight violations of the ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/(SICI)1097-0258(19960530)15:10<1033::AID-S
更新日期:1996-05-30 00:00:00
abstract::Multi-type recurrent event data arise when two or more different kinds of events may occur repeatedly over a period of observation. The scientific objectives in such settings are often to describe features of the marginal processes and to study the association between the different types of events. Interval-censored m...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.1936
更新日期:2005-03-15 00:00:00
abstract::Many epidemiologic investigations are designed to study the effects of multiple exposures. Most of these studies are analysed either by fitting a risk-regression model with all exposures forced in the model, or by using a preliminary-testing algorithm, such as stepwise regression, to produce a smaller model. Research ...
journal_title:Statistics in medicine
pub_type: 杂志文章,评审
doi:10.1002/sim.4780120802
更新日期:1993-04-30 00:00:00
abstract::Phase II and phase III trials play a crucial role in drug development programs. They are costly and time consuming and, because of high failure rates in late development stages, at the same time risky investments. Commonly, sample size calculation of phase III is based on the treatment effect observed in phase II. The...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.6624
更新日期:2016-01-30 00:00:00
abstract::Confounding factors are commonly encountered in observational studies. Several confounder-adjusted tests to compare survival between differently exposed subjects were proposed. However, only few studies have compared their performances regarding type I error rates, and no study exists evaluating their type II error ra...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.6777
更新日期:2016-03-30 00:00:00
abstract::Recently, Wu and Follmann developed summary measures to adjust for informative drop-out in longitudinal studies where drop-out depends on the underlying true value of the response. In this paper we evaluate these procedures in the common situation where drop-out depends on the observed responses. We also discuss vario...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/1097-0258(20010115)20:1<93::aid-sim655>3.0
更新日期:2001-01-15 00:00:00
abstract::A measure of similarity for response curves is presented and its potential use is discussed. The distribution of a suitable test statistic for testing the independence of the course of two curves is derived. The method proposed is compared with other proposals in the literature for the analysis of paired response curv...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780081112
更新日期:1989-11-01 00:00:00
abstract::Observational studies provide a rich source of information for assessing effectiveness of treatment interventions in many situations where it is not ethical or practical to perform randomized controlled trials. However, such studies are prone to bias from hidden (unmeasured) confounding. A promising approach to identi...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.7051
更新日期:2016-12-10 00:00:00
abstract::We propose a probability distribution for an equivalence class of classification trees (that is, those that ignore the value of the cutpoints but retain tree structure). This distribution is parameterized by a central tree structure representing the true model, and a precision or concentration coefficient representing...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/(sici)1097-0258(19990330)18:6<727::aid-sim
更新日期:1999-03-30 00:00:00
abstract::The construction, validation and updating of a prognostic model for kidney graft survival is reported using data from the Eurotransplant database. First, a model is constructed for data from transplantations in the period 1984 to 1987. The model is later updated for the 1988 1990 data. The first data set was randomly ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780141806
更新日期:1995-09-30 00:00:00
abstract::One is often interested in the ratio of two variables, for example in genetics, assessing drug effectiveness, and in health economics. In this paper, we derive an explicit geometric solution to the general problem of identifying the two tangents from an arbitrary external point to an ellipse. This solution permits num...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3398
更新日期:2008-12-10 00:00:00