Evaluation of software for multiple imputation of semi-continuous data.

Abstract:

It is now widely accepted that multiple imputation (MI) methods handle the uncertainty of missing data more appropriately than single imputation methods. Several standard statistical software packages, such as SAS, R and STATA, have standard procedures or user-written programs to perform MI. The performance of these packages is generally acceptable for most types of data. However, it is unclear whether these applications are appropriate for imputing data with a large proportion of zero values, which results in a semi-continuous distribution. In addition, it is not clear whether these applications are suitable when the distribution of the data needs to be preserved for subsequent analysis. This article reports the findings of a simulation study carried out to evaluate the performance of the MI procedures for handling semi-continuous data within these statistical packages. Complete resource use data on 1060 participants from a large randomized clinical trial were used as the simulation population, from which 500 bootstrap samples were obtained and missing data imposed. The study found differences in the performance of the MI programs when imputing semi-continuous data. Caution should be exercised when deciding which program should perform MI on this type of data.
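The simulation design described in the abstract (a fixed population of 1060 records, 500 bootstrap samples, missing data imposed on each) can be sketched roughly as follows. This is only an illustrative sketch, not the authors' procedure: the synthetic population, the 40% share of zeros, the lognormal positive part, the 20% missingness rate and the missing-completely-at-random mechanism are all assumptions made here, and the function names are hypothetical.

```python
import random


def make_population(n=1060, p_zero=0.4, seed=0):
    """Hypothetical semi-continuous data: a point mass at zero plus a skewed positive part."""
    rng = random.Random(seed)
    return [
        0.0 if rng.random() < p_zero else rng.lognormvariate(5.0, 1.0)
        for _ in range(n)
    ]


def bootstrap_with_missing(population, p_missing=0.2, rng=None):
    """Resample with replacement, then set values to None completely at random (assumed mechanism)."""
    rng = rng or random.Random()
    sample = [rng.choice(population) for _ in population]
    return [None if rng.random() < p_missing else x for x in sample]


def summarize(sample):
    """Report sample size, number of missing values and share of zeros among observed values."""
    observed = [x for x in sample if x is not None]
    return {
        "n": len(sample),
        "n_missing": len(sample) - len(observed),
        "prop_zero_observed": sum(1 for x in observed if x == 0.0) / len(observed),
    }


population = make_population()
rng = random.Random(1)
# 500 bootstrap replicates of the same size as the population, each with missing values imposed.
replicates = [bootstrap_with_missing(population, rng=rng) for _ in range(500)]
print(summarize(replicates[0]))
```

Each replicate would then be passed to the MI procedure under evaluation, and the imputed results compared against the known complete data.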

journal_name

Stat Methods Med Res

authors

Yu LM,Burton A,Rivero-Arias O

doi

10.1177/0962280206074464

subject

Has Abstract

pub_date

2007-06-01 00:00:00

pages

243-58

issue

3

eissn

1477-0334

issn

0962-2802

pii

16/3/243

journal_volume

16

pub_type

Journal Article
  • Controlling false positive selections in high-dimensional regression and causal inference.

    abstract::Guarding against false positive selections is important in many applications. We discuss methods based on subsampling and sample splitting for controlling the expected number of false positives and assigning p-values. They are generic and especially useful for high-dimensional settings. We review encouraging results f...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280211428371

    authors: Bühlmann P,Rütimann P,Kalisch M

    update_date:2013-10-01 00:00:00

  • Estimation of sensitivity depending on sojourn time and time spent in preclinical state.

    abstract::The probability model for periodic screening was extended to provide statistical inference for sensitivity depending on sojourn time, in which the sensitivity was modeled as a function of time spent in the preclinical state and the sojourn time. The likelihood function with the proposed sensitivity model was then eval...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280212465499

    authors: Kim S,Wu D

    update_date:2016-04-01 00:00:00

  • An ad hoc method for dual adjusting for measurement errors and nonresponse bias for estimating prevalence in survey data: Application to Iranian mental health survey on any illicit drug use.

    abstract::Purpose: The prevalence estimates of binary variables in sample surveys are often subject to two systematic errors: measurement error and nonresponse bias. A multiple-bias analysis is essential to adjust for both biases. Methods: In this paper, we linked the latent class log-linear and proxy pattern-mixture models to ad...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280217690939

    authors: Khalagi K,Mansournia MA,Motevalian SA,Nourijelyani K,Rahimi-Movaghar A,Bakhtiyari M

    update_date:2018-10-01 00:00:00

  • Statistical challenges in assessing potential efficacy of complex interventions in pilot or feasibility studies.

    abstract::Early phase trials of complex interventions currently focus on assessing the feasibility of a large randomised control trial and on conducting pilot work. Assessing the efficacy of the proposed intervention is generally discouraged, due to concerns of underpowered hypothesis testing. In contrast, early assessment of e...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280215589507

    authors: Wilson DT,Walwyn RE,Brown J,Farrin AJ,Brown SR

    update_date:2016-06-01 00:00:00

  • Bayesian sample size calculation for estimation of the difference between two binomial proportions.

    abstract::In this study, we discuss a decision theoretic or fully Bayesian approach to the sample size question in clinical trials with binary responses. Data are assumed to come from two binomial distributions. A Dirichlet distribution is assumed to describe prior knowledge of the two success probabilities p1 and p2. The param...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280211399562

    authors: Pezeshk H,Nematollahi N,Maroufy V,Marriott P,Gittins J

    update_date:2013-12-01 00:00:00

  • Applications of temporal kernel canonical correlation analysis in adherence studies.

    abstract::Adherence to medication is often measured as a continuous outcome but analyzed as a dichotomous outcome due to lack of appropriate tools. In this paper, we illustrate the use of the temporal kernel canonical correlation analysis (tkCCA) as a method to analyze adherence measurements and symptom levels on a continuous s...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280215598805

    authors: John M,Lencz T,Ferbinteanu J,Gallego JA,Robinson DG

    update_date:2017-10-01 00:00:00

  • The asymptotic maximal procedure for subject randomization in clinical trials.

    abstract::The maximal procedure is a restricted randomization method that maximizes the number of feasible allocation sequences under the constraints of the maximum tolerated imbalance and the allocation sequence length. It assigns an equal probability to all feasible sequences. However, its implementation is not easy due to th...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280216677107

    authors: Zhao W,Berger VW,Yu Z

    update_date:2018-07-01 00:00:00

  • Analysis of phase II methodologies for single-arm clinical trials with multiple endpoints in rare cancers: An example in Ewing's sarcoma.

    abstract::Trials run in either rare diseases, such as rare cancers, or rare sub-populations of common diseases are challenging in terms of identifying, recruiting and treating sufficient patients in a sensible period. Treatments for rare diseases are often designed for other disease areas and then later proposed as possible tre...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280216662070

    authors: Dutton P,Love SB,Billingham L,Hassan AB

    update_date:2018-05-01 00:00:00

  • Projections of cancer mortality risks using spatio-temporal P-spline models.

    abstract::Cancer mortality risk estimates are essential for planning resource allocation and designing and evaluating cancer prevention and management strategies. However, mortality figures generally become available after a few years, making it necessary to develop reliable procedures to provide current and near future mortality ...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280212446366

    authors: Ugarte MD,Goicoa T,Etxeberria J,Militino AF

    update_date:2012-10-01 00:00:00

  • Estimating the effect of treatment on binary outcomes using full matching on the propensity score.

    abstract::Many non-experimental studies use propensity-score methods to estimate causal effects by balancing treatment and control groups on a set of observed baseline covariates. Full matching on the propensity score has emerged as a particularly effective and flexible method for utilizing all available data, and creating well...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280215601134

    authors: Austin PC,Stuart EA

    update_date:2017-12-01 00:00:00

  • A robust score test of homogeneity for zero-inflated count data.

    abstract::In many applications of zero-inflated models, score tests are often used to evaluate whether the population heterogeneity as implied by these models is consistent with the data. The most frequently cited justification for using score tests is that they only require estimation under the null hypothesis. Because this es...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280220937324

    authors: Hsu WW,Todem D,Mawella NR,Kim K,Rosenkranz RR

    update_date:2020-12-01 00:00:00

  • Longitudinal prostate-specific antigen reference ranges: Choosing the underlying model of age-related changes.

    abstract::Serial measurements of prostate-specific antigen (PSA) are used as a biomarker for men diagnosed with prostate cancer following an active monitoring programme. Distinguishing pathological changes from natural age-related changes is not straightforward. Here, we compare four approaches to modelling age-related change i...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280213503928

    authors: Simpkin AJ,Metcalfe C,Martin RM,Lane JA,Donovan JL,Hamdy FC,Neal DE,Tilling K

    update_date:2016-10-01 00:00:00

  • A comparison of machine learning methods for classification using simulation with multiple real data examples from mental health studies.

    abstract:BACKGROUND:Recent literature on the comparison of machine learning methods has raised questions about the neutrality, unbiasedness and utility of many comparative studies. Reporting of results on favourable datasets and sampling error in the estimated performance measures based on single samples are thought to be the m...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280213502437

    authors: Khondoker M,Dobson R,Skirrow C,Simmons A,Stahl D

    update_date:2016-10-01 00:00:00

  • Cluster analysis and related techniques in medical research.

    abstract::In this paper we review methods of cluster analysis in the context of classifying patients on the basis of clinical and/or laboratory type observations. Both hierarchical and non-hierarchical methods of clustering are considered, although the emphasis is on the latter type, with particular attention devoted to the mix...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article, Review

    doi:10.1177/096228029200100103

    authors: McLachlan GJ

    update_date:1992-01-01 00:00:00

  • Obtaining evidence by a single well-powered trial or several modestly powered trials.

    abstract::There is debate whether clinical trials with suboptimal power are justified and whether results from large studies are more reliable than the (combined) results of smaller trials. We quantified the error rates for evaluations based on single conventionally powered trials (80% or 90% power) versus evaluations based on ...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280212461098

    authors: IntHout J,Ioannidis JP,Borm GF

    update_date:2016-04-01 00:00:00

  • Estimation of half-life periods in nonlinear data with fractional polynomials.

    abstract::Regression models are frequently used to model the functional relationship between an interesting outcome parameter and one or more potentially relevant explanatory variables. Objectives can be to set up as a prognostic model, for example, or an estimation model for a certain parameter of interest. Determining half-li...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280213502403

    authors: Mayer B,Keller F,Syrovets T,Wittau M

    update_date:2016-10-01 00:00:00

  • A corrected formulation for marginal inference derived from two-part mixed models for longitudinal semi-continuous data.

    abstract::For semi-continuous data which are a mixture of true zeros and continuously distributed positive values, the use of two-part mixed models provides a convenient modelling framework. However, deriving population-averaged (marginal) effects from such models is not always straightforward. Su et al. presented a model that ...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280213509798

    authors: Tom BD,Su L,Farewell VT

    update_date:2016-10-01 00:00:00

  • A transformation class for spatio-temporal survival data with a cure fraction.

    abstract::We propose a hierarchical Bayesian methodology to model spatially or spatio-temporal clustered survival data with possibility of cure. A flexible continuous transformation class of survival curves indexed by a single parameter is used. This transformation model is a larger class of models containing two special cases ...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280212445658

    authors: Hurtado Rúa SM,Dey DK

    update_date:2016-02-01 00:00:00

  • Efficient two-stage sequential arrays of proof of concept studies for pharmaceutical portfolios.

    abstract::Previous work has shown that individual randomized "proof-of-concept" (PoC) studies may be designed to maximize cost-effectiveness, subject to an overall PoC budget constraint. Maximizing cost-effectiveness has also been considered for arrays of simultaneously executed PoC studies. Defining Type III error as the oppor...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280220958177

    authors: He L,Du L,Antonijevic Z,Posch M,Korostyshevskiy VR,Beckman RA

    update_date:2020-09-21 00:00:00

  • Semiparametric analysis of correlated and interval-censored event-history data.

    abstract::We propose a semiparametric multi-state frailty model to analyze clustered event-history data subject to interval censoring. The proposed model is motivated by an attempt to study the life course of dental caries at the tooth level, taking into account the multiplicity of caries states and the intra-oral clustering of...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280218788383

    authors: Pak D,Li C,Todem D

    update_date:2019-09-01 00:00:00

  • Adjustment for treatment changes in epilepsy trials: A comparison of causal methods for time-to-event outcomes.

    abstract:BACKGROUND:When trials are subject to departures from randomised treatment, simple statistical methods that aim to estimate treatment efficacy, such as per protocol or as treated analyses, typically introduce selection bias. More appropriate methods to adjust for departure from randomised treatment are rarely employed,...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280217735560

    authors: Dodd S,Williamson P,White IR

    update_date:2019-03-01 00:00:00

  • Towards joint disease mapping.

    abstract::This article discusses and extends statistical models to jointly analyse the spatial variation of rates of several diseases with common risk factors. We start with a review of methods for separate analyses of diseases, then move to ecological regression approaches, where the rates from one of the diseases enter as sur...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article, Review

    doi:10.1191/0962280205sm389oa

    authors: Held L,Natário I,Fenton SE,Rue H,Becker N

    update_date:2005-02-01 00:00:00

  • Bayesian nonparametric mixed-effects joint model for longitudinal-competing risks data analysis in presence of multiple data features.

    abstract::Recently, the joint analysis of longitudinal and survival data has been an active research area. Most joint models focus on survival data with only one type of failure. The research on joint modeling of longitudinal and competing risks survival data is sparse. Even so, many joint models for this type of data assume pa...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280215597939

    authors: Lu T

    update_date:2017-10-01 00:00:00

  • Fast clustering using adaptive density peak detection.

    abstract::Common limitations of clustering methods include the slow algorithm convergence, the instability of the pre-specification on a number of intrinsic parameters, and the lack of robustness to outliers. A recent clustering approach proposed a fast search algorithm of cluster centers based on their local densities. However...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280215609948

    authors: Wang XF,Xu Y

    update_date:2017-12-01 00:00:00

  • A unified approach for assessing heterogeneity in age-period-cohort model parameters using random effects.

    abstract::Age-period-cohort models are a popular tool for studying population-level rates; for example, trends in cancer incidence and mortality. Age-period-cohort models decompose observed trends into age effects that correlate with natural history, period effects that reveal factors impacting all ages simultaneously (e.g. inn...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280217713033

    authors: Chernyavskiy P,Little MP,Rosenberg PS

    update_date:2019-01-01 00:00:00

  • Estimation of regression quantiles in complex surveys with data missing at random: An application to birthweight determinants.

    abstract::The estimation of population parameters using complex survey data requires careful statistical modelling to account for the design features. This is further complicated by unit and item nonresponse for which a number of methods have been developed in order to reduce estimation bias. In this paper, we address some issu...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280213484401

    authors: Geraci M

    update_date:2016-08-01 00:00:00

  • Joint nested frailty models for clustered recurrent and terminal events: An application to colonoscopy screening visits and colorectal cancer risks in Lynch Syndrome families.

    abstract::Joint models for recurrent and terminal events have not yet been developed for clustered data. The goals of our study are to develop a statistical framework for modelling clustered recurrent and terminal events and to perform dynamic predictions of the terminal event in family studies. We propose a joint nested frailt...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280219863076

    authors: Choi YH,Jacqmin-Gadda H,Król A,Parfrey P,Briollais L,Rondeau V

    update_date:2020-05-01 00:00:00

  • Expected p-values in light of an ROC curve analysis applied to optimal multiple testing procedures.

    abstract::Many statistical studies report p-values for inferential purposes. In several scenarios, the stochastic aspect of p-values is neglected, which may contribute to drawing wrong conclusions in real data experiments. The stochastic nature of p-values makes their use to examine the performance of given testing procedures o...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280217704451

    authors: Vexler A,Yu J,Zhao Y,Hutson AD,Gurevich G

    update_date:2018-12-01 00:00:00

  • Comparing cluster-level dynamic treatment regimens using sequential, multiple assignment, randomized trials: Regression estimation and sample size considerations.

    abstract::Cluster-level dynamic treatment regimens can be used to guide sequential treatment decision-making at the cluster level in order to improve outcomes at the individual or patient-level. In a cluster-level dynamic treatment regimen, the treatment is potentially adapted and re-adapted over time based on changes in the cl...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article

    doi:10.1177/0962280217708654

    authors: NeCamp T,Kilbourne A,Almirall D

    update_date:2017-08-01 00:00:00

  • Allele-sharing among affected relatives: non-parametric methods for identifying genes.

    abstract::Non-parametric linkage analysis examines similarities among affected relatives in alleles of one or more genetic markers (pieces of DNA at known locations on a chromosome). The objective is to evaluate departures from the null hypothesis that the markers are not near a disease gene. Under the null hypothesis, Mendel's...

    journal_title:Statistical methods in medical research

    pub_type: Journal Article, Review

    doi:10.1177/096228020101000103

    authors: Shih MC,Whittemore AS

    update_date:2001-02-01 00:00:00