Complete imputation of missing repeated categorical data: one-sample applications.

Abstract:

Longitudinal studies with repeated measures are often subject to non-response. Methods currently employed to alleviate the difficulties caused by missing data are typically unsatisfactory, especially when the cause of the missingness is related to the outcomes. We present an approach for incomplete categorical data in the repeated measures setting that allows missing data to depend on other observed outcomes for a study subject. The proposed methodology also allows a broader examination of study findings through interpretation of results in the framework of the set of all possible test statistics that might have been observed had no data been missing. The proposed approach consists of the following general steps. First, we generate all possible sets of missing values and form a set of possible complete data sets. We then weight each data set according to clearly defined assumptions and apply an appropriate statistical test procedure to each data set, combining the results to give an overall indication of significance. We make use of the EM algorithm and a Bayesian prior in this approach. While not restricted to the one-sample case, the proposed methodology is illustrated for one-sample data and compared to the common complete-case and available-case analysis methods.
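The general steps named in the abstract (enumerate every completion, weight it, test it, combine) can be illustrated with a small sketch. This is not the authors' procedure: the weights here assume each missing binary value is an independent Bernoulli draw with probability `p_one`, standing in for the EM-based and Bayesian weighting the abstract mentions, and an exact sign test on first-versus-last visit stands in for "an appropriate statistical test procedure". All function names are hypothetical.

```python
from itertools import product
from math import comb


def enumerate_completions(data, categories=(0, 1)):
    """Yield (completed_data, fill) for every way of filling the None cells."""
    missing = [(i, j) for i, row in enumerate(data)
               for j, value in enumerate(row) if value is None]
    for fill in product(categories, repeat=len(missing)):
        completed = [list(row) for row in data]  # copy so the input is untouched
        for (i, j), value in zip(missing, fill):
            completed[i][j] = value
        yield completed, fill


def fill_weight(fill, p_one):
    """Weight of one fill pattern under the ASSUMED independent Bernoulli(p_one) model."""
    weight = 1.0
    for value in fill:
        weight *= p_one if value == 1 else 1.0 - p_one
    return weight


def sign_test_p(completed):
    """Two-sided exact sign test on change between first and last visit (stand-in test)."""
    ups = sum(1 for row in completed if row[-1] > row[0])
    downs = sum(1 for row in completed if row[-1] < row[0])
    n = ups + downs
    if n == 0:
        return 1.0
    k = min(ups, downs)
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


def weighted_significance(data, p_one=0.5, alpha=0.05):
    """Weighted share of completed data sets whose test rejects at level alpha."""
    results = [(fill_weight(fill, p_one), sign_test_p(completed))
               for completed, fill in enumerate_completions(data)]
    total = sum(weight for weight, _ in results)
    return sum(weight for weight, p in results if p < alpha) / total
```

Calling `weighted_significance` on a list of per-subject rows, with `None` marking a missing visit, returns a value between 0 and 1. Full enumeration grows exponentially with the number of missing cells, so a sketch like this is only practical for small data sets.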

journal_name

Stat Med

journal_title

Statistics in medicine

authors

West CP,Dawson JD

doi

10.1002/sim.982

subject

Has Abstract

pub_date

2002-01-30 00:00:00

pages

203-17

issue

2

eissn

1097-0258

issn

0277-6715

pii

10.1002/sim.982

journal_volume

21

pub_type

Journal Article
  • Conditional power and predictive power based on right censored data with supplementary auxiliary information.

    abstract: Conditional power and predictive power provide estimates of the probability of success at the end of the trial based on the information from the interim analysis. The observed value of the time to event endpoint at the interim analysis could be biased for the true treatment effect due to early censoring, leading to a ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7673

    authors: Sun L,Wan Y

    update_date: 2018-08-15 00:00:00

  • Hierarchical multiple informants models: examining food environment contributions to the childhood obesity epidemic.

    abstract: Methods for multiple informants help to estimate the marginal effect of each multiple source predictor and formally compare the strength of their association with an outcome. We extend multiple informant methods to the case of hierarchical data structures to account for within cluster correlation. We apply the propose...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.5967

    authors: Baek J,Sánchez BN,Sanchez-Vaznaugh EV

    update_date: 2014-02-20 00:00:00

  • A flexible, interpretable framework for assessing sensitivity to unmeasured confounding.

    abstract: When estimating causal effects, unmeasured confounding and model misspecification are both potential sources of bias. We propose a method to simultaneously address both issues in the form of a semi-parametric sensitivity analysis. In particular, our approach incorporates Bayesian Additive Regression Trees into a two-p...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6973

    authors: Dorie V,Harada M,Carnegie NB,Hill J

    update_date: 2016-09-10 00:00:00

  • Multiple imputation for left-censored biomarker data based on Gibbs sampling method.

    abstract: Biomarkers, increasingly used in biomedical studies for the diagnosis and prognosis of acute and chronic diseases, provide insight into the effectiveness of treatments and potential pathways that can be used to guide future treatment targets. The measurement of these markers is often limited by the sensitivity of the ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4503

    authors: Lee M,Kong L,Weissfeld L

    update_date: 2012-07-30 00:00:00

  • Targeted maximum likelihood estimation for a binary treatment: A tutorial.

    abstract: When estimating the average effect of a binary treatment (or exposure) on an outcome, methods that incorporate propensity scores, the G-formula, or targeted maximum likelihood estimation (TMLE) are preferred over naïve regression approaches, which are biased under misspecification of a parametric outcome model. In con...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7628

    authors: Luque-Fernandez MA,Schomaker M,Rachet B,Schnitzer ME

    update_date: 2018-07-20 00:00:00

  • Joint modeling of repeated multivariate cognitive measures and competing risks of dementia and death: a latent process and latent class approach.

    abstract: Joint models initially dedicated to a single longitudinal marker and a single time-to-event need to be extended to account for the rich longitudinal data of cohort studies. Multiple causes of clinical progression are indeed usually observed, and multiple longitudinal markers are collected when the true latent trait of...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6731

    authors: Proust-Lima C,Dartigues JF,Jacqmin-Gadda H

    update_date: 2016-02-10 00:00:00

  • Dynamic Cox modelling based on fractional polynomials: time-variations in gastric cancer prognosis.

    abstract: The most popular model used for survival analysis is the proportional hazards regression model proposed by Cox. This is mainly due to its exceptional simplicity. Nevertheless the fundamental assumption of the Cox model is the proportionality of the hazards. For many applications, however, this assumption is doubtful. ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1411

    authors: Berger U,Schäfer J,Ulm K

    update_date: 2003-04-15 00:00:00

  • A regression model for multivariate random length data.

    abstract: Multivariate random length data occur when we observe multiple measurements of a quantitative variable and the variable number of these measurements is also an observed outcome for each experimental unit. For example, for a patient with coronary artery disease, we may observe a number of lesions in that patient's coro...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/(sici)1097-0258(19990130)18:2<199::aid-sim

    authors: Barnhart HX,Kosinski AS,Sampson AR

    update_date: 1999-01-30 00:00:00

  • Comparison of methods for imputing ordinal data using multivariate normal imputation: a case study of non-linear effects in a large cohort study.

    abstract: BACKGROUND: Multiple imputation is becoming increasingly popular for handling missing data, with Markov chain Monte Carlo assuming multivariate normality (MVN) a commonly used approach. Imputing categorical variables (which are clearly non-normal) using MVN imputation is challenging, and several approaches have been sug...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.5445

    authors: Lee KJ,Galati JC,Simpson JA,Carlin JB

    update_date: 2012-12-30 00:00:00

  • Establishing the relationship between nurse staffing and hospital mortality using a clustered discrete-time logistic model.

    abstract: Studies based on aggregated hospital outcome data have established that there is a relationship between nurse staffing and adverse events. However, this result could not be confirmed in Belgium where 96 per cent of the variability of nurse staffing levels over nursing units (belonging to different hospitals) is explai...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3756

    authors: Diya L,Lesaffre E,Van den Heede K,Sermeus W,Vleugels A

    update_date: 2010-03-30 00:00:00

  • Spatial clustering of the failure to geocode and its implications for the detection of disease clustering.

    abstract: Geocoding a study population as completely as possible is an important data assimilation component of many spatial epidemiologic studies. Unfortunately, complete geocoding is rare in practice. The failure of a substantial proportion of study subjects' addresses to geocode has consequences for spatial analyses, some of...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3288

    authors: Zimmerman DL,Fang X,Mazumdar S

    update_date: 2008-09-20 00:00:00

  • Redesign of trials under different enrollment mixes.

    abstract: A few large multi-centre male-only heart trials done in the 1970s and 1980s have been seen as ill-conceived because they did not include females. The purpose here is to revisit two of those trials and to consider consequences in terms of cost and power had they been designed to include females. ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/(sici)1097-0258(19990215)18:3<241::aid-sim

    authors: Meinert CL

    update_date: 1999-02-15 00:00:00

  • Bayesian methods for meta-analysis of causal relationships estimated using genetic instrumental variables.

    abstract: Genetic markers can be used as instrumental variables, in an analogous way to randomization in a clinical trial, to estimate the causal relationship between a phenotype and an outcome variable. Our purpose is to extend the existing methods for such Mendelian randomization studies to the context of multiple genetic mar...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3843

    authors: Burgess S,Thompson SG,CRP CHD Genetics Collaboration.,Burgess S,Thompson SG,Andrews G,Samani NJ,Hall A,Whincup P,Morris R,Lawlor DA,Davey Smith G,Timpson N,Ebrahim S,Ben-Shlomo Y,Davey Smith G,Timpson N,Brown M,Ricket

    update_date: 2010-05-30 00:00:00

  • A functional multiple imputation approach to incomplete longitudinal data.

    abstract: In designed longitudinal studies, information from the same set of subjects are collected repeatedly over time. The longitudinal measurements are often subject to missing data which impose an analytic challenge. We propose a functional multiple imputation approach modeling longitudinal response profiles as smooth curv...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4201

    authors: He Y,Yucel R,Raghunathan TE

    update_date: 2011-05-10 00:00:00

  • Incorporating longitudinal biomarkers for dynamic risk prediction in the era of big data: A pseudo-observation approach.

    abstract: Longitudinal biomarker data are often collected in studies, providing important information regarding the probability of an outcome of interest occurring at a future time. With many new and evolving technologies for biomarker discovery, the number of biomarker measurements available for analysis of disease progression...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.8687

    authors: Zhao L,Murray S,Mariani LH,Ju W

    update_date: 2020-11-20 00:00:00

  • Sample size calculation for stepped wedge and other longitudinal cluster randomised trials.

    abstract: The sample size required for a cluster randomised trial is inflated compared with an individually randomised trial because outcomes of participants from the same cluster are correlated. Sample size calculations for longitudinal cluster randomised trials (including stepped wedge trials) need to take account of at least...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7028

    authors: Hooper R,Teerenstra S,de Hoop E,Eldridge S

    update_date: 2016-11-20 00:00:00

  • Adaptive dose modification for phase I clinical trials.

    abstract: Most phase I dose-finding methods in oncology aim to find the maximum-tolerated dose from a set of prespecified doses. However, in practice, because of a lack of understanding of the true dose-toxicity relationship, it is likely that none of these prespecified doses are equal or reasonably close to the true maximum-to...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6933

    authors: Chu Y,Pan H,Yuan Y

    update_date: 2016-09-10 00:00:00

  • Using data from multiple studies to develop a child growth correlation matrix.

    abstract: In many countries, the monitoring of child growth does not occur in a regular manner, and instead, we may have to rely on sporadic observations that are subject to substantial measurement error. In these countries, it can be difficult to identify patterns of poor growth, and faltering children may miss out on essentia...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7696

    authors: Anderson C,Xiao L,Checkley W

    update_date: 2019-08-30 00:00:00

  • Measurement error in dietary assessment: an investigation using covariance structure models. Part II.

    abstract: In Part I we presented a covariance structure model for analysing measurement error in the assessment of nitrogen intake. In this paper we include data on urine nitrogen excretion which allows a critical assessment of the model proposed. Inclusion of urine nitrogen data produces more pessimistic estimates of the quali...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780121005

    authors: Plummer M,Clayton D

    update_date: 1993-05-30 00:00:00

  • A simple test for synergy for a small number of combinations.

    abstract: A method for detecting deviations from the Loewe additive drug combination reference model for in vitro drug combination experimentation is described. It is often difficult to fit a response surface model to drug combination data, especially in situations where the experimental design contains a sparse set of combinat...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.5905

    authors: Novick SJ

    update_date: 2013-12-20 00:00:00

  • Conflicts of interest in data monitoring of industry versus publicly financed clinical trials.

    abstract: The FDA Guidance, while highly appropriate for industry sponsored trials, need not be imposed on publicly (e.g. NIH) financed clinical trials. While the potential for conflicts of interest exist in the latter, they are in general manageable and pose an acceptable low risk of threatening the integrity of a study. Howev...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1787

    authors: Lachin JM

    update_date: 2004-05-30 00:00:00

  • Additive and multiplicative covariate regression models for relative survival incorporating fractional polynomials for time-dependent effects.

    abstract: Relative survival is used to estimate patient survival excluding causes of death not related to the disease of interest. Rather than using cause of death information from death certificates, which is often poorly recorded, relative survival compares the observed survival to that expected in a matched group from the ge...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.2399

    authors: Lambert PC,Smith LK,Jones DR,Botha JL

    update_date: 2005-12-30 00:00:00

  • Comparing and combining data across multiple sources via integration of paired-sample data to correct for measurement error.

    abstract: In biomedical research such as the development of vaccines for infectious diseases or cancer, study outcomes measured by an assay or device are often collected from multiple sources or laboratories. Measurement error that may vary between laboratories needs to be adjusted for when combining samples across data sources...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.5446

    authors: Huang Y,Huang Y,Moodie Z,Li S,Self S

    update_date: 2012-12-10 00:00:00

  • A nonparametric smoothing method for assessing GEE models with longitudinal binary data.

    abstract: Studies involving longitudinal binary responses are widely applied in the health and biomedical sciences research and frequently analyzed by generalized estimating equations (GEE) method. This article proposes an alternative goodness-of-fit test based on the nonparametric smoothing approach for assessing the adequacy ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3315

    authors: Lin KC,Chen YJ,Shyr Y

    update_date: 2008-09-30 00:00:00

  • Comparisons of risk prediction methods using nested case-control data.

    abstract: Using both simulated and real datasets, we compared two approaches for estimating absolute risk from nested case-control (NCC) data and demonstrated the feasibility of using the NCC design for estimating absolute risk. In contrast to previously published results, we successfully demonstrated not only that data from a ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7143

    authors: Salim A,Delcoigne B,Villaflores K,Koh WP,Yuan JM,van Dam RM,Reilly M

    update_date: 2017-02-10 00:00:00

  • Classification using ensemble learning under weighted misclassification loss.

    abstract: Binary classification rules based on covariates typically depend on simple loss functions such as zero-one misclassification. Some cases may require more complex loss functions. For example, individual-level monitoring of HIV-infected individuals on antiretroviral therapy requires periodic assessment of treatment fail...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.8082

    authors: Xu Y,Liu T,Daniels MJ,Kantor R,Mwangi A,Hogan JW

    update_date: 2019-05-20 00:00:00

  • Use of a density equalizing map projection in analysing childhood cancer in four California counties.

    abstract: In this study, 401 cases of childhood cancer in four California counties in 1980-1988 were analysed with the innovative methodology of density equalizing map projections. The data were originally collected and analysed by the California State Department of Health Services (DHS). In addition to the new analytic techniq...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.686

    authors: Merrill DW

    update_date: 2001-05-15 00:00:00

  • How should meta-regression analyses be undertaken and interpreted?

    abstract: Appropriate methods for meta-regression applied to a set of clinical trials, and the limitations and pitfalls in interpretation, are insufficiently recognized. Here we summarize recent research focusing on these issues, and consider three published examples of meta-regression in the light of this work. One principal m...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1187

    authors: Thompson SG,Higgins JP

    update_date: 2002-06-15 00:00:00

  • A Bayesian analysis of mixture structural equation models with non-ignorable missing responses and covariates.

    abstract: In behavioral, biomedical, and social-psychological sciences, it is common to encounter latent variables and heterogeneous data. Mixture structural equation models (SEMs) are very useful methods to analyze these kinds of data. Moreover, the presence of missing data, including both missing responses and missing covaria...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3915

    authors: Cai JH,Song XY,Hser YI

    update_date: 2010-08-15 00:00:00

  • Modelling the association between patient characteristics and the change over time in a disease measure using observational cohort data.

    abstract: In observational cohort studies we may wish to examine the associations between fixed patient characteristics and the longitudinal changes from baseline in a repeated outcome measure. Many biological and other outcome measures are known to be subject to measurement error and biological variation. In an initial analysi...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3725

    authors: Harrison L,Dunn DT,Green H,Copas AJ

    update_date: 2009-11-20 00:00:00