Smoothing across time in repeated cross-sectional data.

Abstract:

:Repeated cross-sectional samples are common in national surveys of health like the National Health Interview Survey (NHIS). Because population health outcomes generally evolve slowly, pooling data across years can improve the precision of current-year annual estimates of disease prevalence and other health outcomes. Pooling over time is particularly valuable in health disparities research, where outcomes for small groups are often of interest and pooling data across groups would bias disparity estimates. State-space modeling and Kalman filtering are appealing choices for smoothing data across time. However, filtering can be problematic when few time points are available, as is common with annual cross-sectional data. Problems arise because filtering relies on estimated variance components, which can be biased and imprecise when estimated with small samples, especially when estimated in tandem with linear trends. We conduct a simulation study showing that even when trends and variance components are estimated poorly, smoothing with these estimates can improve the mean squared error (MSE) of estimated health states for multiple racial/ethnic groups when the variance components are estimated with the pooled sample. We consider frequentist estimators with no trends, one common trend across groups, and separate trends for every group, as well as shrinkage estimators of trends through a Bayesian model. We show that the Bayesian model offers the greatest improvement in MSE, and that Bayesian Information Criterion (BIC)-based model averaging of the frequentist estimators with different trend assumptions performs nearly as well. We present empirical examples using the NHIS data.

journal_name

Stat Med

journal_title

Statistics in medicine

authors

Lockwood JR,McCaffrey DF,Setodji CM,Elliott MN

doi

10.1002/sim.3897

subject

Has Abstract

pub_date

2011-02-28 00:00:00

pages

584-94

issue

5

eissn

0277-6715

issn

1097-0258

journal_volume

30

pub_type

杂志文章
  • Two-stage residual inclusion for survival data and competing risks-An instrumental variable approach with application to SEER-Medicare linked data.

    abstract::Instrumental variable is an essential tool for addressing unmeasured confounding in observational studies. Two-stage predictor substitution (2SPS) estimator and two-stage residual inclusion (2SRI) are two commonly used approaches in applying instrumental variables. Recently, 2SPS was studied under the additive hazards...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8071

    authors: Ying A,Xu R,Murphy J

    更新日期:2019-05-10 00:00:00

  • Use of max and min scores for trend tests for association when the genetic model is unknown.

    abstract::In case-control studies, the Cochran-Armitage (CA) trend test is powerful for detection of an association between a risk allele and a marker. To apply this test, a score should be assigned to the genotypes based on the genetic model. When the underlying genetic model is unknown, the trend test statistic is a function ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1474

    authors: Zheng G

    更新日期:2003-08-30 00:00:00

  • Mixtures of proportional hazards regression models.

    abstract::This paper presents a mixture model which combines features of the usual Cox proportional hazards model with those of a class of models, known as mixtures-of-experts. The resulting model is more flexible than the usual Cox model in the sense that the log hazard ratio is allowed to vary non-linearly as a function of th...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(19990515)18:9<1119::aid-si

    authors: Rosen O,Tanner M

    更新日期:1999-05-15 00:00:00

  • Proportion cured and mean log survival time as functions of tumour size.

    abstract::We obtained maximum likelihood estimates (MLEs) of the proportion cured pi c and mean log survival time mu t for a sample of 4355 patients with intraocular melanoma whose survival times subsequent to treatment were assumed to follow a log-normal distribution. Following stratification by tumour size, MLEs of pi c and m...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780090814

    authors: Gamel JW,McLean IW,Rosenberg SH

    更新日期:1990-08-01 00:00:00

  • Elasticity as a measure for online determination of remission points in ongoing epidemics.

    abstract::The correct identification of change-points during ongoing outbreak investigations of infectious diseases is a matter of paramount importance in epidemiology, with major implications for the management of health care resources, public health and, as the COVID-19 pandemic has shown, social live. Onsets, peaks, and infl...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8807

    authors: Veres-Ferrer EJ,Pavía JM

    更新日期:2021-02-20 00:00:00

  • Multiple outputation for the analysis of longitudinal data subject to irregular observation.

    abstract::Observational cohort studies often feature longitudinal data subject to irregular observation. Moreover, the timings of observations may be associated with the underlying disease process and must thus be accounted for when analysing the data. This paper suggests that multiple outputation, which consists of repeatedly ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6829

    authors: Pullenayegum EM

    更新日期:2016-05-20 00:00:00

  • Explaining community-level variance in group randomized trials.

    abstract::Between-community variance or community-by-time variance is one of the key factors driving the cost of conducting group randomized trials, which are often very expensive. We investigated empirically whether between-community variance could be reduced by controlling individual- and/or community-level covariates and ide...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(19990315)18:5<539::aid-sim

    authors: Feng Z,Diehr P,Yasui Y,Evans B,Beresford S,Koepsell TD

    更新日期:1999-03-15 00:00:00

  • Sam Greenhouse's years at the Census Bureau and the UNRRA.

    abstract::Sam Greenhouse joined the Census Bureau as a clerk at an interesting time period for the agency. The first use of sampling in the decennial census occurred in 1940. There was a major expansion of the amount of data collected. The organization of the Census Bureau underwent radical changes, including the growth of the ...

    journal_title:Statistics in medicine

    pub_type: 传,历史文章,杂志文章

    doi:10.1002/sim.1627

    authors: Keller J,Clark CZ

    更新日期:2003-11-15 00:00:00

  • Estimation of time-shift models with application to survival calibration in health technology assessment.

    abstract::The incremental life expectancy, defined as the difference in mean survival times between two treatment groups, is a crucial quantity of interest in cost-effectiveness analyses. Usually, this quantity is very difficult to estimate from censored survival data with a limited follow-up period. The paper develops estimati...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6951

    authors: Titman AC

    更新日期:2016-09-10 00:00:00

  • Measuring spatial effects in time to event data: a case study using months from angiography to coronary artery bypass graft (CABG).

    abstract::The application of Bayesian hierarchical models to measure spatial effects in time to event data has not been widely reported. This case study aims to estimate the effect of area of residence on waiting times to coronary artery bypass graft (CABG) and to assess the role of important individual specific covariates (age...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1535

    authors: Crook AM,Knorr-Held L,Hemingway H

    更新日期:2003-09-30 00:00:00

  • Permutation tests for joinpoint regression with applications to cancer rates.

    abstract::The identification of changes in the recent trend is an important issue in the analysis of cancer mortality and incidence data. We apply a joinpoint regression model to describe such continuous changes and use the grid-search method to fit the regression function with unknown joinpoints assuming constant variance and ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(20000215)19:3<335::aid-sim

    authors: Kim HJ,Fay MP,Feuer EJ,Midthune DN

    更新日期:2000-02-15 00:00:00

  • Sample sizes for constructing confidence intervals and testing hypotheses.

    abstract::Although estimation and confidence intervals have become popular alternatives to hypothesis testing and p-values, statisticians usually determine sample sizes for randomized clinical trials by controlling the power of a statistical test at an appropriate alternative, even those statisticians who recommend the use of c...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780080705

    authors: Bristol DR

    更新日期:1989-07-01 00:00:00

  • Predicting analysis times in randomized clinical trials.

    abstract::Randomized clinical trial designs commonly include one or more planned interim analyses. At these times an external monitoring committee reviews the accumulated data and determines whether it is scientifically and ethically appropriate for the study to continue. With failure-time endpoints, it is common to schedule an...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.843

    authors: Bagiella E,Heitjan DF

    更新日期:2001-07-30 00:00:00

  • Competing approaches to analysis of failure times with competing risks.

    abstract::For the analysis of time to event data in contraceptive studies when individuals are subject to competing causes for discontinuation, some authors have recently advocated the use of the cumulative incidence rate as a more appropriate measure to summarize data than the complement of the Kaplan-Meier estimate of discont...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1135

    authors: Farley TM,Ali MM,Slaymaker E

    更新日期:2001-12-15 00:00:00

  • Semi-parametric modelling for costs of health care technologies.

    abstract::Cost data that arise in the evaluation of health care technologies usually exhibit highly skew, heavy-tailed and, possibly, multi-modal distributions. Distribution-free methods for analysing these data, such as the bootstrap, or those based on the asymptotic normality of sample means, may often lead to inefficient or ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2012

    authors: Conigliani C,Tancredi A

    更新日期:2005-10-30 00:00:00

  • Methodological pitfalls in the analysis of contraceptive failure.

    abstract::Although the literature on contraceptive failure is vast and is expanding rapidly, our understanding of the relative efficacy of methods is quite limited because of defects in the research design and in the analytical tools used by investigators. Errors in the literature range from simple arithmetical mistakes to outr...

    journal_title:Statistics in medicine

    pub_type: 杂志文章,评审

    doi:10.1002/sim.4780100206

    authors: Trussell J

    更新日期:1991-02-01 00:00:00

  • The analysis of contingency tables with ordinal data: an application to monitoring antibiotic resistance.

    abstract::Rationalization of antibiotic therapy in the management of infectious diseases is helped by a knowledge of the patterns of sensitivity and resistance of bacteria to antibiotics and their possible changes both in time and from one hospital unit to another. In this paper we present the results regarding the sensitivitie...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2447

    authors: Bonetto C,Giannerini S,Giovagnoli A

    更新日期:2006-10-30 00:00:00

  • Causal inference in paired two-arm experimental studies under noncompliance with application to prognosis of myocardial infarction.

    abstract::Motivated by a study about prompt coronary angiography in myocardial infarction, we propose a method to estimate the causal effect of a treatment in two-arm experimental studies with possible noncompliance in both treatment and control arms. We base the method on a causal model for repeated binary outcomes (before and...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.5856

    authors: Bartolucci F,Farcomeni A

    更新日期:2013-11-10 00:00:00

  • Joint analysis of mixed types of outcomes with latent variables.

    abstract::We propose a joint modeling approach to investigating the observed and latent risk factors of mixed types of outcomes. The proposed model comprises three parts. The first part is an exploratory factor analysis model that summarizes latent factors through multiple observed variables. The second part is a proportional h...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8840

    authors: Pan D,Wei Y,Song X

    更新日期:2020-12-09 00:00:00

  • A flexible, interpretable framework for assessing sensitivity to unmeasured confounding.

    abstract::When estimating causal effects, unmeasured confounding and model misspecification are both potential sources of bias. We propose a method to simultaneously address both issues in the form of a semi-parametric sensitivity analysis. In particular, our approach incorporates Bayesian Additive Regression Trees into a two-p...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6973

    authors: Dorie V,Harada M,Carnegie NB,Hill J

    更新日期:2016-09-10 00:00:00

  • Analyzing sequentially randomized trials based on causal effect models for realistic individualized treatment rules.

    abstract::In this paper, we argue that causal effect models for realistic individualized treatment rules represent an attractive tool for analyzing sequentially randomized trials. Unlike a number of methods proposed previously, this approach does not rely on the assumption that intermediate outcomes are discrete or that models ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3268

    authors: Bembom O,van der Laan MJ

    更新日期:2008-08-30 00:00:00

  • Estimation of ROC curve with complex survey data.

    abstract::The receiver operating characteristic (ROC) curve can be utilized to evaluate the performance of diagnostic tests. The area under the ROC curve (AUC) is a widely used summary index for comparing multiple ROC curves. Both parametric and nonparametric methods have been developed to estimate and compare the AUCs. However...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6405

    authors: Yao W,Li Z,Graubard BI

    更新日期:2015-04-15 00:00:00

  • The effect of salvage therapy on survival in a longitudinal study with treatment by indication.

    abstract::We consider using observational data to estimate the effect of a treatment on disease recurrence, when the decision to initiate treatment is based on longitudinal factors associated with the risk of recurrence. The effect of salvage androgen deprivation therapy (SADT) on the risk of recurrence of prostate cancer is in...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4017

    authors: Kennedy EH,Taylor JM,Schaubel DE,Williams S

    更新日期:2010-11-10 00:00:00

  • Comparison of tests for categorical data from a stratified cluster randomized trial.

    abstract::Two features commonly exhibited by randomized trials of health promotion interventions are cluster randomization and stratification. Ignoring correlations between individuals within clusters can lead to an inflated type I error rate and hence a P-value which overstates the significance of the result. This paper compar...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1256

    authors: Dobbins TA,Simpson JM

    更新日期:2002-12-30 00:00:00

  • Estimation of the population effectiveness of vaccination.

    abstract::This paper presents a simple method for estimation of population vaccination effectiveness, which is the fraction of disease cases prevented by a vaccination programme. The method is based on the susceptible-infectious-recovered (SIR) model for the spread of an epidemic in a heterogeneous population under non-homogene...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(19970330)16:6<601::aid-sim

    authors: Haber M

    更新日期:1997-03-30 00:00:00

  • Weighted hurdle regression method for joint modeling of cardiovascular events likelihood and rate in the US dialysis population.

    abstract::We propose a new weighted hurdle regression method for modeling count data, with particular interest in modeling cardiovascular events in patients on dialysis. Cardiovascular disease remains one of the leading causes of hospitalization and death in this population. Our aim is to jointly model the relationship/associat...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6232

    authors: Sentürk D,Dalrymple LS,Mu Y,Nguyen DV

    更新日期:2014-11-10 00:00:00

  • Testing conditional independence in sets of I × J tables by means of moment and correlation score tests with application to HPV vaccine.

    abstract::A new testing approach is described for improving statistical tests of independence in sets of tables stratified on one or more relevant factors in case of categorical (nominal or ordinal) variables. Common tests of independence that exploit the ordinality of one of the variables use a restricted-alternative approach....

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.7006

    authors: Iannario M,Lang JB

    更新日期:2016-11-10 00:00:00

  • Assessing neural activity related to decision-making through flexible odds ratio curves and their derivatives.

    abstract::It is well established that neural activity is stochastically modulated over time. Therefore, direct comparisons across experimental conditions and determination of change points or maximum firing rates are not straightforward. This study sought to compare temporal firing probability curves that may vary across groups...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4220

    authors: Roca-Pardiñas J,Cadarso-Suárez C,Pardo-Vazquez JL,Leboran V,Molenberghs G,Faes C,Acuña C

    更新日期:2011-06-30 00:00:00

  • Reclassification of predictions for uncovering subgroup specific improvement.

    abstract::Risk prediction models play an important role in prevention and treatment of several diseases. Models that are in clinical use are often refined and improved. In many instances, the most efficient way to improve a successful model is to identify subgroups for which there is a specific biological rationale for improvem...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6077

    authors: Biswas S,Arun B,Parmigiani G

    更新日期:2014-05-20 00:00:00

  • Nonparametric comparison of two survival functions with dependent censoring via nonparametric multiple imputation.

    abstract::When the event time of interest depends on the censoring time, conventional two-sample test methods, such as the log-rank and Wilcoxon tests, can produce an invalid test result. We extend our previous work on estimation using auxiliary variables to adjust for dependent censoring via multiple imputation, to the compari...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3480

    authors: Hsu CH,Taylor JM

    更新日期:2009-02-01 00:00:00