Automated time series forecasting for biosurveillance.

Abstract:

:For robust detection performance, traditional control chart monitoring for biosurveillance is based on input data free of trends, day-of-week effects, and other systematic behaviour. Time series forecasting methods may be used to remove this behaviour by subtracting forecasts from observations to form residuals for algorithmic input. We describe three forecast methods and compare their predictive accuracy on each of 16 authentic syndromic data streams. The methods are (1) a non-adaptive regression model using a long historical baseline, (2) an adaptive regression model with a shorter, sliding baseline, and (3) the Holt-Winters method for generalized exponential smoothing. Criteria for comparing the forecasts were the root-mean-square error, the median absolute per cent error (MedAPE), and the median absolute deviation. The median-based criteria showed best overall performance for the Holt-Winters method. The MedAPE measures over the 16 test series averaged 16.5, 11.6, and 9.7 for the non-adaptive regression, adaptive regression, and Holt-Winters methods, respectively. The non-adaptive regression forecasts were degraded by changes in the data behaviour in the fixed baseline period used to compute model coefficients. The mean-based criterion was less conclusive because of the effects of poor forecasts on a small number of calendar holidays. The Holt-Winters method was also most effective at removing serial autocorrelation, with most 1-day-lag autocorrelation coefficients below 0.15. The forecast methods were compared without tuning them to the behaviour of individual series. We achieved improved predictions with such tuning of the Holt-Winters method, but practical use of such improvements for routine surveillance will require reliable data classification methods.

journal_name

Stat Med

journal_title

Statistics in medicine

authors

Burkom HS,Murphy SP,Shmueli G

doi

10.1002/sim.2835

subject

Has Abstract

pub_date

2007-09-30 00:00:00

pages

4202-18

issue

22

eissn

0277-6715

issn

1097-0258

journal_volume

26

pub_type

杂志文章
  • Using marginal structural models to adjust for treatment drop-in when developing clinical prediction models.

    abstract::Clinical prediction models (CPMs) can inform decision making about treatment initiation, which requires predicted risks assuming no treatment is given. However, this is challenging since CPMs are usually derived using data sets where patients received treatment, often initiated postbaseline as "treatment drop-ins." Th...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.7913

    authors: Sperrin M,Martin GP,Pate A,Van Staa T,Peek N,Buchan I

    更新日期:2018-12-10 00:00:00

  • Corrections for exposure measurement error in logistic regression models with an application to nutritional data.

    abstract::Two correction methods are considered for multiple logistic regression models with some covariates measured with error. Both methods are based on approximating the complicated regression model between the response and the observed covariates with simpler models. The first model is the logistic approximation proposed b...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780131105

    authors: Kuha J

    更新日期:1994-06-15 00:00:00

  • Bias in the evaluation of DNA-amplification tests for detecting Chlamydia trachomatis.

    abstract::The purpose of this paper is to show that the sensitivity and specificity estimates obtained by 'discrepant analysis' are biased. Discrepant analysis is a widely used technique that attempts to provide estimates of sensitivity and specificity in the presence of an imperfect gold standard. Many researchers have applied...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(19970630)16:12<1391::aid-s

    authors: Hadgu A

    更新日期:1997-06-30 00:00:00

  • Identifying optimal risk windows for self-controlled case series studies of vaccine safety.

    abstract::In vaccine safety studies, subjects are considered at increased risk for adverse events for a period of time after vaccination known as risk window. To our knowledge, risk windows for vaccine safety studies have tended to be pre-defined and not to use information from the current study. Inaccurate specification of the...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4125

    authors: Xu S,Zhang L,Nelson JC,Zeng C,Mullooly J,McClure D,Glanz J

    更新日期:2011-03-30 00:00:00

  • An extension of the continual reassessment method using decision theory.

    abstract::The primary goal of a phase I trial is to find the maximally tolerated dose (MTD) of a treatment. The MTD is usually defined in terms of a tolerable probability, q(*), of toxicity. Our objective is to find the highest dose with toxicity risk that does not exceed q(*), a criterion that is often desired in designing pha...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.970

    authors: Leung DH,Wang YG

    更新日期:2002-01-15 00:00:00

  • REML and ML estimation for clustered grouped survival data.

    abstract::Clustered grouped survival data arise naturally in clinical medicine and biological research. For example, in a randomized clinical trial, the variable of interest is the time to occurrence of a certain event with or without a new treatment and the data are collected from possibly correlated subjects from independent ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1323

    authors: Lam KF,Ip D

    更新日期:2003-06-30 00:00:00

  • More powerful randomization-based p-values in double-blind trials with non-compliance.

    abstract::Standard randomization-based tests of sharp null hypotheses in randomized clinical trials, that is, intent-to-treat analyses, are valid without extraneous assumptions, but generally can be appropriately powerful only with alternative hypotheses that involve treatment assignment having an effect on outcome. In the cont...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(19980215)17:3<371::aid-sim

    authors: Rubin DB

    更新日期:1998-02-15 00:00:00

  • Spatial disease clusters: detection and inference.

    abstract::We present a new method of detection and inference for spatial clusters of a disease. To avoid ad hoc procedures to test for clustering, we have a clearly defined alternative hypothesis and our test statistic is based on the likelihood ratio. The proposed test can detect clusters of any size, located anywhere in the s...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780140809

    authors: Kulldorff M,Nagarwalla N

    更新日期:1995-04-30 00:00:00

  • Combining mortality and longitudinal measures in clinical trials.

    abstract::Clinical trials often assess therapeutic benefit on the basis of an event such as death or the diagnosis of disease. Usually, there are several additional longitudinal measures of clinical status which are collected to be used in the treatment comparison. This paper proposes a simple non-parametric test which combines...

    journal_title:Statistics in medicine

    pub_type: 临床试验,杂志文章,随机对照试验

    doi:10.1002/(sici)1097-0258(19990615)18:11<1341::aid-s

    authors: Finkelstein DM,Schoenfeld DA

    更新日期:1999-06-15 00:00:00

  • Estimates of disease incidence in women based on antenatal or neonatal seroprevalence data: HIV in New York City.

    abstract::Piecewise constant incidence models were developed to estimate the force of infection in women from age- and time-specific antenatal or neonatal seroprevalence data. Differential inclusion of infected women in sero-surveys compared to uninfected women was taken into account, with respect to both changes in inclusion r...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780131809

    authors: Ades AE,Medley GF

    更新日期:1994-09-30 00:00:00

  • SARS incubation and quarantine times: when is an exposed individual known to be disease free?

    abstract::The setting of a quarantine time for an emerging infectious disease will depend on current knowledge concerning incubation times. Methods for the analysis of information on incubation times are investigated with a particular focus on inference regarding a possible maximum incubation time, after which an exposed indivi...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2206

    authors: Farewell VT,Herzberg AM,James KW,Ho LM,Leung GM

    更新日期:2005-11-30 00:00:00

  • Some extensions and applications of a Bayesian strategy for monitoring multiple outcomes in clinical trials.

    abstract::We present some practical extensions and applications of a strategy proposed by Thall, Simon and Estey for designing and monitoring single-arm clinical trials with multiple outcomes. We show by application how the strategy may be applied to construct designs for phase IIA activity trials and phase II equivalence trial...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(19980730)17:14<1563::aid-s

    authors: Thall PF,Sung HG

    更新日期:1998-07-30 00:00:00

  • Positing, fitting, and selecting regression models for pooled biomarker data.

    abstract::Pooling biospecimens prior to performing lab assays can help reduce lab costs, preserve specimens, and reduce information loss when subject to a limit of detection. Because many biomarkers measured in epidemiological studies are positive and right-skewed, proper analysis of pooled specimens requires special methods. I...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6496

    authors: Mitchell EM,Lyles RH,Schisterman EF

    更新日期:2015-07-30 00:00:00

  • A functional-model-adjusted spatial scan statistic.

    abstract::This paper introduces a new spatial scan statistic designed to adjust cluster detection for longitudinal confounding factors indexed in space. The functional-model-adjusted statistic was developed using generalized functional linear models in which longitudinal confounding factors were considered to be functional cova...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8459

    authors: Ahmed MS,Genin M

    更新日期:2020-04-15 00:00:00

  • A Bayesian approach estimating treatment effects on biomarkers containing zeros with detection limits.

    abstract::Often in randomized clinical trials and observational studies in occupational and environmental health, a non-negative continuously distributed response variable denoting some metabolites of environmental toxicants is measured in treatment and control groups. When observations occur in both unexposed and exposed subje...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3170

    authors: Chu H,Nie L,Kensler TW

    更新日期:2008-06-15 00:00:00

  • Posterior predictive model checks for disease mapping models.

    abstract::Disease incidence or disease mortality rates for small areas are often displayed on maps. Maps of raw rates, disease counts divided by the total population at risk, have been criticized as unreliable due to non-constant variance associated with heterogeneity in base population size. This has led to the use of model-ba...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/1097-0258(20000915/30)19:17/18<2377::aid-s

    authors: Stern HS,Cressie N

    更新日期:2000-09-15 00:00:00

  • Methodological considerations on the design and analysis of an equivalence stratified cluster randomization trial.

    abstract::The World Health Organization and collaborating institutions in four developing countries have conducted a multi-centre randomized controlled trial, in which clinics were allocated at random to two antenatal care (ANC) models. These were the standard 'Western' ANC model and a 'new' ANC model consisting of tests, clini...

    journal_title:Statistics in medicine

    pub_type: 临床试验,杂志文章,随机对照试验

    doi:10.1002/1097-0258(20010215)20:3<401::aid-sim801>3.

    authors: Piaggio G,Carroli G,Villar J,Pinol A,Bakketeig L,Lumbiganon P,Bergsjø P,Al-Mazrou Y,Ba'aqeel H,Belizán JM,Farnot U,Berendes H,WHO Antenatal Care Trial Research Group.

    更新日期:2001-02-15 00:00:00

  • Multilevel time series models with applications to repeated measures data.

    abstract::The analysis of repeated measures data can be conducted efficiently using a two-level random coefficients model. A standard assumption is that the within-individual (level 1) residuals are uncorrelated. In some cases, especially where measurements are made close together in time, this may not be reasonable and this ad...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780131605

    authors: Goldstein H,Healy MJ,Rasbash J

    更新日期:1994-08-30 00:00:00

  • A missing composite covariate in survival analysis: a case study of the Chinese Longitudinal Health and Longevity Survey.

    abstract::We estimate a Cox proportional hazards model where one of the covariates measures the level of a subject's cognitive functioning by grading the total score obtained by the subject on the items of a questionnaire. A case study is presented where the sample includes partial respondents, who did not answer some questionn...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3773

    authors: Lagona F,Zhang Z

    更新日期:2010-01-30 00:00:00

  • Assessing neural activity related to decision-making through flexible odds ratio curves and their derivatives.

    abstract::It is well established that neural activity is stochastically modulated over time. Therefore, direct comparisons across experimental conditions and determination of change points or maximum firing rates are not straightforward. This study sought to compare temporal firing probability curves that may vary across groups...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4220

    authors: Roca-Pardiñas J,Cadarso-Suárez C,Pardo-Vazquez JL,Leboran V,Molenberghs G,Faes C,Acuña C

    更新日期:2011-06-30 00:00:00

  • Testing whether genetic variation explains correlation of quantitative measures of gene expression, and application to genetic network analysis.

    abstract::Genetic networks for gene expression data are often built by graphical models, which in turn are built from pair-wise correlations of gene expression levels. A key feature of building graphical models is the evaluation of conditional independence of two traits, given other traits. When conditional independence can be ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3274

    authors: Yu Z,Wang L,Hildebrandt MA,Schaid DJ

    更新日期:2008-08-30 00:00:00

  • Estimating the stage-specific numbers of HIV infection using a Markov model and back-calculation.

    abstract::The back-calculation method has been used to estimate the number of HIV infections from AIDS incidence data in a particular population. We present an extension of back calculation that provides estimates of the numbers of HIV infectives in different stages of infection. We model the staging process with a time-depende...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780110612

    authors: Longini IM Jr,Byers RH,Hessol NA,Tan WY

    更新日期:1992-04-01 00:00:00

  • Analysis of mortality rates via marginal extended quasi-likelihood.

    abstract::We use a mixed Poisson regression model with extra variation to analyse mortality data cross-classified by age and geographic region. We use estimates of dispersion parameter and fixed effects parameters, obtained by maximizing a marginal quasi-likelihood function, to estimate mortality rates in an empirical Bayes man...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(SICI)1097-0258(19960715)15:13<1397::AID-S

    authors: Lu WS,Tsutakawa RK

    更新日期:1996-07-15 00:00:00

  • Methods for analysing county-level mortality rates.

    abstract::The identification of counties burdened by exceptionally high rates of mortality is a fundamental step in the development of state-based intervention and prevention strategies. However, the estimation of rates from small geographic areas presents special problems, especially for rare events. This paper compares the us...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780120320

    authors: Stevenson JM,Olson DR

    更新日期:1993-02-01 00:00:00

  • Changes in clinical trials mandated by the advent of meta-analysis.

    abstract::Service on the Data Monitoring Committee of the CPEP (Calcium for Pre-eclampsia Prevention) has led us to four conclusions about clinical trials which we should like to present to this gathering of biostatisticians for their reactions: (i) meta-analyses of the pertinent published trials of the same therapy should alwa...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(SICI)1097-0258(19960630)15:12<1263::AID-S

    authors: Chalmers TC,Lau J

    更新日期:1996-06-30 00:00:00

  • Analysis of in vitro fertilization data with multiple outcomes using discrete time-to-event analysis.

    abstract::In vitro fertilization (IVF) is an increasingly common method of assisted reproductive technology. Because of the careful observation and follow-up required as part of the procedure, IVF studies provide an ideal opportunity to identify and assess clinical and demographic factors along with environmental exposures that...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6050

    authors: Maity A,Williams PL,Ryan L,Missmer SA,Coull BA,Hauser R

    更新日期:2014-05-10 00:00:00

  • Proportional hazards models and age-period-cohort analysis of cancer rates.

    abstract::Age-period-cohort (APC) analysis is widely used in cancer epidemiology to model trends in cancer rates. We develop methods for comparative APC analysis of two independent cause-specific hazard rates assuming that an APC model holds for each one. We construct linear hypothesis tests to determine whether the two hazards...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3865

    authors: Rosenberg PS,Anderson WF

    更新日期:2010-05-20 00:00:00

  • Group sequential designs for clinical trials with bivariate endpoints.

    abstract::Although all clinical trials are designed and monitored using more than one endpoint, methods are needed to assure that decision criteria are chosen to reflect the clinically relevant tradeoffs that assure the trial's scientific integrity. This article presents a framework for the design and monitoring clinical trials...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8696

    authors: Hu J,Blatchford PJ,Goldenberg NA,Kittelson JM

    更新日期:2020-11-20 00:00:00

  • Evidence-based medicine as Bayesian decision-making.

    abstract::We review two recent trends: the emergence of evidence-based medicine and the growing use of Bayesian statistics in medical applications. Evidence-based medicine requires an integrated assessment of the available evidence, and associated uncertainty, but there is also an emphasis on decision-making, for individual pat...

    journal_title:Statistics in medicine

    pub_type: 杂志文章,评审

    doi:10.1002/1097-0258(20001215)19:23<3291::aid-sim627>

    authors: Ashby D,Smith AF

    更新日期:2000-12-15 00:00:00

  • Integrating multiple-domain rules for disease classification.

    abstract::In psychiatry, clinicians use criteria sets from the Diagnostic and Statistical Manual of Mental Disorders to diagnose mental disorders. Most criteria sets have several symptom domains, and in order to be diagnosed, an individual must meet the minimum number of symptoms required by each domain. Some efforts are now fo...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8173

    authors: Mauro C,Shear MK,Wang Y

    更新日期:2019-07-20 00:00:00