Software for tabular data protection.

Abstract:

For national statistical offices to maintain the public trust they need to collect data and publish statistics of importance to society and decision-making, respondents (persons or establishments) must be guaranteed privacy and confidentiality in return for providing the requested confidential data. Consequently, for most survey and census data, disclosure limitation techniques must be applied before the data are ready for public release. For microdata, respondents can be identified directly, by extracting identifying information from microdata files, or indirectly, by matching a given file with an external file. For tabular data, respondents may be identified directly from small cell counts, or respondent contributions to heavily concentrated cells of magnitude data may be closely approximated by the cell value. Indirect disclosure is possible in tables through manipulation of additive tabular relationships between cell values and totals, e.g. manipulating row and column totals in a two-dimensional table. Two-dimensional statistical tables are a staple of official statistics. This paper describes a desktop software system that, for the first time, implements within a single framework four standard disclosure limitation techniques for protecting tabular data in two-dimensional tables (complementary cell suppression, minimum-distance controlled rounding, unbiased controlled rounding, and controlled rounding subject to subtotal constraints) together with a fifth, new method, controlled tabular adjustment. The paper also summarizes the five methods.
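
The indirect disclosure risk the abstract mentions, and the reason complementary cell suppression exists, can be illustrated with a small sketch. It is not taken from the software described in the paper. The table values and the function name `recover_single_suppressions` are hypothetical and chosen only for illustration.

```python
# Hypothetical 2x3 table of counts with published row and column totals.
# None marks a cell withheld by primary suppression.
table = [
    [12, None, 30],
    [25, 18, 7],
]
row_totals = [45, 50]
col_totals = [37, 21, 37]


def recover_single_suppressions(table, row_totals, col_totals):
    """Fill in any suppressed cell that is the only unknown in its row or column."""
    cells = [row[:] for row in table]  # work on a copy
    changed = True
    while changed:
        changed = False
        # A row with exactly one unknown: subtract the known cells from the row total.
        for i, row in enumerate(cells):
            missing = [j for j, v in enumerate(row) if v is None]
            if len(missing) == 1:
                j = missing[0]
                cells[i][j] = row_totals[i] - sum(v for v in row if v is not None)
                changed = True
        # A column with exactly one unknown: same argument with the column total.
        for j in range(len(col_totals)):
            column = [cells[i][j] for i in range(len(cells))]
            missing = [i for i, v in enumerate(column) if v is None]
            if len(missing) == 1:
                i = missing[0]
                cells[i][j] = col_totals[j] - sum(v for v in column if v is not None)
                changed = True
    return cells


print(recover_single_suppressions(table, row_totals, col_totals))
# prints [[12, 3, 30], [25, 18, 7]]
```

Suppressing additional (complementary) cells so that every affected row and column contains at least two unknowns defeats this simple subtraction attack. For example, `[[12, None, None], [25, None, None]]` is returned unchanged. This check is only a first pass: the additive relationships may still let an attacker bound the suppressed values within an interval, which a full audit would need to examine.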

journal_name

Stat Med

journal_title

Statistics in medicine

authors

Gonzalez JF Jr,Cox LH

doi

10.1002/sim.2043

pub_date

2005-02-28 00:00:00

pages

659-69

issue

4

eissn

1097-0258

issn

0277-6715

journal_volume

24

pub_type

Journal Article
  • Statistical methods for active extension trials.

    abstract::This paper develops methods of analysis for active extension clinical trials. Under this design, patients are randomized to treatment or placebo for a period of time (period 1), and then all patients receive treatment for an additional period of time (period 2). We assume a continuous outcome is measured at baseline a...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.2720

    authors: Hu Z,Follmann D

    update_date:2007-05-30 00:00:00

  • Automated time series forecasting for biosurveillance.

    abstract::For robust detection performance, traditional control chart monitoring for biosurveillance is based on input data free of trends, day-of-week effects, and other systematic behaviour. Time series forecasting methods may be used to remove this behaviour by subtracting forecasts from observations to form residuals for al...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.2835

    authors: Burkom HS,Murphy SP,Shmueli G

    update_date:2007-09-30 00:00:00

  • Generalized pairwise comparison methods to analyze (non)prioritized composite endpoints.

    abstract::In the analysis of composite endpoints in a clinical trial, time to first event analysis techniques such as the logrank test and Cox proportional hazard test do not take into account the multiplicity, importance, and the severity of events in the composite endpoint. Several generalized pairwise comparison analysis met...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.8388

    authors: Verbeeck J,Spitzer E,de Vries T,van Es GA,Anderson WN,Van Mieghem NM,Leon MB,Molenberghs G,Tijssen J

    update_date:2019-12-30 00:00:00

  • Circular-circular regression model with a spike at zero.

    abstract::With reference to a real data on cataract surgery, we discuss the problem of zero-inflated circular-circular regression when both covariate and response are circular random variables and a large proportion of the responses are zeros. The regression model is proposed, and the estimation procedure for the parameters is ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7496

    authors: Jha J,Biswas A

    update_date:2018-01-15 00:00:00

  • Beta-binomial/Poisson regression models for repeated bivariate counts.

    abstract::We analyze data obtained from a study designed to evaluate training effects on the performance of certain motor activities of Parkinson's disease patients. Maximum likelihood methods were used to fit beta-binomial/Poisson regression models tailored to evaluate the effects of training on the numbers of attempted and su...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3303

    authors: Lora MI,Singer JM

    update_date:2008-07-30 00:00:00

  • A meta-analysis of clinical trials involving different classifications of response into ordered categories.

    abstract::Statistical methods are available for performing a meta-analysis when the response variable of interest is the same in each study. Problems arise when studies exploring a common therapeutic question use different patient response types. This article presents statistical methods for combining studies which involve diff...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780132313

    authors: Whitehead A,Jones NM

    update_date:1994-12-15 00:00:00

  • A comparison of group sequential methods for binary longitudinal data.

    abstract::Interim analyses are conducted to allow for early termination of the trial, for ethical as well as economical reasons. Here we consider interim analyses in repeated measurements studies where the measurements are binary. Two methods for analysing this kind of data are compared according to their operating characterist...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1361

    authors: Spiessens B,Lesaffre E,Verbeke G

    update_date:2003-02-28 00:00:00

  • A functional-model-adjusted spatial scan statistic.

    abstract::This paper introduces a new spatial scan statistic designed to adjust cluster detection for longitudinal confounding factors indexed in space. The functional-model-adjusted statistic was developed using generalized functional linear models in which longitudinal confounding factors were considered to be functional cova...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.8459

    authors: Ahmed MS,Genin M

    update_date:2020-04-15 00:00:00

  • Estimation of the wild-type minimum inhibitory concentration value distribution.

    abstract::Antimicrobial resistance has become one of the main public health burdens of the last decades, and monitoring the development and spread of non-wild-type isolates has therefore gained increased interest. Monitoring is performed based on the minimum inhibitory concentration (MIC) values, which are collected through the...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.5939

    authors: Jaspers S,Aerts M,Verbeke G,Beloeil PA

    update_date:2014-01-30 00:00:00

  • On the use of the generalized t and generalized rank-sum statistics in medical research.

    abstract::We have used Monte Carlo methods to compare the type I error properties of the conditional and unconditional versions of the generalized t and the generalized rank-sum tests to those of the independent samples t and Wilcoxon rank-sum tests. Results showed inflated type I errors for the conditional generalized tests bu...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780110410

    authors: Blair RC,Morel JG

    update_date:1992-02-28 00:00:00

  • Incorporating longitudinal biomarkers for dynamic risk prediction in the era of big data: A pseudo-observation approach.

    abstract::Longitudinal biomarker data are often collected in studies, providing important information regarding the probability of an outcome of interest occurring at a future time. With many new and evolving technologies for biomarker discovery, the number of biomarker measurements available for analysis of disease progression...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.8687

    authors: Zhao L,Murray S,Mariani LH,Ju W

    update_date:2020-11-20 00:00:00

  • Methods for proper handling of overrunning and underrunning in phase II designs for oncology trials.

    abstract::Phase II studies in oncology are frequently conducted as two-stage single-arm trials with a binary endpoint indicating tumor response. As a common feature of these designs, the sample sizes of the two stages and the decision rules for the interim and the final analysis have to be pre-specified and adhered to strictly ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6479

    authors: Englert S,Kieser M

    update_date:2015-06-15 00:00:00

  • Identifying optimal risk windows for self-controlled case series studies of vaccine safety.

    abstract::In vaccine safety studies, subjects are considered at increased risk for adverse events for a period of time after vaccination known as risk window. To our knowledge, risk windows for vaccine safety studies have tended to be pre-defined and not to use information from the current study. Inaccurate specification of the...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4125

    authors: Xu S,Zhang L,Nelson JC,Zeng C,Mullooly J,McClure D,Glanz J

    update_date:2011-03-30 00:00:00

  • Comparisons of the performance of different statistical tests for time-to-event analysis with confounding factors: practical illustrations in kidney transplantation.

    abstract::Confounding factors are commonly encountered in observational studies. Several confounder-adjusted tests to compare survival between differently exposed subjects were proposed. However, only few studies have compared their performances regarding type I error rates, and no study exists evaluating their type II error ra...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6777

    authors: Le Borgne F,Giraudeau B,Querard AH,Giral M,Foucher Y

    update_date:2016-03-30 00:00:00

  • Methods for dose finding studies in cancer clinical trials: a review and results of a Monte Carlo study.

    abstract::We discuss some of the statistical approaches to the design and analysis of phase I clinical trials in cancer. An attempt is made to identify the issues, particular to this type of trial, that should be addressed by an appropriate methodology. A brief review of schemes currently in use is provided together with our vi...

    journal_title:Statistics in medicine

    pub_type: Journal Article, Review

    doi:10.1002/sim.4780101104

    authors: O'Quigley J,Chevret S

    update_date:1991-11-01 00:00:00

  • Adjusting for misclassification in a stratified biomarker clinical trial.

    abstract::Clinical trials utilizing predictive biomarkers have become a research focus in personalized medicine. We investigate the effects of biomarker misclassification on the design and analysis of stratified biomarker clinical trials. For a variety of inference problems including marker-treatment interaction in particular, ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6164

    authors: Liu C,Liu A,Hu J,Yuan V,Halabi S

    update_date:2014-08-15 00:00:00

  • Competing risks analysis of patients with osteosarcoma: a comparison of four different approaches.

    abstract::In failure time studies involving a chronic disease such as cancer, several competing causes of mortality may be operating. Commonly, the conventional statistical technique of Kaplan-Meier, which is only meaningfully interpreted by assuming independence of failure types and the censoring mechanism, is employed in clin...

    journal_title:Statistics in medicine

    pub_type: Clinical Trial, Journal Article, Multicenter Study, Randomized Controlled Trial

    doi:10.1002/sim.711

    authors: Tai BC,Machin D,White I,Gebski V,EOI (The European Osteosarcoma Intergroup).

    update_date:2001-03-15 00:00:00

  • A new permutation-based method for assessing agreement between two observers making replicated quantitative readings.

    abstract::The coefficient of individual equivalence is a permutation-based measure of agreement between two observers making replicated readings on each subject. It compares the observed disagreement between the observers to the expected disagreement under individual equivalence. Individual equivalence of observers requires tha...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.5323

    authors: Pan Y,Haber M,Gao J,Barnhart HX

    update_date:2012-09-10 00:00:00

  • The effect of non-differential outcome misclassification on estimates of the attributable and prevented fraction.

    abstract::This paper considers the effect of non-differential outcome misclassification on the population attributable fraction and the population prevented fraction. I examine the bias in the attributable and the prevented fraction derived from a risk ratio estimate as a function of the sensitivity and specificity of the outco...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780100308

    authors: Hsieh CC

    update_date:1991-03-01 00:00:00

  • Proportional hazards models and age-period-cohort analysis of cancer rates.

    abstract::Age-period-cohort (APC) analysis is widely used in cancer epidemiology to model trends in cancer rates. We develop methods for comparative APC analysis of two independent cause-specific hazard rates assuming that an APC model holds for each one. We construct linear hypothesis tests to determine whether the two hazards...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3865

    authors: Rosenberg PS,Anderson WF

    update_date:2010-05-20 00:00:00

  • Model selection in logistic joinpoint regression with applications to analyzing cohort mortality patterns.

    abstract::We consider a general model for anomaly detection in a longitudinal cohort mortality pattern based on logistic joinpoint regression with unknown joinpoints. We discuss backward and forward sequential procedures for selecting both the locations and the number of joinpoints. Estimation of the model parameters and the se...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3017

    authors: Czajkowski M,Gill R,Rempala G

    update_date:2008-04-30 00:00:00

  • Generalization of normal discriminant analysis using Fourier series density estimators. Transfusion Safety Study Group.

    abstract::In this paper we examine the efficiency of a generalization of the traditional normal linear (LDA) or quadratic (QDA) discriminant analysis. This procedure (the generalized discriminant analysis, GDA) replaces each normal density used in the traditional classification rule by a Fourier series density estimator which '...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780100319

    authors: Odom-Maryon T,Langholz B,Niland J,Azen S

    update_date:1991-03-01 00:00:00

  • Smooth bootstrap methods for analysis of longitudinal data.

    abstract::In analysis of longitudinal data, the variance matrix of the parameter estimates is usually estimated by the 'sandwich' method, in which the variance for each subject is estimated by its residual products. We propose smooth bootstrap methods by perturbing the estimating functions to obtain 'bootstrapped' realizations ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3027

    authors: Li Y,Wang YG

    update_date:2008-03-30 00:00:00

  • Exact group-sequential designs for clinical trials with randomized play-the-winner allocation.

    abstract::The use of both sequential designs and adaptive treatment allocation are effective in reducing the number of patients receiving an inferior treatment in a clinical trial. In large samples, when the asymptotic normality of test statistics can be utilized, a standard sequential design can be combined with adaptive alloc...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.998

    authors: Stallard N,Rosenberger WF

    update_date:2002-02-28 00:00:00

  • Establishing the relationship between nurse staffing and hospital mortality using a clustered discrete-time logistic model.

    abstract::Studies based on aggregated hospital outcome data have established that there is a relationship between nurse staffing and adverse events. However, this result could not be confirmed in Belgium where 96 per cent of the variability of nurse staffing levels over nursing units (belonging to different hospitals) is explai...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3756

    authors: Diya L,Lesaffre E,Van den Heede K,Sermeus W,Vleugels A

    update_date:2010-03-30 00:00:00

  • Robust Bayesian sample size determination in clinical trials.

    abstract::This article deals with determination of a sample size that guarantees the success of a trial. We follow a Bayesian approach and we say an experiment is successful if it yields a large posterior probability that an unknown parameter of interest (an unknown treatment effect or an effects-difference) is greater than a c...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3175

    authors: Brutti P,De Santis F,Gubbiotti S

    update_date:2008-06-15 00:00:00

  • Study of coverage of confidence intervals for the standardized mortality ratio in studies with missing death certificates.

    abstract::This paper assesses the coverage probability of commonly used confidence intervals for the standardized mortality ratio (SMR) when death certificates are missing. It also proposes alternative confidence interval approaches with coverage probabilities close to .95. In epidemiology, the SMR is an important measure of ri...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7432

    authors: Timkova J,Kotik L,Tomasek L

    update_date:2017-11-30 00:00:00

  • Bias resulting from the use of 'assay sensitivity' as an inclusion criterion for meta-analysis.

    abstract::Assay sensitivity has been proposed as a criterion for including psychiatric clinical outcome studies in meta-analyses. The authors assess the performance of assay sensitivity as a method for determining study appropriateness for meta-analysis by calculating expected standard drug vs placebo effect sizes for various c...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.2240

    authors: Gelfand LA,Strunk DR,Tu XM,Noble RE,Derubeis RJ

    update_date:2006-03-30 00:00:00

  • An extension of the continual reassessment method using decision theory.

    abstract::The primary goal of a phase I trial is to find the maximally tolerated dose (MTD) of a treatment. The MTD is usually defined in terms of a tolerable probability, q(*), of toxicity. Our objective is to find the highest dose with toxicity risk that does not exceed q(*), a criterion that is often desired in designing pha...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.970

    authors: Leung DH,Wang YG

    update_date:2002-01-15 00:00:00

  • Estimation of death rates in US states with small subpopulations.

    abstract::In US states with small subpopulations, the observed mortality rates are often zero, particularly among young ages. Because in life tables, death rates are reported mostly on a log scale, zero mortality rates are problematic. To overcome the observed zero death rates problem, appropriate probability models are used. U...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6385

    authors: Voulgaraki A,Wei R,Kedem B

    update_date:2015-05-20 00:00:00