Optimal matching with minimal deviation from fine balance in a study of obesity and surgical outcomes.

Abstract:

In multivariate matching, fine balance constrains the marginal distributions of a nominal variable in treated and matched control groups to be identical without constraining who is matched to whom. In this way, a fine balance constraint can balance a nominal variable with many levels while focusing efforts on other more important variables when pairing individuals to minimize the total covariate distance within pairs. Fine balance is not always possible; that is, it is a constraint on an optimization problem, but the constraint is not always feasible. We propose a new algorithm that returns a minimum distance finely balanced match when one is feasible, and otherwise minimizes the total distance among all matched samples that minimize the deviation from fine balance. Perhaps we can come very close to fine balance when fine balance is not attainable; moreover, in any event, because our algorithm is guaranteed to come as close as possible to fine balance, the investigator may perform one match, and on that basis judge whether the best attainable balance is adequate or not. We also show how to incorporate an additional constraint. The algorithm is implemented in two similar ways, first as an optimal assignment problem with an augmented distance matrix, second as a minimum cost flow problem in a network. The case of knee surgery in the Obesity and Surgical Outcomes Study motivated the development of this algorithm and is used as an illustration. In that example, 2 of 47 hospitals had too few nonobese patients to permit fine balance for the nominal variable with 47 levels representing the hospital, but our new algorithm came very close to fine balance. Moreover, in that example, there was a shortage of nonobese diabetic patients, and incorporation of an additional constraint forced the match to include all of these nonobese diabetic patients, thereby coming as close as possible to balance for this important but recalcitrant covariate.
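The objective described in the abstract (first minimize the deviation from fine balance, then minimize total distance among the matches that achieve it) can be illustrated on a toy problem. The sketch below is a brute-force enumeration in Python, not the authors' assignment or network-flow implementation. The data, the function names, and the deviation measure (sum over levels of the absolute difference in counts) are assumptions made for illustration only.

```python
from itertools import permutations

def balance_deviation(treated, matched_controls):
    """Sum over levels of |treated count - matched control count| (assumed measure)."""
    levels = {level for level, _ in treated} | {level for level, _ in matched_controls}
    return sum(
        abs(sum(lv == level for lv, _ in treated)
            - sum(lv == level for lv, _ in matched_controls))
        for level in levels
    )

def near_fine_balance_match(treated, controls):
    """Pair each treated unit with a distinct control by exhaustive search.

    Matches are compared lexicographically: smallest deviation from fine
    balance first, then smallest total covariate distance. Only practical
    for tiny examples; each unit is a (nominal_level, x) tuple.
    """
    best_key, best_choice = None, None
    for choice in permutations(range(len(controls)), len(treated)):
        matched = [controls[j] for j in choice]
        total = sum(abs(t[1] - c[1]) for t, c in zip(treated, matched))
        key = (balance_deviation(treated, matched), total)
        if best_key is None or key < best_key:
            best_key, best_choice = key, choice
    return best_key[0], best_key[1], list(best_choice)

# Hypothetical data: only one level-"A" control exists, so exact fine
# balance for two level-"A" treated units is infeasible.
treated = [("A", 2), ("A", 4), ("B", 10)]
controls = [("A", 5), ("B", 4), ("B", 11), ("C", 2)]

deviation, total_distance, pairing = near_fine_balance_match(treated, controls)
print(deviation, total_distance, pairing)
```

In this toy data the match with the smallest total distance uses controls 1, 2 and 3 (total distance 1) but omits the only level-"A" control, giving a deviation of 4. The lexicographic criterion instead selects a match with deviation 2 and total distance 2, which is the kind of trade-off the abstract describes.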

journal_name

Biometrics

journal_title

Biometrics

authors

Yang D,Small DS,Silber JH,Rosenbaum PR

doi

10.1111/j.1541-0420.2011.01691.x

pub_date

2012-06-01 00:00:00

pages

628-636

issue

2

eissn

1541-0420

issn

0006-341X

journal_volume

68

pub_type

Journal Article
  • Extraction of food consumption systems by nonnegative matrix factorization (NMF) for the assessment of food choices.

    abstract::In Western countries where food supply is satisfactory, consumers organize their diets around a large combination of foods. It is the purpose of this article to examine how recent nonnegative matrix factorization (NMF) techniques can be applied to food consumption data to understand these combinations. Such data are n...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2011.01588.x

    authors: Zetlaoui M,Feinberg M,Verger P,Clémençon S

    update_date:2011-12-01 00:00:00

  • Statistical methods for classification of human chromosomes.

    abstract::The basic technical facts of human cytogenetics and the laboratory methods employed in chromosome research are explained in simple terms. The main variables used to describe chromosome images are defined and discussed. Three discriminant analysis models for chromosome classification are developed: one in which each ch...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Habbema JD

    update_date:1979-03-01 00:00:00

  • Large scale maximum average power multiple inference on time-course count data with application to RNA-seq analysis.

    abstract::Experiments that longitudinally collect RNA sequencing (RNA-seq) data can provide transformative insights in biology research by revealing the dynamic patterns of genes. Such experiments create a great demand for new analytic approaches to identify differentially expressed (DE) genes based on large-scale time-course c...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.13144

    authors: Cao M,Zhou W,Breidt FJ,Peers G

    update_date:2020-03-01 00:00:00

  • Bayesian influence measures for joint models for longitudinal and survival data.

    abstract::This article develops a variety of influence measures for carrying out perturbation (or sensitivity) analysis to joint models of longitudinal and survival data (JMLS) in Bayesian analysis. A perturbation model is introduced to characterize individual and global perturbations to the three components of a Bayesian model...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2012.01745.x

    authors: Zhu H,Ibrahim JG,Chi YY,Tang N

    update_date:2012-09-01 00:00:00

  • Bayesian model selection for incomplete data using the posterior predictive distribution.

    abstract::We explore the use of a posterior predictive loss criterion for model selection for incomplete longitudinal data. We begin by identifying a property that most model selection criteria for incomplete data should consider. We then show that a straightforward extension of the Gelfand and Ghosh (1998, Biometrika, 85, 1-11...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2012.01766.x

    authors: Daniels MJ,Chatterjee AS,Wang C

    update_date:2012-12-01 00:00:00

  • On modelling microbial infections.

    abstract::Several models for the course of microbial infections during the incubation period are examined. Each model fits sore throat incubation data very well. Together with the lack of precision of the resulting parameter estimates, this suggests that incubation data alone are insufficient for elucidating the finer details o...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Morgan BJ,Watts SA

    update_date:1980-06-01 00:00:00

  • Three-period crossover designs for two treatments.

    abstract::The use of three periods in the two-treatment crossover design for clinical trials is considered. It is proposed that a series of such trials in a particular therapeutic area may establish the relevance of the crossover design in that area. Treatment sequences to be used in three-period two-treatment trials are discus...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Ebbutt AF

    update_date:1984-03-01 00:00:00

  • Analysis of ordered categorical data: two score-independent approaches.

    abstract:SUMMARY:A trend test is often employed to analyze ordered categorical data, in which a set of increasing scores is assigned a priori. There is a drawback in this approach, because how to choose a set of scores is not clear. There have been debates on which scores should be used (e.g., Graubard and Korn, 1987, Biometric...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2008.00992.x

    authors: Zheng G

    update_date:2008-12-01 00:00:00

  • Additive gamma frailty models with applications to competing risks in related individuals.

    abstract::Epidemiological studies of related individuals are often complicated by the fact that follow-up on the event type of interest is incomplete due to the occurrence of other events. We suggest a class of frailty models with cause-specific hazards for correlated competing events in related individuals. The frailties are b...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12326

    authors: Eriksson F,Scheike T

    update_date:2015-09-01 00:00:00

  • Fitting a multiplicative incidence model to age- and time-specific prevalence data.

    abstract::We discuss the assessment of age- and time-specific disease incidence using prevalence data. A method is described for conveniently fitting a discrete-time multiplicative model, subject to positivity constraints, using the EM-algorithm. Together with smoothing, it allows essentially nonparametric assessment of inciden...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Marschner IC

    update_date:1996-06-01 00:00:00

  • A spatial Bayesian latent factor model for image-on-image regression.

    abstract::Image-on-image regression analysis, using images to predict images, is a challenging task, due to (1) the high dimensionality and (2) the complex spatial dependence structures in image predictors and image outcomes. In this work, we propose a novel image-on-image regression model, by extending a spatial Bayesian laten...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.13420

    authors: Guo C,Kang J,Johnson TD

    update_date:2020-12-27 00:00:00

  • On Bayesian methods for bioequivalence.

    abstract::Bayesian methods are presented for assessing bioequivalence for studies in which a new formulation and a standard are administered simultaneously, and for Latin square designs which compare two or more new formulations to a standard. Two examples illustrate the application of the methods. ...

    journal_title:Biometrics

    pub_type: Clinical Trial, Journal Article

    doi:

    authors: Selwyn MR,Hall NR

    update_date:1984-12-01 00:00:00

  • Confidence intervals for the generalized ROC criterion.

    abstract::Receiver operating characteristic (ROC) curves are frequently used to assess the usefulness of diagnostic markers. When several diagnostic markers are available, they can be combined by a best linear combination: that is, when the area under the ROC curve of this combination is maximized among all possible linear comb...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Reiser B,Faraggi D

    update_date:1997-06-01 00:00:00

  • A Bayesian hidden Markov model for detecting differentially methylated regions.

    abstract::Alterations in DNA methylation have been linked to the development and progression of many diseases. The bisulfite sequencing technique presents methylation profiles at base resolution. Count data on methylated and unmethylated reads provide information on the methylation level at each CpG site. As more bisulfite sequ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.13000

    authors: Ji T

    update_date:2019-06-01 00:00:00

  • Incorporating correlation for multivariate failure time data when cluster size is large.

    abstract::We propose a new estimation method for multivariate failure time data using the quadratic inference function (QIF) approach. The proposed method efficiently incorporates within-cluster correlations. Therefore, it is more efficient than those that ignore within-cluster correlation. Furthermore, the proposed method is e...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2009.01307.x

    authors: Xue L,Wang L,Qu A

    update_date:2010-06-01 00:00:00

  • Robustness of group testing in the estimation of proportions.

    abstract::In binomial group testing, unlike one-at-a-time testing, the test unit consists of a group of individuals, and each group is declared to be defective or nondefective. A defective group is one that is presumed to include one or more defective (e.g., infected, positive) individuals and a nondefective group to contain on...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.1999.00231.x

    authors: Hung M,Swallow WH

    update_date:1999-03-01 00:00:00

  • Case-control studies of gene-environment interaction: Bayesian design and analysis.

    abstract::With increasing frequency, epidemiologic studies are addressing hypotheses regarding gene-environment interaction. In many well-studied candidate genes and for standard dietary and behavioral epidemiologic exposures, there is often substantial prior information available that may be used to analyze current data as wel...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2009.01357.x

    authors: Mukherjee B,Ahn J,Gruber SB,Ghosh M,Chatterjee N

    update_date:2010-09-01 00:00:00

  • Sample size determination for establishing equivalence/noninferiority via ratio of two proportions in matched-pair design.

    abstract::In this article, we propose approximate sample size formulas for establishing equivalence or noninferiority of two treatments in match-pairs design. Using the ratio of two proportions as the equivalence measure, we derive sample size formulas based on a score statistic for two types of analyses: hypothesis testing and...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2002.00957.x

    authors: Tang ML,Tang NS,Chan IS,Chan BP

    update_date:2002-12-01 00:00:00

  • A moving blocks empirical likelihood method for longitudinal data.

    abstract::In the analysis of longitudinal or panel data, neglecting the serial correlations among the repeated measurements within subjects may lead to inefficient inference. In particular, when the number of repeated measurements is large, it may be desirable to model the serial correlations more generally. An appealing approa...

    journal_title:Biometrics

    pub_type: Journal Article, Randomized Controlled Trial

    doi:10.1111/biom.12317

    authors: Qiu J,Wu L

    update_date:2015-09-01 00:00:00

  • Impact of time to start treatment following infection with application to initiating HAART in HIV-positive patients.

    abstract::We estimate how the effect of antiretroviral treatment depends on the time from HIV-infection to initiation of treatment, using observational data. A major challenge in making inferences from such observational data arises from biases associated with the nonrandom assignment of treatment, for example bias induced by d...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2011.01738.x

    authors: Lok JJ,DeGruttola V

    update_date:2012-09-01 00:00:00

  • A two-stage stepwise estimation procedure.

    abstract::This article proposes a two-stage simultaneous confidence procedure for the comparisons of k pairs of population means, without using multiplicity adjustment of more than two populations. The proposed procedure can be broadly applied to parametric or nonparametric models. It is robust and versatile because its derivat...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2007.00902.x

    authors: Chen JT

    update_date:2008-06-01 00:00:00

  • A stochastic model for the occurrence of transient ischemic attacks.

    abstract::This paper presents the development, application and evaluation of a stochastic model of the frequency of occurrence of transient ischemic attacks (TIAs). TIAs occur during periods of abnormal arterial activity. The TIAs which occur during a single period of abnormal activity are called a cluster of TIAs. Thus, the nu...

    journal_title:Biometrics

    pub_type: Clinical Trial, Journal Article

    doi:

    authors: Dunn JK,Hardy RJ

    update_date:1980-03-01 00:00:00

  • Nonparametric comparison of two survival-time distributions in the presence of dependent censoring.

    abstract::When testing the null hypothesis that treatment arm-specific survival-time distributions are equal, the log-rank test is asymptotically valid when the distribution of time to censoring is conditionally independent of randomized treatment group given survival time. We introduce a test of the null hypothesis for use whe...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/1541-0420.00059

    authors: DiRienzo AG

    update_date:2003-09-01 00:00:00

  • An improved method for measuring heart-rate variability: assessment of cardiac autonomic function.

    abstract::Heart rate oscillates in synchrony with respiration. Several methods have been employed to assess this 'sinus arrhythmia', as an index of autonomic nervous system function. This paper proposes a new, easily computed measure, R, which is relatively resistant to the major nonrespiratory sources of variation, including p...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Weinberg CR,Pfeifer MA

    update_date:1984-09-01 00:00:00

  • Order-restricted inference for means with missing values.

    abstract::Missing values appear very often in many applications, but the problem of missing values has not received much attention in testing order-restricted alternatives. Under the missing at random (MAR) assumption, we impute the missing values nonparametrically using kernel regression. For data with imputation, the classica...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12658

    authors: Wang H,Zhong PS

    update_date:2017-09-01 00:00:00

  • Likelihood-ratio tests for hidden Markov models.

    abstract::We consider hidden Markov models as a versatile class of models for weakly dependent random phenomena. The topic of the present paper is likelihood-ratio testing for hidden Markov models, and we show that, under appropriate conditions, the standard asymptotic theory of likelihood-ratio tests is valid. Such tests are c...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2000.00742.x

    authors: Giudici P,Rydén T,Vandekerkhove P

    update_date:2000-09-01 00:00:00

  • Determining the number of clusters using the weighted gap statistic.

    abstract::Estimating the number of clusters in a data set is a crucial step in cluster analysis. In this article, motivated by the gap method (Tibshirani, Walther, and Hastie, 2001, Journal of the Royal Statistical Society B63, 411-423), we propose the weighted gap and the difference of difference-weighted (DD-weighted) gap met...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2007.00784.x

    authors: Yan M,Ye K

    update_date:2007-12-01 00:00:00

  • False discovery rate estimation for frequentist pharmacovigilance signal detection methods.

    abstract::Pharmacovigilance systems aim at early detection of adverse effects of marketed drugs. They maintain large spontaneous reporting databases for which several automatic signaling methods have been developed. One limit of those methods is that the decision rules for the signal generation are based on arbitrary thresholds...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2009.01262.x

    authors: Ahmed I,Dalmasso C,Haramburu F,Thiessard F,Broët P,Tubert-Bitter P

    update_date:2010-03-01 00:00:00

  • Variance estimation for systematic designs in spatial surveys.

    abstract::In spatial surveys for estimating the density of objects in a survey region, systematic designs will generally yield lower variance than random designs. However, estimating the systematic variance is well known to be a difficult problem. Existing methods tend to overestimate the variance, so although the variance is g...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2011.01604.x

    authors: Fewster RM

    update_date:2011-12-01 00:00:00

  • Bayesian partitioning for modeling and mapping spatial case-control data.

    abstract::Methods for modeling and mapping spatial variation in disease risk continue to motivate much research. In particular, spatial analyses provide a useful tool for exploring geographical heterogeneity in health outcomes, and consequently can yield clues as to disease etiology, direct public health management, and generat...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2008.01193.x

    authors: Costain DA

    update_date:2009-12-01 00:00:00