Optimal matching with minimal deviation from fine balance in a study of obesity and surgical outcomes.

Abstract:

In multivariate matching, fine balance constrains the marginal distributions of a nominal variable in treated and matched control groups to be identical without constraining who is matched to whom. In this way, a fine balance constraint can balance a nominal variable with many levels while focusing efforts on other more important variables when pairing individuals to minimize the total covariate distance within pairs. Fine balance is not always possible; that is, it is a constraint on an optimization problem, but the constraint is not always feasible. We propose a new algorithm that returns a minimum distance finely balanced match when one is feasible, and otherwise minimizes the total distance among all matched samples that minimize the deviation from fine balance. Perhaps we can come very close to fine balance when fine balance is not attainable; moreover, in any event, because our algorithm is guaranteed to come as close as possible to fine balance, the investigator may perform one match, and on that basis judge whether the best attainable balance is adequate or not. We also show how to incorporate an additional constraint. The algorithm is implemented in two similar ways, first as an optimal assignment problem with an augmented distance matrix, second as a minimum cost flow problem in a network. The case of knee surgery in the Obesity and Surgical Outcomes Study motivated the development of this algorithm and is used as an illustration. In that example, 2 of 47 hospitals had too few nonobese patients to permit fine balance for the nominal variable with 47 levels representing the hospital, but our new algorithm came very close to fine balance. Moreover, in that example, there was a shortage of nonobese diabetic patients, and incorporation of an additional constraint forced the match to include all of these nonobese diabetic patients, thereby coming as close as possible to balance for this important but recalcitrant covariate.
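The two-stage objective described in the abstract (first minimize the deviation from fine balance, then minimize total distance among the matches that achieve it) can be illustrated on a toy problem. The sketch below is not the authors' algorithm: it replaces the augmented assignment problem and the minimum cost flow formulation with a brute-force search that is only practical for a handful of units. The deviation measure (sum over levels of the absolute difference in counts), the function names, and the toy data are all assumptions made for illustration.

```python
from collections import Counter
from itertools import permutations


def balance_deviation(treated_levels, matched_control_levels):
    """Assumed deviation measure: sum over levels of |treated count - control count|."""
    t = Counter(treated_levels)
    c = Counter(matched_control_levels)
    return sum(abs(t[k] - c[k]) for k in set(t) | set(c))


def near_fine_balance_match(dist, treated_levels, control_levels):
    """Brute-force 1:1 match: minimize deviation first, total distance second.

    dist[i][j] is the covariate distance between treated unit i and control j.
    Returns (deviation, total_distance, cols), where cols[i] is the index of
    the control paired with treated unit i.
    """
    n_treated = len(treated_levels)
    best = None
    # Enumerate every way to assign distinct controls to the treated units.
    for cols in permutations(range(len(control_levels)), n_treated):
        dev = balance_deviation(treated_levels, [control_levels[j] for j in cols])
        total = sum(dist[i][j] for i, j in enumerate(cols))
        candidate = (dev, total, cols)
        # Tuple comparison gives the lexicographic order: deviation, then distance.
        if best is None or candidate < best:
            best = candidate
    return best


# Hypothetical toy data: two treated units at level "A" but only one "A" control,
# so exact fine balance is infeasible.
treated_levels = ["A", "A", "B"]
control_levels = ["A", "B", "B", "B"]
dist = [
    [5, 4, 2, 5],
    [3, 1, 6, 2],
    [2, 3, 1, 4],
]

best = near_fine_balance_match(dist, treated_levels, control_levels)
print(best)  # (2, 5, (2, 1, 0))
```

In this toy match the single level-"A" control is used, which brings the marginal counts as close as possible, yet it is paired with the level-"B" treated unit. That reflects the point made in the abstract: the balance constraint acts on marginal distributions and does not restrict who is matched to whom.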

journal_name

Biometrics

authors

Yang D,Small DS,Silber JH,Rosenbaum PR

doi

10.1111/j.1541-0420.2011.01691.x

pub_date

2012-06-01 00:00:00

pages

628-636

issue

2

eissn

1541-0420

issn

0006-341X

journal_volume

68

pub_type

Journal Article
  • Flexible variable selection for recovering sparsity in nonadditive nonparametric models.

    abstract::Variable selection for recovering sparsity in nonadditive and nonparametric models with high-dimensional variables has been challenging. This problem becomes even more difficult due to complications in modeling unknown interaction terms among high-dimensional variables. There is currently no variable selection method ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12518

    authors: Fang Z,Kim I,Schaumont P

    update_date:2016-12-01 00:00:00

  • Inference for reaction networks using the linear noise approximation.

    abstract::We consider inference for the reaction rates in discretely observed networks such as those found in models for systems biology, population ecology, and epidemics. Most such networks are neither slow enough nor small enough for inference via the true state-dependent Markov jump process to be feasible. Typically, infere...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12152

    authors: Fearnhead P,Giagos V,Sherlock C

    update_date:2014-06-01 00:00:00

  • Case-control analysis with partial knowledge of exposure misclassification probabilities.

    abstract::Consider case control analysis with a dichotomous exposure variable that is subject to misclassification. If the classification probabilities are known, then methods are available to adjust odds-ratio estimates in light of the misclassification. We study the realistic scenario where reasonable guesses, but not exact v...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2001.00598.x

    authors: Gustafson P,Le ND,Saskin R

    update_date:2001-06-01 00:00:00

  • Bayesian dose-finding in phase I/II clinical trials using toxicity and efficacy odds ratios.

    abstract::A Bayesian adaptive design is proposed for dose-finding in phase I/II clinical trials to incorporate the bivariate outcomes, toxicity and efficacy, of a new treatment. Without specifying any parametric functional form for the drug dose-response curve, we jointly model the bivariate binary data to account for the corre...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2006.00534.x

    authors: Yin G,Li Y,Ji Y

    update_date:2006-09-01 00:00:00

  • Fitting nonlinear and constrained generalized estimating equations with optimization software.

    abstract::In this article, we present an estimation approach for solving nonlinear constrained generalized estimating equations that can be implemented using object-oriented software for nonlinear programming, such as nlminb in Splus or fmincon and lsqnonlin in Matlab. We show how standard estimating equation theory includes th...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2000.01268.x

    authors: Contreras M,Ryan LM

    update_date:2000-12-01 00:00:00

  • Alternative hypotheses for the effects of drugs in small-scale clinical studies.

    abstract::New drugs that will be investigated in the future are expected to deal with chronic diseases, where the number of patients available for controlled clinical trials will be small and where the long-term sequelae that it is hoped will be ameliorated take a long time to occur. Thus, it would be useful to construct powerf...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Salsburg D

    update_date:1986-09-01 00:00:00

  • The gamma-frailty Poisson model for the nonparametric estimation of panel count data.

    abstract::In this article, we study nonparametric estimation of the mean function of a counting process with panel observations. We introduce the gamma frailty variable to account for the intracorrelation between the panel counts of the counting process and construct a maximum pseudo-likelihood estimate with the frailty variabl...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2003.00126.x

    authors: Zhang Y,Jamshidian M

    update_date:2003-12-01 00:00:00

  • A multi-source adaptive platform design for testing sequential combinatorial therapeutic strategies.

    abstract::Traditional paradigms for clinical translation are challenged in settings where multiple contemporaneous therapeutic strategies have been identified as potentially beneficial. Platform trials have emerged as an approach for sequentially comparing multiple trials using a single protocol. The Ebola virus disease outbrea...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12841

    authors: Kaizer AM,Hobbs BP,Koopmeiners JS

    update_date:2018-09-01 00:00:00

  • A Markov model for analysing cancer markers and disease states in survival studies.

    abstract::In studies of serial cancer markers or disease states and their relation to survival, data on the marker or state are usually obtained at infrequent time points during follow-up. A Markov model is developed to assess the dependence of risk of death on marker level or disease state and inferences within this model are ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Kay R

    update_date:1986-12-01 00:00:00

  • A mixed model for repeated dilution assays.

    abstract::We propose a generalized linear mixed model to estimate and test marginal effects on titers repeatedly measured by serial dilution assays. The link is log-log and the titer is assumed to follow a gamma distribution. The parameters are estimated by generalized estimating equations. The marginal effects are tested by me...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Bloch J,Chavance M

    update_date:1998-06-01 00:00:00

  • Bayesian approaches to joint cure-rate and longitudinal models with applications to cancer vaccine trials.

    abstract::Complex issues arise when investigating the association between longitudinal immunologic measures and time to an event, such as time to relapse, in cancer vaccine trials. Unlike many clinical trials, we may encounter patients who are cured and no longer susceptible to the time-to-event endpoint. If there are cured pat...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/1541-0420.00079

    authors: Brown ER,Ibrahim JG

    update_date:2003-09-01 00:00:00

  • Bayesian prediction of spatial count data using generalized linear mixed models.

    abstract::Spatial weed count data are modeled and predicted using a generalized linear mixed model combined with a Bayesian approach and Markov chain Monte Carlo. Informative priors for a data set with sparse sampling are elicited using a previously collected data set with extensive sampling. Furthermore, we demonstrate that so...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2002.00280.x

    authors: Christensen OF,Waagepetersen R

    update_date:2002-06-01 00:00:00

  • Three-period crossover designs for two treatments.

    abstract::The use of three periods in the two-treatment crossover design for clinical trials is considered. It is proposed that a series of such trials in a particular therapeutic area may establish the relevance of the crossover design in that area. Treatment sequences to be used in three-period two-treatment trials are discus...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Ebbutt AF

    update_date:1984-03-01 00:00:00

  • Analysis of current status data with missing covariates.

    abstract::Statistical inference based on right-censored data for the proportional hazards (PH) model with missing covariates has received considerable attention, but interval-censored or current status data with missing covariates has not yet been investigated. Our study is partly motivated by the analysis of fracture data from...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2010.01505.x

    authors: Wen CC,Lin CT

    update_date:2011-09-01 00:00:00

  • A Bayesian approach to modeling associations between pulsatile hormones.

    abstract::Many hormones are secreted in pulses. The pulsatile relationship between hormones regulates many biological processes. To understand endocrine system regulation, time series of hormone concentrations are collected. The goal is to characterize pulsatile patterns and associations between hormones. Currently each ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2008.01117.x

    authors: Carlson NE,Johnson TD,Brown MB

    update_date:2009-06-01 00:00:00

  • Statistical modelling of the AIDS epidemic for forecasting health care needs.

    abstract::The objective of this paper is to develop statistical methods for estimating current and future numbers of individuals in different stages of the natural history of the human immunodeficiency (AIDS) virus infection and to evaluate the impact of therapeutic advances on these numbers. The approach is to extend the metho...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Brookmeyer R,Liao JG

    update_date:1990-12-01 00:00:00

  • Sharpening bounds on principal effects with covariates.

    abstract::Estimation of treatment effects in randomized studies is often hampered by possible selection bias induced by conditioning on or adjusting for a variable measured post-randomization. One approach to obviate such selection bias is to consider inference about treatment effects within principal strata, that is, principal...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12103

    authors: Long DM,Hudgens MG

    update_date:2013-12-01 00:00:00

  • Objective Bayesian search of Gaussian directed acyclic graphical models for ordered variables with non-local priors.

    abstract::Directed acyclic graphical (DAG) models are increasingly employed in the study of physical and biological systems to model direct influences between variables. Identifying the graph from data is a challenging endeavor, which can be more reasonably tackled if the variables are assumed to satisfy a given ordering; in th...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12018

    authors: Altomare D,Consonni G,La Rocca L

    update_date:2013-06-01 00:00:00

  • Assessing the goodness-of-fit of hidden Markov models.

    abstract::In this article, we propose a graphical technique for assessing the goodness-of-fit of a stationary hidden Markov model (HMM). We show that plots of the estimated distribution against the empirical distribution detect lack of fit with high probability for large sample sizes. By considering plots of the univariate and ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341X.2004.00189.x

    authors: MacKay Altman R

    update_date:2004-06-01 00:00:00

  • Variable selection for logistic regression using a prediction-focused information criterion.

    abstract::In biostatistical practice, it is common to use information criteria as a guide for model selection. We propose new versions of the focused information criterion (FIC) for variable selection in logistic regression. The FIC gives, depending on the quantity to be estimated, possibly different sets of selected variables....

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2006.00567.x

    authors: Claeskens G,Croux C,Van Kerckhoven J

    update_date:2006-12-01 00:00:00

  • Calculating sample size for studies with expected all-or-none nonadherence and selection bias.

    abstract::We develop sample size formulas for studies aiming to test mean differences between a treatment and control group when all-or-none nonadherence (noncompliance) and selection bias are expected. Recent work by Fay, Halloran, and Follmann (2007, Biometrics 63, 465-474) addressed the increased variances within grou...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2008.01114.x

    authors: Shardell MD,El-Kamary SS

    update_date:2009-06-01 00:00:00

  • Sample-size formula for the proportional-hazards regression model.

    abstract::A formula is derived for determining the number of observations necessary to test the equality of two survival distributions when concomitant information is incorporated. This formula should be useful in designing clinical trials with a heterogeneous patient population. Schoenfeld (1981, Biometrika 68, 316-319) derive...

    journal_title:Biometrics

    pub_type: Clinical Trial, Journal Article

    doi:

    authors: Schoenfeld DA

    update_date:1983-06-01 00:00:00

  • Hierarchical Bayes estimation of hunting success rates with spatial correlations.

    abstract::A Bayesian hierarchical generalized linear model is used to estimate hunting success rates at the subarea level for postseason harvest surveys. The model includes fixed week effects, random geographic effects, and spatial correlations between neighboring subareas. The computation is done by Gibbs sampling and adaptive...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2000.00360.x

    authors: He Z,Sun D

    update_date:2000-06-01 00:00:00

  • Model selection for G-estimation of dynamic treatment regimes.

    abstract::Dynamic treatment regimes (DTRs) aim to formalize personalized medicine by tailoring treatment decisions to individual patient characteristics. G-estimation for DTR identification targets the parameters of a structural nested mean model, known as the blip function, from which the optimal DTR is derived. Despite its po...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.13104

    authors: Wallace MP,Moodie EEM,Stephens DA

    update_date:2019-12-01 00:00:00

  • A multilevel mixed effects varying coefficient model with multilevel predictors and random effects for modeling hospitalization risk in patients on dialysis.

    abstract::For patients on dialysis, hospitalizations remain a major risk factor for mortality and morbidity. We use data from a large national database, United States Renal Data System, to model time-varying effects of hospitalization risk factors as functions of time since initiation of dialysis. To account for the three-level...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.13205

    authors: Li Y,Nguyen DV,Kürüm E,Rhee CM,Chen Y,Kalantar-Zadeh K,Şentürk D

    update_date:2020-09-01 00:00:00

  • A spatial Bayesian latent factor model for image-on-image regression.

    abstract::Image-on-image regression analysis, using images to predict images, is a challenging task, due to (1) the high dimensionality and (2) the complex spatial dependence structures in image predictors and image outcomes. In this work, we propose a novel image-on-image regression model, by extending a spatial Bayesian laten...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.13420

    authors: Guo C,Kang J,Johnson TD

    update_date:2020-12-27 00:00:00

  • Sensitivity analysis for nonrandom dropout: a local influence approach.

    abstract::Diggle and Kenward (1994, Applied Statistics 43, 49-93) proposed a selection model for continuous longitudinal data subject to nonrandom dropout. It has provoked a large debate about the role for such models. The original enthusiasm was followed by skepticism about the strong but untestable assumptions on which this t...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2001.00007.x

    authors: Verbeke G,Molenberghs G,Thijs H,Lesaffre E,Kenward MG

    update_date:2001-03-01 00:00:00

  • Case-control studies of gene-environment interaction: Bayesian design and analysis.

    abstract::With increasing frequency, epidemiologic studies are addressing hypotheses regarding gene-environment interaction. In many well-studied candidate genes and for standard dietary and behavioral epidemiologic exposures, there is often substantial prior information available that may be used to analyze current data as wel...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2009.01357.x

    authors: Mukherjee B,Ahn J,Gruber SB,Ghosh M,Chatterjee N

    update_date:2010-09-01 00:00:00

  • Fitting a multiplicative incidence model to age- and time-specific prevalence data.

    abstract::We discuss the assessment of age- and time-specific disease incidence using prevalence data. A method is described for conveniently fitting a discrete-time multiplicative model, subject to positivity constraints, using the EM-algorithm. Together with smoothing, it allows essentially nonparametric assessment of inciden...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Marschner IC

    update_date:1996-06-01 00:00:00

  • Coregionalized single- and multiresolution spatially varying growth curve modeling with application to weed growth.

    abstract::Modeling of longitudinal data from agricultural experiments using growth curves helps understand conditions conducive or unconducive to crop growth. Recent advances in Geographical Information Systems (GIS) now allow geocoding of agricultural data that help understand spatial patterns. A particularly common problem is...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2006.00535.x

    authors: Banerjee S,Johnson GA

    update_date:2006-09-01 00:00:00