Abstract:
:In multivariate matching, fine balance constrains the marginal distributions of a nominal variable in treated and matched control groups to be identical without constraining who is matched to whom. In this way, a fine balance constraint can balance a nominal variable with many levels while focusing efforts on other more important variables when pairing individuals to minimize the total covariate distance within pairs. Fine balance is not always possible; that is, it is a constraint on an optimization problem, but the constraint is not always feasible. We propose a new algorithm that returns a minimum distance finely balanced match when one is feasible, and otherwise minimizes the total distance among all matched samples that minimize the deviation from fine balance. Perhaps we can come very close to fine balance when fine balance is not attainable; moreover, in any event, because our algorithm is guaranteed to come as close as possible to fine balance, the investigator may perform one match, and on that basis judge whether the best attainable balance is adequate or not. We also show how to incorporate an additional constraint. The algorithm is implemented in two similar ways, first as an optimal assignment problem with an augmented distance matrix, second as a minimum cost flow problem in a network. The case of knee surgery in the Obesity and Surgical Outcomes Study motivated the development of this algorithm and is used as an illustration. In that example, 2 of 47 hospitals had too few nonobese patients to permit fine balance for the nominal variable with 47 levels representing the hospital, but our new algorithm came very close to fine balance. Moreover, in that example, there was a shortage of nonobese diabetic patients, and incorporation of an additional constraint forced the match to include all of these nonobese diabetic patients, thereby coming as close as possible to balance for this important but recalcitrant covariate.
journal_name
Biometricsjournal_title
Biometricsauthors
Yang D,Small DS,Silber JH,Rosenbaum PRdoi
10.1111/j.1541-0420.2011.01691.xsubject
Has Abstractpub_date
2012-06-01 00:00:00pages
628-36issue
2eissn
0006-341Xissn
1541-0420journal_volume
68pub_type
杂志文章相关文献
BIOMETRICS文献大全abstract::Variable selection for recovering sparsity in nonadditive and nonparametric models with high-dimensional variables has been challenging. This problem becomes even more difficult due to complications in modeling unknown interaction terms among high-dimensional variables. There is currently no variable selection method ...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.12518
更新日期:2016-12-01 00:00:00
abstract::We consider inference for the reaction rates in discretely observed networks such as those found in models for systems biology, population ecology, and epidemics. Most such networks are neither slow enough nor small enough for inference via the true state-dependent Markov jump process to be feasible. Typically, infere...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.12152
更新日期:2014-06-01 00:00:00
abstract::Consider case control analysis with a dichotomous exposure variable that is subject to misclassification. If the classification probabilities are known, then methods are available to adjust odds-ratio estimates in light of the misclassification. We study the realistic scenario where reasonable guesses, but not exact v...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2001.00598.x
更新日期:2001-06-01 00:00:00
abstract::A Bayesian adaptive design is proposed for dose-finding in phase I/II clinical trials to incorporate the bivariate outcomes, toxicity and efficacy, of a new treatment. Without specifying any parametric functional form for the drug dose-response curve, we jointly model the bivariate binary data to account for the corre...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2006.00534.x
更新日期:2006-09-01 00:00:00
abstract::In this article, we present an estimation approach for solving nonlinear constrained generalized estimating equations that can be implemented using object-oriented software for nonlinear programming, such as nlminb in Splus or fmincon and lsqnonlin in Matlab. We show how standard estimating equation theory includes th...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2000.01268.x
更新日期:2000-12-01 00:00:00
abstract::New drugs that will be investigated in the future are expected to deal with chronic diseases, where the number of patients available for controlled clinical trials will be small and where the long-term sequelae that it is hoped will be ameliorated take a long time to occur. Thus, it would be useful to construct powerf...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1986-09-01 00:00:00
abstract::In this article, we study nonparametric estimation of the mean function of a counting process with panel observations. We introduce the gamma frailty variable to account for the intracorrelation between the panel counts of the counting process and construct a maximum pseudo-likelihood estimate with the frailty variabl...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2003.00126.x
更新日期:2003-12-01 00:00:00
abstract::Traditional paradigms for clinical translation are challenged in settings where multiple contemporaneous therapeutic strategies have been identified as potentially beneficial. Platform trials have emerged as an approach for sequentially comparing multiple trials using a single protocol. The Ebola virus disease outbrea...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.12841
更新日期:2018-09-01 00:00:00
abstract::In studies of serial cancer markers or disease states and their relation to survival, data on the marker or state are usually obtained at infrequent time points during follow-up. A Markov model is developed to assess the dependence of risk of death on marker level or disease state and inferences within this model are ...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1986-12-01 00:00:00
abstract::We propose a generalized linear mixed model to estimate and test marginal effects on titers repeatedly measured by serial dilution assays. The link is log-log and the titer is assumed to follow a gamma distribution. The parameters are estimated by generalized estimating equations. The marginal effects are tested by me...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1998-06-01 00:00:00
abstract::Complex issues arise when investigating the association between longitudinal immunologic measures and time to an event, such as time to relapse, in cancer vaccine trials. Unlike many clinical trials, we may encounter patients who are cured and no longer susceptible to the time-to-event endpoint. If there are cured pat...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/1541-0420.00079
更新日期:2003-09-01 00:00:00
abstract::Spatial weed count data are modeled and predicted using a generalized linear mixed model combined with a Bayesian approach and Markov chain Monte Carlo. Informative priors for a data set with sparse sampling are elicited using a previously collected data set with extensive sampling. Furthermore, we demonstrate that so...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2002.00280.x
更新日期:2002-06-01 00:00:00
abstract::The use of three periods in the two-treatment crossover design for clinical trials is considered. It is proposed that a series of such trials in a particular therapeutic area may establish the relevance of the crossover design in that area. Treatment sequences to be used in three-period two-treatment trials are discus...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1984-03-01 00:00:00
abstract::Statistical inference based on right-censored data for the proportional hazards (PH) model with missing covariates has received considerable attention, but interval-censored or current status data with missing covariates has not yet been investigated. Our study is partly motivated by the analysis of fracture data from...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2010.01505.x
更新日期:2011-09-01 00:00:00
abstract:SUMMARY:Many hormones are secreted in pulses. The pulsatile relationship between hormones regulates many biological processes. To understand endocrine system regulation, time series of hormone concentrations are collected. The goal is to characterize pulsatile patterns and associations between hormones. Currently each ...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2008.01117.x
更新日期:2009-06-01 00:00:00
abstract::The objective of this paper is to develop statistical methods for estimating current and future numbers of individuals in different stages of the natural history of the human immunodeficiency (AIDS) virus infection and to evaluate the impact of therapeutic advances on these numbers. The approach is to extend the metho...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1990-12-01 00:00:00
abstract::Estimation of treatment effects in randomized studies is often hampered by possible selection bias induced by conditioning on or adjusting for a variable measured post-randomization. One approach to obviate such selection bias is to consider inference about treatment effects within principal strata, that is, principal...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.12103
更新日期:2013-12-01 00:00:00
abstract::Directed acyclic graphical (DAG) models are increasingly employed in the study of physical and biological systems to model direct influences between variables. Identifying the graph from data is a challenging endeavor, which can be more reasonably tackled if the variables are assumed to satisfy a given ordering; in th...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.12018
更新日期:2013-06-01 00:00:00
abstract::In this article, we propose a graphical technique for assessing the goodness-of-fit of a stationary hidden Markov model (HMM). We show that plots of the estimated distribution against the empirical distribution detect lack of fit with high probability for large sample sizes. By considering plots of the univariate and ...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341X.2004.00189.x
更新日期:2004-06-01 00:00:00
abstract::In biostatistical practice, it is common to use information criteria as a guide for model selection. We propose new versions of the focused information criterion (FIC) for variable selection in logistic regression. The FIC gives, depending on the quantity to be estimated, possibly different sets of selected variables....
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2006.00567.x
更新日期:2006-12-01 00:00:00
abstract:SUMMARY:We develop sample size formulas for studies aiming to test mean differences between a treatment and control group when all-or-none nonadherence (noncompliance) and selection bias are expected. Recent work by Fay, Halloran, and Follmann (2007, Biometrics 63, 465-474) addressed the increased variances within grou...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2008.01114.x
更新日期:2009-06-01 00:00:00
abstract::A formula is derived for determining the number of observations necessary to test the equality of two survival distributions when concomitant information is incorporated. This formula should be useful in designing clinical trials with a heterogeneous patient population. Schoenfeld (1981, Biometrika 68, 316-319) derive...
journal_title:Biometrics
pub_type: 临床试验,杂志文章
doi:
更新日期:1983-06-01 00:00:00
abstract::A Bayesian hierarchical generalized linear model is used to estimate hunting success rates at the subarea level for postseason harvest surveys. The model includes fixed week effects, random geographic effects, and spatial correlations between neighboring subareas. The computation is done by Gibbs sampling and adaptive...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2000.00360.x
更新日期:2000-06-01 00:00:00
abstract::Dynamic treatment regimes (DTRs) aim to formalize personalized medicine by tailoring treatment decisions to individual patient characteristics. G-estimation for DTR identification targets the parameters of a structural nested mean model, known as the blip function, from which the optimal DTR is derived. Despite its po...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.13104
更新日期:2019-12-01 00:00:00
abstract::For patients on dialysis, hospitalizations remain a major risk factor for mortality and morbidity. We use data from a large national database, United States Renal Data System, to model time-varying effects of hospitalization risk factors as functions of time since initiation of dialysis. To account for the three-level...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.13205
更新日期:2020-09-01 00:00:00
abstract::Image-on-image regression analysis, using images to predict images, is a challenging task, due to (1) the high dimensionality and (2) the complex spatial dependence structures in image predictors and image outcomes. In this work, we propose a novel image-on-image regression model, by extending a spatial Bayesian laten...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/biom.13420
更新日期:2020-12-27 00:00:00
abstract::Diggle and Kenward (1994, Applied Statistics 43, 49-93) proposed a selection model for continuous longitudinal data subject to nonrandom dropout. It has provoked a large debate about the role for such models. The original enthusiasm was followed by skepticism about the strong but untestable assumptions on which this t...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.0006-341x.2001.00007.x
更新日期:2001-03-01 00:00:00
abstract::With increasing frequency, epidemiologic studies are addressing hypotheses regarding gene-environment interaction. In many well-studied candidate genes and for standard dietary and behavioral epidemiologic exposures, there is often substantial prior information available that may be used to analyze current data as wel...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2009.01357.x
更新日期:2010-09-01 00:00:00
abstract::We discuss the assessment of age- and time-specific disease incidence using prevalence data. A method is described for conveniently fitting a discrete-time multiplicative model, subject to positivity constraints, using the EM-algorithm. Together with smoothing, it allows essentially nonparametric assessment of inciden...
journal_title:Biometrics
pub_type: 杂志文章
doi:
更新日期:1996-06-01 00:00:00
abstract::Modeling of longitudinal data from agricultural experiments using growth curves helps understand conditions conducive or unconducive to crop growth. Recent advances in Geographical Information Systems (GIS) now allow geocoding of agricultural data that help understand spatial patterns. A particularly common problem is...
journal_title:Biometrics
pub_type: 杂志文章
doi:10.1111/j.1541-0420.2006.00535.x
更新日期:2006-09-01 00:00:00