Estimating probit models with self-selected treatments.

Abstract:

Outcomes research often requires estimating the impact of a binary treatment on a binary outcome in a non-randomized setting, such as the effect of taking a drug on mortality. The data often come from self-selected samples, leading to a spurious correlation between the treatment and outcome when standard binary dependent variable techniques, like logit or probit, are used. Intuition suggests that a two-step procedure (analogous to two-stage least squares) might be sufficient to deal with this problem if variables are available that are correlated with the treatment choice but not the outcome. This paper demonstrates the limitations of such a two-step procedure. We show that such estimators will not generally be consistent. We conduct a Monte Carlo exercise to compare the performance of the two-step probit estimator, the two-stage least squares linear probability model estimator, and the multivariate probit. The results from this exercise argue in favour of using the multivariate probit rather than the two-step or linear probability model estimators, especially when there is more than one treatment, when the average probability of the dependent variable is close to 0 or 1, or when the data generating process is not normal. We demonstrate how these different methods perform in an empirical example examining the effect of private and public insurance coverage on the mortality of HIV+ patients.
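
The self-selection problem described in the abstract can be illustrated with a small simulation. The sketch below is not the paper's Monte Carlo design: the data generating process, the parameter values (rho, beta, n, seed) and all function names are assumptions chosen for illustration. It draws a binary treatment and a binary outcome from correlated normal latent errors, then compares a naive linear probability estimate (the difference in mean outcomes) with a single-instrument two-stage least squares estimate (the ratio of covariances). It does not implement the two-step probit or the multivariate probit.

```python
import math
import random


def normal_cdf(x):
    """Standard normal CDF computed from the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def covariance(a, b):
    """Sample covariance of two equal-length sequences."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    return sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b)) / len(a)


def simulate(n=50_000, rho=0.5, beta=0.5, seed=0):
    """Simulate a self-selected binary treatment (assumed DGP, illustrative only).

    z   : instrument, affects treatment choice but not the outcome
    u   : latent error in the treatment equation
    eps : latent error in the outcome equation, correlated with u (rho)
    """
    rng = random.Random(seed)
    z, t, y = [], [], []
    for _ in range(n):
        zi = rng.gauss(0.0, 1.0)
        u = rng.gauss(0.0, 1.0)
        eps = rho * u + math.sqrt(1.0 - rho ** 2) * rng.gauss(0.0, 1.0)
        ti = 1 if zi + u > 0 else 0          # self-selected treatment
        yi = 1 if beta * ti + eps > 0 else 0  # binary outcome
        z.append(zi)
        t.append(ti)
        y.append(yi)

    treated = [yi for yi, ti in zip(y, t) if ti == 1]
    untreated = [yi for yi, ti in zip(y, t) if ti == 0]

    # Naive linear probability estimate: with a single binary regressor,
    # the OLS slope equals the difference in mean outcomes.
    naive = sum(treated) / len(treated) - sum(untreated) / len(untreated)

    # Two-stage least squares with one instrument and no covariates
    # reduces to the ratio cov(y, z) / cov(t, z).
    iv = covariance(y, z) / covariance(t, z)

    # Average treatment effect implied by the assumed outcome equation.
    true_ate = normal_cdf(beta) - normal_cdf(0.0)
    return {"naive": naive, "iv": iv, "true_ate": true_ate}


if __name__ == "__main__":
    for name, value in simulate().items():
        print(f"{name}: {value:.3f}")
```

Under these assumed settings the naive estimate should overstate the effect, because individuals with favourable unobserved outcomes are more likely to select treatment. The instrumented estimate should land much closer to the average effect, although a linear model applied to a binary outcome need not recover it exactly.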

journal_name

Stat Med

journal_title

Statistics in medicine

authors

Bhattacharya J,Goldman D,McCaffrey D

doi

10.1002/sim.2226

subject

Has Abstract

pub_date

2006-02-15 00:00:00

pages

389-413

issue

3

eissn

1097-0258

issn

0277-6715

journal_volume

25

pub_type

Journal Article
  • Empirical Bayes versus fully Bayesian analysis of geographical variation in disease risk.

    abstract::This paper reviews methods for mapping geographical variation in disease incidence and mortality. Recent results in Bayesian hierarchical modelling of relative risk are discussed. Two approaches to relative risk estimation, along with the related computational procedures, are described and compared. The first is an em...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780110802

    authors: Bernardinelli L,Montomoli C

    update_date:1992-06-15 00:00:00

  • Efficient evaluation of treatment effects in the presence of missing covariate values.

    abstract::In clinical trials, treatment comparisons are often performed by models that incorporate important prognostic factors. Since these models require complete covariate information on all patients, statisticians frequently resort to complete case analysis or to omission of an important covariate. A probability imputation ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780090707

    authors: Schemper M,Smith TL

    update_date:1990-07-01 00:00:00

  • Probabilistic cause-of-disease assignment using case-control diagnostic tests: A latent variable regression approach.

    abstract::Optimal prevention and treatment strategies for a disease of multiple causes, such as pneumonia, must be informed by the population distribution of causes among cases, or cause-specific case fractions (CSCFs). CSCFs may further depend on additional explanatory variables. Existing methodological literature in disease e...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.8804

    authors: Wu Z,Chen I

    update_date:2021-02-20 00:00:00

  • A method to test for a recent increase in HIV-1 seroconversion incidence: results from the Multicenter AIDS Cohort Study (MACS).

    abstract::We have formulated the problem of determining whether there has been an upturn in HIV-1 seroconversion incidence over the first five years of follow-up in the Multicenter AIDS Cohort Study (MACS) as that of locating the minimum of a quadratic regression or examination of two-knot piecewise spline models. Under a quadr...

    journal_title:Statistics in medicine

    pub_type: Journal Article,Multicenter Study

    doi:10.1002/sim.4780120207

    authors: Zhou SY,Kingsley LA,Taylor JM,Chmiel JS,He DY,Hoover DR

    update_date:1993-01-30 00:00:00

  • Goodness-of-fit test for proportional subdistribution hazards model.

    abstract::This paper concerns using modified weighted Schoenfeld residuals to test the proportionality of subdistribution hazards for the Fine-Gray model, similar to the tests proposed by Grambsch and Therneau for independently censored data. We develop a score test for the time-varying coefficients based on the modified Schoen...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.5815

    authors: Zhou B,Fine J,Laird G

    update_date:2013-09-30 00:00:00

  • Considerations for analysis of time-to-event outcomes measured with error: Bias and correction with SIMEX.

    abstract::For time-to-event outcomes, a rich literature exists on the bias introduced by covariate measurement error in regression models, such as the Cox model, and methods of analysis to address this bias. By comparison, less attention has been given to understanding the impact or addressing errors in the failure time outcome...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7554

    authors: Oh EJ,Shepherd BE,Lumley T,Shaw PA

    update_date:2018-04-15 00:00:00

  • Monitoring clinical trials: issues and controversies regarding confidentiality.

    abstract::During phase III clinical trials in life-threatening disease settings, it is important to ensure that the Data Monitoring Committee (DMC) has exclusive access to the interim efficacy and safety data generated by the data analysis centre, in order to minimize the risk of widespread prejudgement of unreliable trial resu...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1288

    authors: Fleming TR,Ellenberg S,DeMets DL

    update_date:2002-10-15 00:00:00

  • Network analytic methods for epidemiological risk assessment.

    abstract::The authors measure the efficacy of three methods for predicting the time to infection for susceptible individuals in a population undergoing an HIV epidemic. The methods differ in whether they require detailed information of the contact network and whether they require knowledge of the initial source of infection. Ef...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780130107

    authors: Altmann M,Wee BC,Willard K,Peterson D,Gatewood LC

    update_date:1994-01-15 00:00:00

  • Development and applications of a city-level alcohol availability and alcohol problems database.

    abstract::Data on alcohol availability and problems in all cities in Los Angeles County were collected from several different sources and linked together to form a Local Alcohol Availability Database (LAAD). The two major purposes of the project are to provide a city-level alcohol availability and alcohol-related problems datab...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780140517

    authors: MacKinnon DP,Scribner R,Taft KA

    update_date:1995-03-15 00:00:00

  • Logistic regression with incompletely observed categorical covariates--investigating the sensitivity against violation of the missing at random assumption.

    abstract::Missing values in the covariates are a widespread complication in the statistical inference of regression models. The maximum likelihood principle requires specification of the distribution of the covariates, at least in part. For categorical covariates, log-linear models can be used. Additionally, the missing at rand...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780141205

    authors: Vach W,Blettner M

    update_date:1995-06-30 00:00:00

  • The partial testing design: a less costly way to test equivalence for sensitivity and specificity.

    abstract::We propose a new, less costly, design to test the equivalence of digital versus analogue mammography in terms of sensitivity and specificity. Because breast cancer is a rare event among asymptomatic women, the sample size for testing equivalence of sensitivity is larger than that for testing equivalence of specificity...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/(sici)1097-0258(19981015)17:19<2219::aid-s

    authors: Baker SG,Connor RJ,Kessler LG

    update_date:1998-10-15 00:00:00

  • Parametric multistate survival models: Flexible modelling allowing transition-specific distributions with application to estimating clinically useful measures of effect differences.

    abstract::Multistate models are increasingly being used to model complex disease profiles. By modelling transitions between disease states, accounting for competing events at each transition, we can gain a much richer understanding of patient trajectories and how risk factors impact over the entire disease pathway. In this arti...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7448

    authors: Crowther MJ,Lambert PC

    update_date:2017-12-20 00:00:00

  • Estimating patterns of CD4 lymphocyte decline using data from a prevalent cohort of HIV infected individuals.

    abstract::In natural history studies of chronic disease, it is of interest to understand the evolution of key variables that measure aspects of disease progression. This is particularly true for immunological variables among persons infected with the human immunodeficiency virus (HIV). The natural time scale for such studies is...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780131103

    authors: Vittinghoff E,Malani HM,Jewell NP

    update_date:1994-06-15 00:00:00

  • Simultaneous modelling of operative mortality and long-term survival after coronary artery bypass surgery.

    abstract::Typical analyses of lifetime data treat the time to death or failure as the response variable and use a variety of modelling strategies such as proportional hazards or fully parametric, to investigate the relationship between the response and covariates. In certain circumstances it may be more natural to view the dist...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.822

    authors: Ghahramani M,Dean CB,Spinelli JJ

    update_date:2001-07-15 00:00:00

  • Rank-based estimating equations with general weight for accelerated failure time models: an induced smoothing approach.

    abstract::The induced smoothing technique overcomes the difficulties caused by the non-smoothness in rank-based estimating functions for accelerated failure time models, but it is only natural when the estimating function has Gehan's weight. For a general weight, the induced smoothing method does not provide smooth estimating f...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6415

    authors: Chiou S,Kang S,Yan J

    update_date:2015-04-30 00:00:00

  • Estimation methods for marginal and association parameters for longitudinal binary data with nonignorable missing observations.

    abstract::In longitudinal studies, missing observations occur commonly. It has been well known that biased results could be produced if missingness is not properly handled in the analysis. Authors have developed many methods with the focus on either incomplete response or missing covariate observations, but rarely on both. The ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.5536

    authors: Li H,Yi GY

    update_date:2013-02-28 00:00:00

  • Changes in clinical trials mandated by the advent of meta-analysis.

    abstract::Service on the Data Monitoring Committee of the CPEP (Calcium for Pre-eclampsia Prevention) has led us to four conclusions about clinical trials which we should like to present to this gathering of biostatisticians for their reactions: (i) meta-analyses of the pertinent published trials of the same therapy should alwa...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/(SICI)1097-0258(19960630)15:12<1263::AID-S

    authors: Chalmers TC,Lau J

    update_date:1996-06-30 00:00:00

  • Association models for periodontal disease progression: a comparison of methods for clustered binary data.

    abstract::We investigate population-averaged (PA) and cluster-specific (CS) associations for clustered binary logistic regression in the context of a longitudinal clinical trial that investigated the association between tooth-specific visual elastase kit results and periodontal disease progression within 26 weeks of follow-up. ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780140407

    authors: Ten Have TR,Landis JR,Weaver SL

    update_date:1995-02-28 00:00:00

  • Adjusting for drop-out in clinical trials with repeated measures: design and analysis issues.

    abstract::Recently, Wu and Follmann developed summary measures to adjust for informative drop-out in longitudinal studies where drop-out depends on the underlying true value of the response. In this paper we evaluate these procedures in the common situation where drop-out depends on the observed responses. We also discuss vario...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/1097-0258(20010115)20:1<93::aid-sim655>3.0

    authors: Wu MC,Albert PS,Wu BU

    update_date:2001-01-15 00:00:00

  • Integrating multiple-domain rules for disease classification.

    abstract::In psychiatry, clinicians use criteria sets from the Diagnostic and Statistical Manual of Mental Disorders to diagnose mental disorders. Most criteria sets have several symptom domains, and in order to be diagnosed, an individual must meet the minimum number of symptoms required by each domain. Some efforts are now fo...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.8173

    authors: Mauro C,Shear MK,Wang Y

    update_date:2019-07-20 00:00:00

  • Estimating population effects of vaccination using large, routinely collected data.

    abstract::Vaccination in populations can have several kinds of effects. Establishing that vaccination produces population-level effects beyond the direct effects in the vaccinated individuals can have important consequences for public health policy. Formal methods have been developed for study designs and analysis that can esti...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7392

    authors: Halloran ME,Hudgens MG

    update_date:2018-01-30 00:00:00

  • A selection model for longitudinal binary responses subject to non-ignorable attrition.

    abstract::Longitudinal studies collect information on a sample of individuals which is followed over time to analyze the effects of individual and time-dependent characteristics on the observed response. These studies often suffer from attrition: individuals drop out of the study before its completion time and thus present inco...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3604

    authors: Alfò M,Maruotti A

    update_date:2009-08-30 00:00:00

  • Signal detection in FDA AERS database using Dirichlet process.

    abstract::In the recent two decades, data mining methods for signal detection have been developed for drug safety surveillance, using large post-market safety data. Several of these methods assume that the number of reports for each drug-adverse event combination is a Poisson random variable with mean proportional to the unknow...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6510

    authors: Hu N,Huang L,Tiwari RC

    update_date:2015-08-30 00:00:00

  • Robust and efficient estimation in the parametric proportional hazards model under random censoring.

    abstract::Cox proportional hazard regression model is a popular tool to analyze the relationship between a censored lifetime variable with other relevant factors. The semiparametric Cox model is widely used to study different types of data arising from applied disciplines such as medical science, biology, and reliability studie...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.8377

    authors: Ghosh A,Basu A

    update_date:2019-11-30 00:00:00

  • Multilevel latent variable models for global health-related quality of life assessment.

    abstract::Quality of life (QOL) assessment is a key component of many clinical studies and frequently requires the use of single global summary measures that capture the overall balance of findings from a potentially wide-ranging assessment of QOL issues. We propose and evaluate an irregular multilevel latent variable model sui...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4455

    authors: Kifley A,Heller GZ,Beath KJ,Bulger D,Ma J,Gebski V

    update_date:2012-05-20 00:00:00

  • A model for cross-over trials evaluating therapeutic preferences.

    abstract::A preference trial is a special form of cross-over trial where clinical conditions determine when patients change treatment, in a prescribed order. This can be modelled using a geometric distribution. The model can be simply fitted using standard logistic regression methodology. The procedure is applied to a trial stu...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/(SICI)1097-0258(19960229)15:4<443::AID-SIM

    authors: Lindsey JK,Jones B

    update_date:1996-02-28 00:00:00

  • Concordance correlation coefficient applied to discrete data.

    abstract::In any field in which decisions are subject to measurements, interchangeability between the methods used to obtain these measurements is essential. To consider methods as interchangeable, a certain degree of agreement is needed between the measurements they provide. The concordance correlation coefficient is an index ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.2397

    authors: Carrasco JL,Jover L

    update_date:2005-12-30 00:00:00

  • Survival time models for analysing drug combination treatments.

    abstract::Several relative risk models for survival time data in drug combination therapy are derived and their properties are discussed. The main intention of this paper is to clarify the differences among the models in order to help to choose the appropriate one in a given situation. The models are motivated by discussing the...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780091216

    authors: Kübler J,Schumacher M

    update_date:1990-12-01 00:00:00

  • Testing whether genetic variation explains correlation of quantitative measures of gene expression, and application to genetic network analysis.

    abstract::Genetic networks for gene expression data are often built by graphical models, which in turn are built from pair-wise correlations of gene expression levels. A key feature of building graphical models is the evaluation of conditional independence of two traits, given other traits. When conditional independence can be ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3274

    authors: Yu Z,Wang L,Hildebrandt MA,Schaid DJ

    update_date:2008-08-30 00:00:00

  • How should meta-regression analyses be undertaken and interpreted?

    abstract::Appropriate methods for meta-regression applied to a set of clinical trials, and the limitations and pitfalls in interpretation, are insufficiently recognized. Here we summarize recent research focusing on these issues, and consider three published examples of meta-regression in the light of this work. One principal m...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1187

    authors: Thompson SG,Higgins JP

    update_date:2002-06-15 00:00:00