Estimating inverse probability weights using super learner when weight-model specification is unknown in a marginal structural Cox model context.

Abstract:

:Correct specification of the inverse probability weighting (IPW) model is necessary for consistent inference from a marginal structural Cox model (MSCM). In practical applications, researchers are typically unaware of the true specification of the weight model. Nonetheless, IPWs are commonly estimated using parametric models, such as the main-effects logistic regression model. In practice, assumptions underlying such models may not hold and data-adaptive statistical learning methods may provide an alternative. Many candidate statistical learning approaches are available in the literature. However, the optimal approach for a given dataset is impossible to predict. Super learner (SL) has been proposed as a tool for selecting an optimal learner from a set of candidates using cross-validation. In this study, we evaluate the usefulness of a SL in estimating IPW in four different MSCM simulation scenarios, in which we varied the specification of the true weight model specification (linear and/or additive). Our simulations show that, in the presence of weight model misspecification, with a rich and diverse set of candidate algorithms, SL can generally offer a better alternative to the commonly used statistical learning approaches in terms of MSE as well as the coverage probabilities of the estimated effect in an MSCM. The findings from the simulation studies guided the application of the MSCM in a multiple sclerosis cohort from British Columbia, Canada (1995-2008), to estimate the impact of beta-interferon treatment in delaying disability progression. Copyright © 2017 John Wiley & Sons, Ltd.

journal_name

Stat Med

journal_title

Statistics in medicine

authors

Karim ME,Platt RW,BeAMS study group.

doi

10.1002/sim.7266

subject

Has Abstract

pub_date

2017-06-15 00:00:00

pages

2032-2047

issue

13

eissn

0277-6715

issn

1097-0258

journal_volume

36

pub_type

杂志文章
  • Estimating heterogeneous treatment effects for latent subgroups in observational studies.

    abstract::Individuals may vary in their responses to treatment, and identification of subgroups differentially affected by a treatment is an important issue in medical research. The risk of misleading subgroup analyses has become well known, and some exploratory analyses can be helpful in clarifying how covariates potentially i...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.7970

    authors: Kim HJ,Lu B,Nehus EJ,Kim MO

    更新日期:2019-02-10 00:00:00

  • Bayesian modelling of imperfect ascertainment methods in cancer studies.

    abstract::Tumour registry linkage, chart review and patient self-report are all commonly used ascertainment methods in cancer epidemiology. These methods are used for estimating the incidence or prevalence of different cancer types in a population, and for investigating the effects of possible risk factors for cancer. Tumour re...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2116

    authors: Bernatsky S,Joseph L,Bélisle P,Boivin JF,Rajan R,Moore A,Clarke A

    更新日期:2005-08-15 00:00:00

  • Global goodness-of-fit tests for group testing regression models.

    abstract::In a variety of biomedical applications, particularly those involving screening for infectious diseases, testing individuals (e.g. blood/urine samples, etc.) in pools has become a standard method of data collection. This experimental design, known as group testing (or pooled testing), can provide a large reduction in ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3678

    authors: Chen P,Tebbs JM,Bilder CR

    更新日期:2009-10-15 00:00:00

  • Predictive value of statistical models.

    abstract::A review is given of different ways of estimating the error rate of a prediction rule based on a statistical model. A distinction is drawn between apparent, optimum and actual error rates. Moreover it is shown how cross-validation can be used to obtain an adjusted predictor with smaller error rate. A detailed discussi...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780091109

    authors: Van Houwelingen JC,Le Cessie S

    更新日期:1990-11-01 00:00:00

  • Minimum sample size for developing a multivariable prediction model: Part I - Continuous outcomes.

    abstract::In the medical literature, hundreds of prediction models are being developed to predict health outcomes in individuals. For continuous outcomes, typically a linear regression model is developed to predict an individual's outcome value conditional on values of multiple predictors (covariates). To improve model developm...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.7993

    authors: Riley RD,Snell KIE,Ensor J,Burke DL,Harrell FE Jr,Moons KGM,Collins GS

    更新日期:2019-03-30 00:00:00

  • Estimating population effects of vaccination using large, routinely collected data.

    abstract::Vaccination in populations can have several kinds of effects. Establishing that vaccination produces population-level effects beyond the direct effects in the vaccinated individuals can have important consequences for public health policy. Formal methods have been developed for study designs and analysis that can esti...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.7392

    authors: Halloran ME,Hudgens MG

    更新日期:2018-01-30 00:00:00

  • Long-term survivor mixture model with random effects: application to a multi-centre clinical trial of carcinoma.

    abstract::A mixture model incorporating long-term survivors has been adopted in the field of biostatistics where some individuals may never experience the failure event under study. The surviving fractions may be considered as cured. In most applications, the survival times are assumed to be independent. However, when the survi...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.932

    authors: Yau KK,Ng AS

    更新日期:2001-06-15 00:00:00

  • Human disease cost network analysis.

    abstract::Diseases can be interconnected. In the recent years, there has been a surge of multidisease studies. Among them, HDN (human disease network) analysis takes a system perspective, examines the interconnections among diseases along with their individual properties, and has demonstrated great potential. Most of the existi...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8472

    authors: Ma C,Li Y,Shia B,Ma S

    更新日期:2020-04-30 00:00:00

  • Design and analysis of non-inferiority mortality trials in oncology.

    abstract::The recent revision of the Declaration of Helsinki and the existence of many new therapies that affect survival or serious morbidity, and that therefore cannot be denied patients, have generated increased interest in active-control trials, particularly those intended to show equivalence or non-inferiority to the activ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1400

    authors: Rothmann M,Li N,Chen G,Chi GY,Temple R,Tsou HH

    更新日期:2003-01-30 00:00:00

  • Dunnett-type inference in the frailty Cox model with covariates.

    abstract::A frequent objective in medical research is the investigation of differences in patient survival between several experimental treatments and one standard treatment. In order to assess these differences statistically, we have to apply adjustments for multiple comparisons to prevent an increased number of false-positive...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4403

    authors: Herberich E,Hothorn T

    更新日期:2012-01-13 00:00:00

  • Correlation analysis for longitudinal data: applications to HIV and psychosocial research.

    abstract::Correlation analysis is widely used in biomedical and psychosocial research for assessing rater reliability, precision of diagnosis and accuracy of proxy outcomes. The popularity of longitudinal study designs has propelled the proliferation in recent years of new methods for longitudinal and other multi-level clustere...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2857

    authors: Tu XM,Feng C,Kowalski J,Tang W,Wang H,Wan C,Ma Y

    更新日期:2007-09-30 00:00:00

  • Performance assessment for radiologists interpreting screening mammography.

    abstract::When interpreting screening mammograms radiologists decide whether suspicious abnormalities exist that warrant the recall of the patient for further testing. Previous work has found significant differences in interpretation among radiologists; their false-positive and false-negative rates have been shown to vary widel...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2633

    authors: Woodard DB,Gelfand AE,Barlow WE,Elmore JG

    更新日期:2007-03-30 00:00:00

  • The many weak instruments problem and Mendelian randomization.

    abstract::Instrumental variable estimates of causal effects can be biased when using many instruments that are only weakly associated with the exposure. We describe several techniques to reduce this bias and estimate corrected standard errors. We present our findings using a simulation study and an empirical application. For th...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6358

    authors: Davies NM,von Hinke Kessler Scholder S,Farbmacher H,Burgess S,Windmeijer F,Smith GD

    更新日期:2015-02-10 00:00:00

  • Logistic discrimination of mixtures of M. tuberculosis and non-specific tuberculin reactions.

    abstract::Interpretation of the Mantoux test for tuberculous infection can be complicated by cross-reactions caused by infection with non-specific mycobacteria. Thus, the distribution of positive indurations is a mixture of two distributions. To estimate tuberculous infection prevalence, the marginal distribution of indurations...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.745

    authors: Nagelkerke NJ,Borgdorff MW,Kim SJ

    更新日期:2001-04-15 00:00:00

  • Constructing binomial confidence intervals with near nominal coverage by adding a single imaginary failure or success.

    abstract::In this paper we present a simple method for constructing (1- alpha)100 per cent confidence intervals for binomial proportions with near nominal coverage for all underlying proportion parameters on the unit interval. This new method uses, with a slight modification, the standard normal approximation technique taught i...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2469

    authors: Borkowf CB

    更新日期:2006-11-15 00:00:00

  • Cancer immunotherapy trial design with cure rate and delayed treatment effect.

    abstract::Cancer immunotherapy trials have two special features: a delayed treatment effect and a cure rate. Both features violate the proportional hazard model assumption and ignoring either one of the two features in an immunotherapy trial design will result in substantial loss of statistical power. To properly design immunot...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8440

    authors: Wei J,Wu J

    更新日期:2020-03-15 00:00:00

  • Bayesian non-response models for categorical data from small areas: an application to BMD and age.

    abstract::We provide a Bayesian analysis of data categorized into two levels of age (younger than 50 years, at least 50 years) and three levels of bone mineral density (normal, osteopenia, osteoporosis) for white females at least 20 years old in the third National Health and Nutrition Examination Survey. For the sample, the age...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1985

    authors: Nandram B,Liu N,Choi JW,Cox L

    更新日期:2005-04-15 00:00:00

  • Comparison of methods for the analysis of longitudinal interval count data.

    abstract::Longitudinal studies are often concerned with estimating the recurrence rate of a non-fatal event. In many cases, only the total number of events occurring during successive time intervals is known. We compared a mixed Poisson-gamma regression method proposed by Thall and a quasi-likelihood method proposed by Zeger an...

    journal_title:Statistics in medicine

    pub_type: 临床试验,杂志文章,随机对照试验

    doi:10.1002/sim.4780121406

    authors: Stukel TA

    更新日期:1993-07-30 00:00:00

  • A score test for establishing non-inferiority with respect to short-term survival in two-sample comparisons with identical proportions of long-term survivors.

    abstract::In recent years randomized trials designed to establish non-inferiority of a new treatment as compared to a standard one have been more widely used. Two-sample statistics have been proposed for this equivalence testing problem. However, they are not suited to situations where a long-term survivor fraction is expected....

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1453

    authors: Broët P,Tubert-Bitter P,De Rycke Y,Moreau T

    更新日期:2003-03-30 00:00:00

  • Adjusting for verification bias in diagnostic test evaluation: a Bayesian approach.

    abstract::Obtaining accurate estimates of the performance of a diagnostic test for some population of patients might be difficult when the sample of subjects used for this purpose is not representative for the whole population. Thus, in the motivating example of this paper a test is evaluated by comparing its results with those...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3099

    authors: Buzoianu M,Kadane JB

    更新日期:2008-06-15 00:00:00

  • Cross calibration in longitudinal studies.

    abstract::In a long-running longitudinal study using complex machinery to obtain measurements, it is sometimes necessary to replace the machine. This can result in lack of continuity in the measurements that can overwhelm any treatment effect or time trend. We propose a Bayesian procedure implemented using Markov chain Monte Ca...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1868

    authors: Ambrosius WT,Hui SL

    更新日期:2004-09-30 00:00:00

  • Statistical education for medical students--concepts are what remain when the details are forgotten.

    abstract::Teaching statistics to medical students is a challenging and often unrewarding task. However, few would argue the need for statistics in the medical school curriculum. In recent years, there has been a growing call for teaching only statistical concepts in medical schools. We strongly oppose this opinion and offer an ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2906

    authors: Herman A,Notzer N,Libman Z,Braunstein R,Steinberg DM

    更新日期:2007-10-15 00:00:00

  • Analyzing longitudinal data with patients in different disease states during follow-up and death as final state.

    abstract::This paper considers the analysis of longitudinal data complicated by the fact that during follow-up patients can be in different disease states, such as remission, relapse or death. If both the response of interest (for example, quality of life (QOL)) and the amount of missing data depend on this disease state, ignor...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3755

    authors: le Cessie S,de Vries EG,Buijs C,Post WJ

    更新日期:2009-12-30 00:00:00

  • A scan statistic with a variable window.

    abstract::Given N points or events occurring according to some probability distribution in the unit interval (0, 1), the simple scan statistic is defined to be the maximum number of points in any sub-interval of length d. In many areas, as in epidemiology, it is used to test the null hypothesis that the events are random, again...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(19960415)15:7/9<845::aid-s

    authors: Nagarwalla N

    更新日期:1996-04-15 00:00:00

  • Issues in applied statistics for public health bioterrorism surveillance using multiple data streams: research needs.

    abstract::The objective of this report is to provide a basis to inform decisions about priorities for developing statistical research initiatives in the field of public health surveillance for emerging threats. Rapid information system advances have created a vast opportunity of secondary data sources for information to enhance...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2793

    authors: Rolka H,Burkom H,Cooper GF,Kulldorff M,Madigan D,Wong WK

    更新日期:2007-04-15 00:00:00

  • Confidence intervals for an exposure adjusted incidence rate difference with applications to clinical trials.

    abstract::To summarize safety data such as clinical adverse experiences in clinical trials with a moderate to long-term follow-up, we may use a measurement which accounts for the potential differences in the follow-up duration between treatment groups. The incidence rate, which uses the total person-time follow-up in a treatmen...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2335

    authors: Liu GF,Wang J,Liu K,Snavely DB

    更新日期:2006-04-30 00:00:00

  • Testing whether genetic variation explains correlation of quantitative measures of gene expression, and application to genetic network analysis.

    abstract::Genetic networks for gene expression data are often built by graphical models, which in turn are built from pair-wise correlations of gene expression levels. A key feature of building graphical models is the evaluation of conditional independence of two traits, given other traits. When conditional independence can be ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3274

    authors: Yu Z,Wang L,Hildebrandt MA,Schaid DJ

    更新日期:2008-08-30 00:00:00

  • Multiple statistics for multiple events, with application to repeated infections in the growth factor studies.

    abstract::Clinical studies that involve the recording of two or more distinct and well-defined events on each subject give rise to multiple event data. Treatment comparisons are usually reported in univariate analyses of time to first event or number of events observed. However, this approach may not uncover the 'full story' of...

    journal_title:Statistics in medicine

    pub_type: 临床试验,杂志文章,多中心研究,随机对照试验

    doi:10.1002/(sici)1097-0258(19970430)16:8<941::aid-sim

    authors: Barai U,Teoh N

    更新日期:1997-04-30 00:00:00

  • A framework establishing clear decision criteria for the assessment of drug efficacy.

    abstract::Much has been published on various aspects of data analysis and reporting from clinical trials within the biopharmaceutical environment. This ranges from regulatory guidelines on the format and content of registration dossiers to recommendations on data presentation and the statistical methodologies that are appropria...

    journal_title:Statistics in medicine

    pub_type: 杂志文章,评审

    doi:10.1002/(sici)1097-0258(19980815/30)17:15/16<1829:

    authors: Huster WJ,Enas GG

    更新日期:1998-08-15 00:00:00

  • Modification of the sample size and the schedule of interim analyses in survival trials based on data inspections.

    abstract::A method is presented which allows us to adapt the sample size as well as the number and time points of interim analyses to the treatment difference observed at an interim look during the course of a clinical trial with censored survival time as the endpoint. The method allows the inclusion of data inspections during ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1136

    authors: Schäfer H,Müller HH

    更新日期:2001-12-30 00:00:00