Predictive accuracy of risk factors and markers: a simulation study of the effect of novel markers on different performance measures for logistic regression models.

Abstract:

:The change in c-statistic is frequently used to summarize the change in predictive accuracy when a novel risk factor is added to an existing logistic regression model. We explored the relationship between the absolute change in the c-statistic, Brier score, generalized R(2) , and the discrimination slope when a risk factor was added to an existing model in an extensive set of Monte Carlo simulations. The increase in model accuracy due to the inclusion of a novel marker was proportional to both the prevalence of the marker and to the odds ratio relating the marker to the outcome but inversely proportional to the accuracy of the logistic regression model with the marker omitted. We observed greater improvements in model accuracy when the novel risk factor or marker was uncorrelated with the existing predictor variable compared with when the risk factor has a positive correlation with the existing predictor variable. We illustrated these findings by using a study on mortality prediction in patients hospitalized with heart failure. In conclusion, the increase in predictive accuracy by adding a marker should be considered in the context of the accuracy of the initial model.

journal_name

Stat Med

journal_title

Statistics in medicine

authors

Austin PC,Steyerberg EW

doi

10.1002/sim.5598

subject

Has Abstract

pub_date

2013-02-20 00:00:00

pages

661-72

issue

4

eissn

0277-6715

issn

1097-0258

journal_volume

32

pub_type

杂志文章
  • Bayesian synthesis of epidemiological evidence with different combinations of exposure groups: application to a gene-gene-environment interaction.

    abstract::Meta-analysis to investigate the joint effect of multiple factors in the aetiology of a disease is of increasing importance in epidemiology. This task is often challenging in practice, because studies typically concentrate on studying the effect of only one exposure, sometimes may report the interaction between two ex...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2689

    authors: Salanti G,Higgins JP,White IR

    更新日期:2006-12-30 00:00:00

  • Estimating time-dependent ROC curves using data under prevalent sampling.

    abstract::Prevalent sampling is frequently a convenient and economical sampling technique for the collection of time-to-event data and thus is commonly used in studies of the natural history of a disease. However, it is biased by design because it tends to recruit individuals with longer survival times. This paper considers est...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.7184

    authors: Li S

    更新日期:2017-04-15 00:00:00

  • A graphical approach to sequentially rejective multiple test procedures.

    abstract::For clinical trials with multiple treatment arms or endpoints a variety of sequentially rejective, weighted Bonferroni-type tests have been proposed, such as gatekeeping procedures, fixed sequence tests, and fallback procedures. They allow to map the difference in importance as well as the relationship between the var...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3495

    authors: Bretz F,Maurer W,Brannath W,Posch M

    更新日期:2009-02-15 00:00:00

  • Determining the value of additional surrogate exposure data for improving the estimate of an odds ratio.

    abstract::We consider the design of both cohort and case-control studies in which an initial ('stage 1') sample of complete data on an error-free disease indicator (D), a correct ('gold standard') dichotomous exposure measurement (X) and an error-prone exposure measurement (Z) are available. We calculate the amount of additiona...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780142307

    authors: Dahm PF,Gail MH,Rosenberg PS,Pee D

    更新日期:1995-12-15 00:00:00

  • Model selection and diagnostics for joint modeling of survival and longitudinal data with crossing hazard rate functions.

    abstract::Comparison of two hazard rate functions is important for evaluating treatment effect in studies concerning times to some important events. In practice, it may happen that the two hazard rate functions cross each other at one or more unknown time points, representing temporal changes of the treatment effect. Also, besi...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6259

    authors: Park KY,Qiu P

    更新日期:2014-11-20 00:00:00

  • Estimating net transition probabilities from cross-sectional data with application to risk factors in chronic disease modeling.

    abstract::A problem occurring in chronic disease modeling is the estimation of transition probabilities of moving from one state of a categorical risk factor to another. Transitions could be obtained from a cohort study, but often such data may not be available. However, under the assumption that transitions remain stable over ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4423

    authors: Kassteele Jv,Hoogenveen RT,Engelfriet PM,Baal PH,Boshuizen HC

    更新日期:2012-03-15 00:00:00

  • Robust Bayesian sample size determination in clinical trials.

    abstract::This article deals with determination of a sample size that guarantees the success of a trial. We follow a Bayesian approach and we say an experiment is successful if it yields a large posterior probability that an unknown parameter of interest (an unknown treatment effect or an effects-difference) is greater than a c...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3175

    authors: Brutti P,De Santis F,Gubbiotti S

    更新日期:2008-06-15 00:00:00

  • The multiple-record systems estimator when registrations refer to different but overlapping populations.

    abstract::In multiple-record systems estimation it is usually assumed that all registration relate to the same population. In this paper, we develop a method which can be used when the registrations relate to different populations, in the sense that they cover, for example, different time periods or regions. We show that under ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1818

    authors: Zwane EN,van der Pal-de Bruin K,van der Heijden PG

    更新日期:2004-07-30 00:00:00

  • Semiparametric Bayesian variable selection for gene-environment interactions.

    abstract::Many complex diseases are known to be affected by the interactions between genetic variants and environmental exposures beyond the main genetic and environmental effects. Study of gene-environment (G×E) interactions is important for elucidating the disease etiology. Existing Bayesian methods for G×E interaction studie...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8434

    authors: Ren J,Zhou F,Li X,Chen Q,Zhang H,Ma S,Jiang Y,Wu C

    更新日期:2020-02-28 00:00:00

  • Bayesian analysis of misclassified binary data from a matched case-control study with a validation sub-study.

    abstract::Bayesian methods are proposed for analysing matched case-control studies in which a binary exposure variable is sometimes measured with error, but whose correct values have been validated for a random sample of the matched case-control sets. Three models are considered. Model 1 makes few assumptions other than randomn...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2000

    authors: Prescott GJ,Garthwaite PH

    更新日期:2005-02-15 00:00:00

  • Comparing and combining data across multiple sources via integration of paired-sample data to correct for measurement error.

    abstract::In biomedical research such as the development of vaccines for infectious diseases or cancer, study outcomes measured by an assay or device are often collected from multiple sources or laboratories. Measurement error that may vary between laboratories needs to be adjusted for when combining samples across data sources...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.5446

    authors: Huang Y,Huang Y,Moodie Z,Li S,Self S

    更新日期:2012-12-10 00:00:00

  • Likelihood-based confidence intervals for a log-normal mean.

    abstract::To construct a confidence interval for the mean of a log-normal distribution in small samples, we propose likelihood-based approaches - the signed log-likelihood ratio and modified signed log-likelihood ratio methods. Extensive Monte Carlo simulation results show the advantages of the modified signed log-likelihood ra...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1381

    authors: Wu J,Wong AC,Jiang G

    更新日期:2003-06-15 00:00:00

  • Optimal three-stage designs for phase II cancer clinical trials.

    abstract::The objective of a phase II cancer clinical trial is to screen a treatment that can produce a similar or better response rate compared to the current treatment results. This screening is usually carried out in two stages as proposed by Simon. For ineffective treatment, the trial should terminate at the first stage. En...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(19971215)16:23<2701::aid-s

    authors: Chen TT

    更新日期:1997-12-15 00:00:00

  • Exact unconditional tables for significance testing in the 2 x 2 multinomial trial.

    abstract::This paper presents tables analogous to T-tables for use in the 2 x 2 multinomial trial, where the continuity corrected Z-statistic is used to make exact unconditional inference. This is the first solution of a discrete exact unconditional inference problem involving a multivariate nuisance parameter for which no anci...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780110708

    authors: Shuster JJ

    更新日期:1992-05-01 00:00:00

  • Weighted hurdle regression method for joint modeling of cardiovascular events likelihood and rate in the US dialysis population.

    abstract::We propose a new weighted hurdle regression method for modeling count data, with particular interest in modeling cardiovascular events in patients on dialysis. Cardiovascular disease remains one of the leading causes of hospitalization and death in this population. Our aim is to jointly model the relationship/associat...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6232

    authors: Sentürk D,Dalrymple LS,Mu Y,Nguyen DV

    更新日期:2014-11-10 00:00:00

  • Semiparametric transformation models for joint analysis of multivariate recurrent and terminal events.

    abstract::Recurrent event data occur in many clinical and observational studies, and in these situations, there may exist a terminal event such as death that is related to the recurrent event of interest. In addition, sometimes more than one type of recurrent events may occur, that is, one may encounter multivariate recurrent e...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4306

    authors: Zhu L,Sun J,Srivastava DK,Tong X,Leisenring W,Zhang H,Robison LL

    更新日期:2011-11-10 00:00:00

  • A semi-parametric Bayesian approach to average bioequivalence.

    abstract::Bioequivalence assessment is an issue of great interest. Development of statistical methods for assessing bioequivalence is an important area of research for statisticians. Bioequivalence is usually determined based on the normal distribution. We relax this assumption and develop a semi-parametric mixed model for bioe...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2620

    authors: Ghosh P,Rosner GL

    更新日期:2007-03-15 00:00:00

  • The standard error of Cohen's Kappa.

    abstract::This paper gives a standard error for Cohen's Kappa, conditional on the margins of the observed r x r table. An explicit formula is given for the 2 x 2 table, and a procedure for the more general situation. A parsimonious log-linear model is suggested for the general case and an approximate confidence interval for kap...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780100512

    authors: Garner JB

    更新日期:1991-05-01 00:00:00

  • Estimating the cumulative mean function for history process with time-dependent covariates and censoring mechanism.

    abstract::In this paper, an approach to estimating the cumulative mean function for history process with time dependent covariates and right censored time-to-event variable is developed using the combined technique of joint modeling and inverse probability weighting method. The consistency of proposed estimator is derived. Theo...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6998

    authors: Deng D

    更新日期:2016-11-10 00:00:00

  • A multiple imputation strategy for incomplete longitudinal data.

    abstract::Longitudinal studies are commonly used to study processes of change. Because data are collected over time, missing data are pervasive in longitudinal studies, and complete ascertainment of all variables is rare. In this paper a new imputation strategy for completing longitudinal data sets is proposed. The proposed met...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.740

    authors: Landrum MB,Becker MP

    更新日期:2001-09-15 00:00:00

  • Multivariate joint frailty model for the analysis of nonlinear tumor kinetics and dynamic predictions of death.

    abstract::The Response Evaluation Criteria in Solid Tumors are used as standard guidelines for the clinical evaluation of cancer treatments. The assessment is based on the anatomical tumor burden: change in size of target lesions and evolution of nontarget lesions (NTL). Despite unquestionable advantages of this standard tool, ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.7640

    authors: Król A,Tournigand C,Michiels S,Rondeau V

    更新日期:2018-06-15 00:00:00

  • A Bayesian analysis of mixture structural equation models with non-ignorable missing responses and covariates.

    abstract::In behavioral, biomedical, and social-psychological sciences, it is common to encounter latent variables and heterogeneous data. Mixture structural equation models (SEMs) are very useful methods to analyze these kinds of data. Moreover, the presence of missing data, including both missing responses and missing covaria...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3915

    authors: Cai JH,Song XY,Hser YI

    更新日期:2010-08-15 00:00:00

  • Minimum sample size for developing a multivariable prediction model: PART II - binary and time-to-event outcomes.

    abstract::When designing a study to develop a new prediction model with binary or time-to-event outcomes, researchers should ensure their sample size is adequate in terms of the number of participants (n) and outcome events (E) relative to the number of predictor parameters (p) considered for inclusion. We propose that the mini...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.7992

    authors: Riley RD,Snell KI,Ensor J,Burke DL,Harrell FE Jr,Moons KG,Collins GS

    更新日期:2019-03-30 00:00:00

  • Correction of sampling bias in a cross-sectional study of post-surgical complications.

    abstract::Cross-sectional designs are often used to monitor the proportion of infections and other post-surgical complications acquired in hospitals. However, conventional methods for estimating incidence proportions when applied to cross-sectional data may provide estimators that are highly biased, as cross-sectional designs t...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.5608

    authors: Fluss R,Mandel M,Freedman LS,Weiss IS,Zohar AE,Haklai Z,Gordon ES,Simchen E

    更新日期:2013-06-30 00:00:00

  • Constrained S-estimators for linear mixed effects models with covariance components.

    abstract::Linear mixed effects (LME) models are increasingly used for analyses of biological and biomedical data. When the multivariate normal assumption is not adequate for an LME model, then a robust estimation approach is preferable to the maximum likelihood one. M-estimators were considered before for robust estimation of t...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4169

    authors: Chervoneva I,Vishnyakov M

    更新日期:2011-06-30 00:00:00

  • Simple methods for checking for possible errors in reported odds ratios, relative risks and confidence intervals.

    abstract::Meta-analyses of data from epidemiological studies are often based on odds ratios (ORs) or relative risks (RRs) and their 95 per cent confidence intervals (CIs) as reported by the authors. Where possible ORs, RRs and CIs should be checked against the source data. Some simple methods are presented for checking the vali...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(19990815)18:15<1973::aid-s

    authors: Lee PN

    更新日期:1999-08-15 00:00:00

  • Rank-based principal stratum sensitivity analyses.

    abstract::We describe rank-based approaches to assess principal stratification treatment effects in studies where the outcome of interest is only well-defined in a subgroup selected after randomization. Our methods are sensitivity analyses, in that estimands are identified by fixing a parameter and then we investigate the sensi...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.5849

    authors: Lu X,Mehrotra DV,Shepherd BE

    更新日期:2013-11-20 00:00:00

  • Variable selection for proportional odds model.

    abstract::In this paper we study the problem of variable selection for the proportional odds model, which is a useful alternative to the proportional hazards model and might be appropriate when the proportional hazards assumption is not satisfied. We propose to fit the proportional odds model by maximizing the marginal likeliho...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2833

    authors: Lu W,Zhang HH

    更新日期:2007-09-10 00:00:00

  • Measurement error correction for nutritional exposures with correlated measurement error: use of the method of triads in a longitudinal setting.

    abstract::Nutritional exposures are often measured with considerable error in commonly used surrogate instruments such as the food frequency questionnaire (FFQ) (denoted by Q(i) for the ith subject). The error can be both systematic and random. The diet record (DR) denoted by R(i) for the ith subject is considered an alloyed go...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3238

    authors: Rosner B,Michels KB,Chen YH,Day NE

    更新日期:2008-08-15 00:00:00

  • The power to detect differences in average rates of change in longitudinal studies.

    abstract::With considerable current interest in longitudinal epidemiologic studies, little is available regarding sample size requirements. This paper considers a method for analysis of longitudinal data, where one compares the mean rates of change for two or more groups, and proposes a statistic for use in determining sample s...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780090414

    authors: Lefante JJ

    更新日期:1990-04-01 00:00:00