Construction and validation of a prognostic model across several studies, with an application in superficial bladder cancer.

Abstract:

:Many models for clinical prediction (prognosis or diagnosis) are published in the medical literature every year but few such models find their way into clinical practice. The reason may be that since in most cases models have not been validated in independent data, they lack generality and/or credibility. In this paper we consider the situation in which several compatible, independent data sets relating to a given disease with a time-to-event endpoint are available for analysis. The aim is to construct and evaluate a single prognostic model. Building a multivariable model from the available prognostic factors is accomplished within the Cox proportional hazards framework, stratifying by study. Non-linear relationships with continuous predictors are modelled by using fractional polynomials. To assess the discrimination or separation of a survival model, we use the D statistic of Royston and Sauerbrei. D may be interpreted as the separation (log hazard ratio) between the survival distributions for two independent prognostic groups. To evaluate the generality of a prognostic model across the data sets, we propose 'internal-external cross-validation' on D: each study is omitted in turn, the model parameters are estimated from the remaining studies and D is evaluated in the omitted study. Because the linear predictor of a survival model tells only part of the story, we also suggest a method for investigating heterogeneity in the baseline distribution function across studies which involves fitting completely specified, flexible parametric survival models (Royston and Parmar). Our final models combine the prognostic index (obtained with stratification by study) with the pooled baseline survival distribution (estimated parametrically). By applying this methodology, we construct two prognostic scores in superficial bladder cancer. The simpler of the two scores is more suited to clinical application. We show that a three-group prognostic classification scheme based on either score produces well-separated survival curves for each of the data sets, despite identifiable heterogeneity among the baseline distribution functions and to a lesser extent among the prognostic indexes for the individual studies.

journal_name

Stat Med

journal_title

Statistics in medicine

authors

Royston P,Parmar MK,Sylvester R

doi

10.1002/sim.1691

subject

Has Abstract

pub_date

2004-03-30 00:00:00

pages

907-26

issue

6

eissn

0277-6715

issn

1097-0258

journal_volume

23

pub_type

杂志文章
  • An easy-to-implement approach for analyzing case-control and case-only studies assuming gene-environment independence and Hardy-Weinberg equilibrium.

    abstract::The case-control study is a simple and an useful method to characterize the effect of a gene, the effect of an exposure, as well as the interaction between the two. The control-free case-only study is yet an even simpler design, if interest is centered on gene-environment interaction only. It requires the sometimes pl...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4028

    authors: Lee WC,Wang LY,Cheng KF

    更新日期:2010-10-30 00:00:00

  • ML and REML estimation in survival analysis with time dependent correlated frailty.

    abstract::In the study of multiple failure times for the same subjects, for example, recurrent infections for patients with a given disease, there are often subject effects, that is, subjects have different risks that cannot be explained by known covariates. Standard methods, which ignore subject effects, lead to overestimation...

    journal_title:Statistics in medicine

    pub_type: 临床试验,杂志文章,随机对照试验

    doi:10.1002/(sici)1097-0258(19980615)17:11<1201::aid-s

    authors: Yau KK,McGilchrist CA

    更新日期:1998-06-15 00:00:00

  • An analysis of disease surveillance data that uses the geographic locations of the reporting units.

    abstract::The primary purpose of a disease surveillance system is to provide data for the detection of changes in the incidence of the disease. Methods for the analysis of data from surveillance systems are reviewed. A new procedure is proposed for use when the system includes geographically dispersed reporting units, such as h...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780080306

    authors: Raubertas RF

    更新日期:1989-03-01 00:00:00

  • Structured correlation in models for clustered data.

    abstract::Correlation is always a concern in the analysis of clustered data. One area of interest is to develop a general correlation modelling approach for high dimensional data with unbalanced hierarchical and heterogeneous data structures, e.g. multilevel data. Commonly used correlation structures might have limitation for s...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2368

    authors: Chao EC

    更新日期:2006-07-30 00:00:00

  • On choosing the number of interim analyses in clinical trials.

    abstract::Small but important therapeutic effects of new treatments can be most efficiently detected through the study of large randomized prospective series of patients. Such large scale clinical trials are nowadays commonplace. The alternative is years of polemic and debate surrounding several trials each too small to detect ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780010105

    authors: McPherson K

    更新日期:1982-01-01 00:00:00

  • An application of Harrell's C-index to PH frailty models.

    abstract::Frailty models are encountered in many medical applications, yet little research has been devoted to develop measures that quantify the predictive ability of these models. In this paper, we elaborate on the concept of the concordance probability to clustered data, resulting in an 'Overall Conditional C-index' or bfC(O...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4058

    authors: Van Oirbeek R,Lesaffre E

    更新日期:2010-12-30 00:00:00

  • Violations of the independent increment assumption when using generalized estimating equation in longitudinal group sequential trials.

    abstract::In phase 3 clinical trials, ethical and financial concerns motivate sequential analyses in which the data are analyzed prior to completion of the entire planned study. Existing group sequential software accounts for the effects of these interim analyses on the sampling density by assuming that the contribution of subs...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6306

    authors: Shoben AB,Emerson SS

    更新日期:2014-12-20 00:00:00

  • On tests of the overall treatment effect in meta-analysis with normally distributed responses.

    abstract::For the meta-analysis of controlled clinical trials or epidemiological studies, in which the responses are at least approximately normally distributed, a refined test for the hypothesis of no overall treatment effect is proposed. The test statistic is based on a direct estimation function for the variance of the overa...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.791

    authors: Hartung J,Knapp G

    更新日期:2001-06-30 00:00:00

  • Decision-theoretic designs for dose-finding clinical trials with multiple outcomes.

    abstract::A decision-theoretic framework is proposed for designing sequential dose-finding trials with multiple outcomes. The optimal strategy is solvable theoretically via backward induction. However, for dose-finding studies involving k doses, the computational complexity is the same as the bandit problem with k-dependent arm...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2322

    authors: Fan SK,Wang YG

    更新日期:2006-05-30 00:00:00

  • Level-adjusted funnel plots based on predicted marginal expectations: an application to prophylactic antibiotics in gallstone surgery.

    abstract::Funnel plots are widely used to visualize grouped data, for example, in institutional comparison. This paper extends the concept to a multi-level setting, displaying one level at a time, adjusted for the other levels, as well as for covariates at all levels. These level-adjusted funnel plots are based on a Markov chai...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.5677

    authors: Lindhagen L,Darkahi B,Sandblom G,Berglund L

    更新日期:2014-09-20 00:00:00

  • Using data from multiple studies to develop a child growth correlation matrix.

    abstract::In many countries, the monitoring of child growth does not occur in a regular manner, and instead, we may have to rely on sporadic observations that are subject to substantial measurement error. In these countries, it can be difficult to identify patterns of poor growth, and faltering children may miss out on essentia...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.7696

    authors: Anderson C,Xiao L,Checkley W

    更新日期:2019-08-30 00:00:00

  • Cluster without fluster: The effect of correlated outcomes on inference in randomized clinical trials.

    abstract::Inference for randomized clinical trials is generally based on the assumption that outcomes are independently and identically distributed under the null hypothesis. In some trials, particularly in infectious disease, outcomes may be correlated. This may be known in advance (e.g. allowing randomization of family member...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2977

    authors: Proschan M,Follmann D

    更新日期:2008-03-15 00:00:00

  • A Bayesian methodology for detecting targeted genes under two related experiments.

    abstract::Many gene expression data are based on two experiments where the gene expressions of the targeted genes under both experiments are correlated. We consider problems in which objectives are to find genes that are simultaneously upregulated/downregulated under both experiments. A Bayesian methodology is proposed based on...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6555

    authors: Bansal NK,Jiang H,Pradeep P

    更新日期:2015-11-10 00:00:00

  • Survival analysis for recurrent event data: an application to childhood infectious diseases.

    abstract::Many extensions of survival models based on the Cox proportional hazards approach have been proposed to handle clustered or multiple event data. Of particular note are five Cox-based models for recurrent event data: Andersen and Gill (AG); Wei, Lin and Weissfeld (WLW); Prentice, Williams and Peterson, total time (PWP-...

    journal_title:Statistics in medicine

    pub_type: 临床试验,杂志文章,随机对照试验

    doi:10.1002/(sici)1097-0258(20000115)19:1<13::aid-sim2

    authors: Kelly PJ,Lim LL

    更新日期:2000-01-15 00:00:00

  • Practical issues in data monitoring of clinical trials: summary of responses to a questionnaire at NIH.

    abstract::A targeted poll was undertaken to compare and contrast models of data monitoring of randomized clinical trials sponsored by the National Institutes of Health. In an attempt to represent the institutes which conduct clinical trials, twelve individuals were selected and asked to respond to a questionnaire specifically p...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780120520

    authors: Geller NL,Stylianou M

    更新日期:1993-03-01 00:00:00

  • Biomarker discovery using targeted maximum-likelihood estimation: application to the treatment of antiretroviral-resistant HIV infection.

    abstract::Researchers in clinical science and bioinformatics frequently aim to learn which of a set of candidate biomarkers is important in determining a given outcome, and to rank the contributions of the candidates accordingly. This article introduces a new approach to research questions of this type, based on targeted maximu...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3414

    authors: Bembom O,Petersen ML,Rhee SY,Fessel WJ,Sinisi SE,Shafer RW,van der Laan MJ

    更新日期:2009-01-15 00:00:00

  • Policy developments in regulatory approval.

    abstract::Although radical changes in drug regulation are rare (e.g., the Federal Food, Drug and Cosmetic Act of 1938 and the 1962 amendment to the Act creating an effectiveness requirement), regulations and guidance do evolve significantly in the face of new problems and accumulating experience. Recent changes have been driven...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1298

    authors: Temple R

    更新日期:2002-10-15 00:00:00

  • Comparison of adjusted attributable risk estimators.

    abstract::The estimation of attributable risk in the presence of confounding and effect modification is studied in this paper. Different adjustment methods for the attributable risk are reviewed. The results of a stimulation study comparing these methods under the unrestricted multinomial sampling model are reported. From the s...

    journal_title:Statistics in medicine

    pub_type: 杂志文章,评审

    doi:10.1002/sim.4780111606

    authors: Gefeller O

    更新日期:1992-12-01 00:00:00

  • A random set approach to confidence regions with applications to the effective dose with combinations of agents.

    abstract::The effective dose (ED) is the pharmaceutical dosage required to produce a therapeutic response in a fixed proportion of the patients. When only one drug is considered, the problem is a univariate one and has been well-studied. However, in the multidimensional setting, that is, in the presence of combinations of agent...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6226

    authors: Jankowski H,Ji X,Stanberry L

    更新日期:2014-10-30 00:00:00

  • Establishing the relationship between nurse staffing and hospital mortality using a clustered discrete-time logistic model.

    abstract::Studies based on aggregated hospital outcome data have established that there is a relationship between nurse staffing and adverse events. However, this result could not be confirmed in Belgium where 96 per cent of the variability of nurse staffing levels over nursing units (belonging to different hospitals) is explai...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3756

    authors: Diya L,Lesaffre E,Van den Heede K,Sermeus W,Vleugels A

    更新日期:2010-03-30 00:00:00

  • Parameterization of treatment effects for meta-analysis in multi-state Markov models.

    abstract::Standard approaches to analysis of randomized controlled trials (RCTs) using Markov models make it difficult to generalize treatment effects to new patient groups and synthesize evidence across trials. This paper demonstrates how pair-wise and mixed treatment comparison meta-analysis can be applied to event history da...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4059

    authors: Price MJ,Welton NJ,Ades AE

    更新日期:2011-01-30 00:00:00

  • A meta-analysis of clinical trials involving different classifications of response into ordered categories.

    abstract::Statistical methods are available for performing a meta-analysis when the response variable of interest is the same in each study. Problems arise when studies exploring a common therapeutic question use different patient response types. This article presents statistical methods for combining studies which involve diff...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780132313

    authors: Whitehead A,Jones NM

    更新日期:1994-12-15 00:00:00

  • Heterogeneity in the probability of HIV transmission per sexual contact: the case of male-to-female transmission in penile-vaginal intercourse.

    abstract::Recent studies have indicated variation in the infectivity beta of HIV among heterosexual couples. We represent this heterogeneity by modelling beta as a random variable. Using data on the number of contacts and seroconversion of couples, we fit the model by maximum-likelihood estimation with a beta distribution and a...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780080110

    authors: Wiley JA,Herschkorn SJ,Padian NS

    更新日期:1989-01-01 00:00:00

  • Correction of sampling bias in a cross-sectional study of post-surgical complications.

    abstract::Cross-sectional designs are often used to monitor the proportion of infections and other post-surgical complications acquired in hospitals. However, conventional methods for estimating incidence proportions when applied to cross-sectional data may provide estimators that are highly biased, as cross-sectional designs t...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.5608

    authors: Fluss R,Mandel M,Freedman LS,Weiss IS,Zohar AE,Haklai Z,Gordon ES,Simchen E

    更新日期:2013-06-30 00:00:00

  • Comparing the performance of two indices for spatial model selection: application to two mortality data.

    abstract::The statistical analysis of spatially correlated data has become an important scientific research topic lately. The analysis of the mortality or morbidity rates observed at different areas may help to decide if people living in certain locations are considered at higher risk than others. Once the statistical model for...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/1097-0258(20000730)19:14<1915::aid-sim503>

    authors: Hsiao CK,Tzeng JY,Wang CH

    更新日期:2000-07-30 00:00:00

  • Adaptive dose modification for phase I clinical trials.

    abstract::Most phase I dose-finding methods in oncology aim to find the maximum-tolerated dose from a set of prespecified doses. However, in practice, because of a lack of understanding of the true dose-toxicity relationship, it is likely that none of these prespecified doses are equal or reasonably close to the true maximum-to...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.6933

    authors: Chu Y,Pan H,Yuan Y

    更新日期:2016-09-10 00:00:00

  • M. D. deB. Edwardes, 'The generalization of the odds ratio, risk ratio and risk difference to r x k tables'. Statistics in Medicine 2000; 19(14): 1901-1914.

    abstract::The original article to which this Correction refers was published in Statistics in Medicine 2000 19(14): 1901-1914. ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/1097-0258(20001115)19:21<3017::aid-sim785>

    authors: Edwardes MD

    更新日期:2000-11-15 00:00:00

  • Surveillance of clustering near point sources.

    abstract::Health authorities are often alerted to suspected cancer clusters near the vicinity of potential point sources by members of the public. A surveillance system, where administrative regions around the potential point sources are regularly monitored for high disease rates, would allow for responses which are easier to o...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(19960415)15:7/9<727::aid-s

    authors: Le ND,Petkau AJ,Rosychuk R

    更新日期:1996-04-15 00:00:00

  • Empirical Bayes versus fully Bayesian analysis of geographical variation in disease risk.

    abstract::This paper reviews methods for mapping geographical variation in disease incidence and mortality. Recent results in Bayesian hierarchical modelling of relative risk are discussed. Two approaches to relative risk estimation, along with the related computational procedures, are described and compared. The first is an em...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780110802

    authors: Bernardinelli L,Montomoli C

    更新日期:1992-06-15 00:00:00

  • Quantifying degrees of necessity and of sufficiency in cause-effect relationships with dichotomous and survival outcomes.

    abstract::We suggest measures to quantify the degrees of necessity and of sufficiency of prognostic factors for dichotomous and for survival outcomes. A cause, represented by certain values of prognostic factors, is considered necessary for an event if, without the cause, the event cannot develop. It is considered sufficient fo...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8331

    authors: Gleiss A,Schemper M

    更新日期:2019-10-15 00:00:00