Abstract:
Many models for clinical prediction (prognosis or diagnosis) are published in the medical literature every year but few such models find their way into clinical practice. The reason may be that since in most cases models have not been validated in independent data, they lack generality and/or credibility. In this paper we consider the situation in which several compatible, independent data sets relating to a given disease with a time-to-event endpoint are available for analysis. The aim is to construct and evaluate a single prognostic model. Building a multivariable model from the available prognostic factors is accomplished within the Cox proportional hazards framework, stratifying by study. Non-linear relationships with continuous predictors are modelled by using fractional polynomials. To assess the discrimination or separation of a survival model, we use the D statistic of Royston and Sauerbrei. D may be interpreted as the separation (log hazard ratio) between the survival distributions for two independent prognostic groups. To evaluate the generality of a prognostic model across the data sets, we propose 'internal-external cross-validation' on D: each study is omitted in turn, the model parameters are estimated from the remaining studies and D is evaluated in the omitted study. Because the linear predictor of a survival model tells only part of the story, we also suggest a method for investigating heterogeneity in the baseline distribution function across studies which involves fitting completely specified, flexible parametric survival models (Royston and Parmar). Our final models combine the prognostic index (obtained with stratification by study) with the pooled baseline survival distribution (estimated parametrically). By applying this methodology, we construct two prognostic scores in superficial bladder cancer. The simpler of the two scores is more suited to clinical application. We show that a three-group prognostic classification scheme based on either score produces well-separated survival curves for each of the data sets, despite identifiable heterogeneity among the baseline distribution functions and to a lesser extent among the prognostic indexes for the individual studies.
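The 'internal-external cross-validation' loop described in the abstract (omit each study in turn, estimate the model from the remaining studies, evaluate in the omitted study) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function name `internal_external_cv` and the `fit` and `evaluate` callables are hypothetical, and the toy stand-ins at the bottom (a mean as the "model", mean absolute error as the performance measure) only take the place of the study-stratified Cox model and the D statistic, neither of which is implemented here.

```python
from typing import Callable, Dict, List, Sequence, TypeVar

Row = TypeVar("Row")
Model = TypeVar("Model")


def internal_external_cv(
    studies: Dict[str, Sequence[Row]],
    fit: Callable[[List[Row]], Model],
    evaluate: Callable[[Model, Sequence[Row]], float],
) -> Dict[str, float]:
    """Omit each study in turn, fit on the rest, evaluate on the omitted study."""
    if len(studies) < 2:
        raise ValueError("internal-external cross-validation needs at least two studies")
    results: Dict[str, float] = {}
    for held_out, test_rows in studies.items():
        # Pool the rows of every study except the one being held out.
        train_rows = [
            row
            for name, rows in studies.items()
            if name != held_out
            for row in rows
        ]
        model = fit(train_rows)  # estimate parameters from the remaining studies
        results[held_out] = evaluate(model, test_rows)  # assess in the omitted study
    return results


# Hypothetical toy stand-ins, used only to make the sketch runnable.
def fit_mean(rows: List[float]) -> float:
    return sum(rows) / len(rows)


def mean_abs_error(model: float, rows: Sequence[float]) -> float:
    return sum(abs(r - model) for r in rows) / len(rows)


toy_studies = {"A": [1.0, 2.0, 3.0], "B": [2.0, 4.0], "C": [3.0, 3.0, 3.0]}
scores = internal_external_cv(toy_studies, fit_mean, mean_abs_error)
```

In a real analysis, `fit` would estimate the prognostic model from the pooled training studies and `evaluate` would return D in the omitted study; comparing the per-study values then indicates how well the model generalizes across data sets.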
journal_name:Stat Med
journal_title:Statistics in medicine
authors:Royston P, Parmar MK, Sylvester R
doi:10.1002/sim.1691
subject:Has Abstract
pub_date:2004-03-30 00:00:00
pages:907-26
issue:6
eissn:0277-6715
issn:1097-0258
journal_volume:23
pub_type: Journal Article
abstract::The case-control study is a simple and useful method to characterize the effect of a gene, the effect of an exposure, as well as the interaction between the two. The control-free case-only study is an even simpler design, if interest is centered on gene-environment interaction only. It requires the sometimes pl...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.4028
update_date:2010-10-30 00:00:00
abstract::In the study of multiple failure times for the same subjects, for example, recurrent infections for patients with a given disease, there are often subject effects, that is, subjects have different risks that cannot be explained by known covariates. Standard methods, which ignore subject effects, lead to overestimation...
journal_title:Statistics in medicine
pub_type: Clinical Trial, Journal Article, Randomized Controlled Trial
doi:10.1002/(sici)1097-0258(19980615)17:11<1201::aid-s
update_date:1998-06-15 00:00:00
abstract::The primary purpose of a disease surveillance system is to provide data for the detection of changes in the incidence of the disease. Methods for the analysis of data from surveillance systems are reviewed. A new procedure is proposed for use when the system includes geographically dispersed reporting units, such as h...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.4780080306
update_date:1989-03-01 00:00:00
abstract::Correlation is always a concern in the analysis of clustered data. One area of interest is to develop a general correlation modelling approach for high dimensional data with unbalanced hierarchical and heterogeneous data structures, e.g. multilevel data. Commonly used correlation structures might have limitation for s...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.2368
update_date:2006-07-30 00:00:00
abstract::Small but important therapeutic effects of new treatments can be most efficiently detected through the study of large randomized prospective series of patients. Such large scale clinical trials are nowadays commonplace. The alternative is years of polemic and debate surrounding several trials each too small to detect ...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.4780010105
update_date:1982-01-01 00:00:00
abstract::Frailty models are encountered in many medical applications, yet little research has been devoted to developing measures that quantify the predictive ability of these models. In this paper, we elaborate on the concept of the concordance probability to clustered data, resulting in an 'Overall Conditional C-index' or C(O...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.4058
update_date:2010-12-30 00:00:00
abstract::In phase 3 clinical trials, ethical and financial concerns motivate sequential analyses in which the data are analyzed prior to completion of the entire planned study. Existing group sequential software accounts for the effects of these interim analyses on the sampling density by assuming that the contribution of subs...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.6306
update_date:2014-12-20 00:00:00
abstract::For the meta-analysis of controlled clinical trials or epidemiological studies, in which the responses are at least approximately normally distributed, a refined test for the hypothesis of no overall treatment effect is proposed. The test statistic is based on a direct estimation function for the variance of the overa...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.791
update_date:2001-06-30 00:00:00
abstract::A decision-theoretic framework is proposed for designing sequential dose-finding trials with multiple outcomes. The optimal strategy is solvable theoretically via backward induction. However, for dose-finding studies involving k doses, the computational complexity is the same as the bandit problem with k-dependent arm...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.2322
update_date:2006-05-30 00:00:00
abstract::Funnel plots are widely used to visualize grouped data, for example, in institutional comparison. This paper extends the concept to a multi-level setting, displaying one level at a time, adjusted for the other levels, as well as for covariates at all levels. These level-adjusted funnel plots are based on a Markov chai...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.5677
update_date:2014-09-20 00:00:00
abstract::In many countries, the monitoring of child growth does not occur in a regular manner, and instead, we may have to rely on sporadic observations that are subject to substantial measurement error. In these countries, it can be difficult to identify patterns of poor growth, and faltering children may miss out on essentia...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.7696
update_date:2019-08-30 00:00:00
abstract::Inference for randomized clinical trials is generally based on the assumption that outcomes are independently and identically distributed under the null hypothesis. In some trials, particularly in infectious disease, outcomes may be correlated. This may be known in advance (e.g. allowing randomization of family member...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.2977
update_date:2008-03-15 00:00:00
abstract::Many gene expression data are based on two experiments where the gene expressions of the targeted genes under both experiments are correlated. We consider problems in which objectives are to find genes that are simultaneously upregulated/downregulated under both experiments. A Bayesian methodology is proposed based on...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.6555
update_date:2015-11-10 00:00:00
abstract::Many extensions of survival models based on the Cox proportional hazards approach have been proposed to handle clustered or multiple event data. Of particular note are five Cox-based models for recurrent event data: Andersen and Gill (AG); Wei, Lin and Weissfeld (WLW); Prentice, Williams and Peterson, total time (PWP-...
journal_title:Statistics in medicine
pub_type: Clinical Trial, Journal Article, Randomized Controlled Trial
doi:10.1002/(sici)1097-0258(20000115)19:1<13::aid-sim2
update_date:2000-01-15 00:00:00
abstract::A targeted poll was undertaken to compare and contrast models of data monitoring of randomized clinical trials sponsored by the National Institutes of Health. In an attempt to represent the institutes which conduct clinical trials, twelve individuals were selected and asked to respond to a questionnaire specifically p...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.4780120520
update_date:1993-03-01 00:00:00
abstract::Researchers in clinical science and bioinformatics frequently aim to learn which of a set of candidate biomarkers is important in determining a given outcome, and to rank the contributions of the candidates accordingly. This article introduces a new approach to research questions of this type, based on targeted maximu...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.3414
update_date:2009-01-15 00:00:00
abstract::Although radical changes in drug regulation are rare (e.g., the Federal Food, Drug and Cosmetic Act of 1938 and the 1962 amendment to the Act creating an effectiveness requirement), regulations and guidance do evolve significantly in the face of new problems and accumulating experience. Recent changes have been driven...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.1298
update_date:2002-10-15 00:00:00
abstract::The estimation of attributable risk in the presence of confounding and effect modification is studied in this paper. Different adjustment methods for the attributable risk are reviewed. The results of a simulation study comparing these methods under the unrestricted multinomial sampling model are reported. From the s...
journal_title:Statistics in medicine
pub_type: Journal Article, Review
doi:10.1002/sim.4780111606
update_date:1992-12-01 00:00:00
abstract::The effective dose (ED) is the pharmaceutical dosage required to produce a therapeutic response in a fixed proportion of the patients. When only one drug is considered, the problem is a univariate one and has been well-studied. However, in the multidimensional setting, that is, in the presence of combinations of agent...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.6226
update_date:2014-10-30 00:00:00
abstract::Studies based on aggregated hospital outcome data have established that there is a relationship between nurse staffing and adverse events. However, this result could not be confirmed in Belgium where 96 per cent of the variability of nurse staffing levels over nursing units (belonging to different hospitals) is explai...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.3756
update_date:2010-03-30 00:00:00
abstract::Standard approaches to analysis of randomized controlled trials (RCTs) using Markov models make it difficult to generalize treatment effects to new patient groups and synthesize evidence across trials. This paper demonstrates how pair-wise and mixed treatment comparison meta-analysis can be applied to event history da...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.4059
update_date:2011-01-30 00:00:00
abstract::Statistical methods are available for performing a meta-analysis when the response variable of interest is the same in each study. Problems arise when studies exploring a common therapeutic question use different patient response types. This article presents statistical methods for combining studies which involve diff...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.4780132313
update_date:1994-12-15 00:00:00
abstract::Recent studies have indicated variation in the infectivity beta of HIV among heterosexual couples. We represent this heterogeneity by modelling beta as a random variable. Using data on the number of contacts and seroconversion of couples, we fit the model by maximum-likelihood estimation with a beta distribution and a...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.4780080110
update_date:1989-01-01 00:00:00
abstract::Cross-sectional designs are often used to monitor the proportion of infections and other post-surgical complications acquired in hospitals. However, conventional methods for estimating incidence proportions when applied to cross-sectional data may provide estimators that are highly biased, as cross-sectional designs t...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.5608
update_date:2013-06-30 00:00:00
abstract::The statistical analysis of spatially correlated data has become an important scientific research topic lately. The analysis of the mortality or morbidity rates observed at different areas may help to decide if people living in certain locations are considered at higher risk than others. Once the statistical model for...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/1097-0258(20000730)19:14<1915::aid-sim503>
update_date:2000-07-30 00:00:00
abstract::Most phase I dose-finding methods in oncology aim to find the maximum-tolerated dose from a set of prespecified doses. However, in practice, because of a lack of understanding of the true dose-toxicity relationship, it is likely that none of these prespecified doses are equal or reasonably close to the true maximum-to...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.6933
update_date:2016-09-10 00:00:00
abstract::The original article to which this Correction refers was published in Statistics in Medicine 2000 19(14): 1901-1914. ...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/1097-0258(20001115)19:21<3017::aid-sim785>
update_date:2000-11-15 00:00:00
abstract::Health authorities are often alerted to suspected cancer clusters near the vicinity of potential point sources by members of the public. A surveillance system, where administrative regions around the potential point sources are regularly monitored for high disease rates, would allow for responses which are easier to o...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/(sici)1097-0258(19960415)15:7/9<727::aid-s
update_date:1996-04-15 00:00:00
abstract::This paper reviews methods for mapping geographical variation in disease incidence and mortality. Recent results in Bayesian hierarchical modelling of relative risk are discussed. Two approaches to relative risk estimation, along with the related computational procedures, are described and compared. The first is an em...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.4780110802
update_date:1992-06-15 00:00:00
abstract::We suggest measures to quantify the degrees of necessity and of sufficiency of prognostic factors for dichotomous and for survival outcomes. A cause, represented by certain values of prognostic factors, is considered necessary for an event if, without the cause, the event cannot develop. It is considered sufficient fo...
journal_title:Statistics in medicine
pub_type: Journal Article
doi:10.1002/sim.8331
update_date:2019-10-15 00:00:00