Abstract:
Diagnostic problems in medicine are sometimes polytomous, meaning that the outcome has more than two distinct categories. For example, ovarian tumors can be benign, borderline, primary invasive, or metastatic. Extending the main measure of binary discrimination, the c-statistic or area under the ROC curve, to nominal polytomous settings is not straightforward. This paper reviews existing measures and presents the polytomous discrimination index (PDI) as an alternative. The PDI assesses all sets of k cases consisting of one case from each outcome category. For each category i (i = 1, …, k), it is assessed whether the risk of category i is highest for the case from category i. A score of 1/k is given per category for which this holds, yielding a set score between 0 and 1 to indicate the level of discrimination. The PDI is the average set score and is interpreted as the probability of correctly identifying a case from a randomly selected category within a set of k cases. This probability can be split up by outcome category, yielding k category-specific values that result in the PDI when averaged. We demonstrate the measures on two diagnostic problems (residual mass histology after chemotherapy for testicular cancer; diagnosis of ovarian tumors). We compare the behavior of the measures on theoretical data, showing that the PDI is more strongly influenced by simultaneous discrimination between all categories than by partial discrimination between pairs of categories. In conclusion, the PDI is attractive because it better matches the requirements of a measure to summarize polytomous discrimination.
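The set-based definition in the abstract can be sketched as a brute-force computation. This is a minimal illustration, not the authors' implementation: the function name `pdi`, the input layout (one list of k predicted risks per case, integer labels 0..k-1), and the handling of ties (a tied risk earns no credit) are all assumptions made here, since the abstract does not specify them.

```python
from itertools import product


def pdi(risks, labels, k):
    """Brute-force polytomous discrimination index (illustrative sketch).

    risks  : list of length-k sequences; risks[n][i] is the predicted risk
             of category i for case n (hypothetical input layout).
    labels : list of observed categories, coded 0..k-1.
    Returns (overall PDI, list of k category-specific values).
    Assumption: ties earn no credit (strict comparison).
    """
    # Group the risk vectors by observed outcome category.
    by_cat = [[r for r, y in zip(risks, labels) if y == c] for c in range(k)]
    if any(len(group) == 0 for group in by_cat):
        raise ValueError("each category needs at least one case")

    hits = [0] * k
    n_sets = 0
    # Enumerate every set made of one case from each category.
    for case_set in product(*by_cat):
        n_sets += 1
        for i in range(k):
            own = case_set[i][i]  # risk of category i for the category-i case
            # Credit category i if its own case has the strictly highest
            # risk of category i within the set.
            if all(own > case_set[j][i] for j in range(k) if j != i):
                hits[i] += 1

    per_category = [h / n_sets for h in hits]
    # The overall PDI is the average of the category-specific values.
    return sum(per_category) / k, per_category


if __name__ == "__main__":
    toy_risks = [[0.7, 0.3], [0.4, 0.6], [0.2, 0.8], [0.5, 0.5]]
    toy_labels = [0, 0, 1, 1]
    print(pdi(toy_risks, toy_labels, 2))
```

The enumeration visits the product of the category sizes, so this sketch is only practical for small data sets; it is meant to make the definition concrete rather than to be efficient.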
journal_name:Stat Med
journal_title:Statistics in medicine
authors:Van Calster B, Van Belle V, Vergouwe Y, Timmerman D, Van Huffel S, Steyerberg EW
doi:10.1002/sim.5321
subject:Has Abstract
pub_date:2012-10-15 00:00:00
pages:2610-26
issue:23
eissn:0277-6715
issn:1097-0258
journal_volume:31
pub_type: Journal article
abstract::This paper concerns the statistical analysis of certain binary data arising in molecular studies of cancer. In allelic-loss experiments, tumour cell genomes are analysed at informative molecular marker loci to identify deleted chromosomal regions. The resulting binary data are used to infer properties of putative supp...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/(sici)1097-0258(19980715)17:13<1425::aid-s
update_date:1998-07-15 00:00:00
abstract::When asking 'what is known' about a drug or therapy or program at any time, both researchers and practitioners often confront more than a single study. Facing a variety of findings, where conflicts may outweigh agreement, how can a reviewer constructively approach the task? In this discussion, I will outline some ques...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.4780060304
update_date:1987-04-01 00:00:00
abstract::The change in c-statistic is frequently used to summarize the change in predictive accuracy when a novel risk factor is added to an existing logistic regression model. We explored the relationship between the absolute change in the c-statistic, Brier score, generalized R(2) , and the discrimination slope when a risk f...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.5598
update_date:2013-02-20 00:00:00
abstract::Matched-pair designs have been commonly employed in diagnostic, epidemiologic and laboratory studies. For estimation of a ratio of two marginal probabilities in matched-pair data, a Wald-type logarithmic method is computationally simple, but an actual coverage rate is known to be smaller than a nominal one and a lengt...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.3685
update_date:2009-10-15 00:00:00
abstract::Sensible plans for health-care needs and determination of priorities for expenditure require regular assessment of trends in HIV incidences. In particular, trends in the relative HIV incidences of different risk categories are useful when assessing whether current control strategies are working equally well for all ri...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/(SICI)1097-0258(19960830)15:16<1779::AID-S
update_date:1996-08-30 00:00:00
abstract::Multivariate random length data occur when we observe multiple measurements of a quantitative variable and the variable number of these measurements is also an observed outcome for each experimental unit. For example, for a patient with coronary artery disease, we may observe a number of lesions in that patient's coro...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/(sici)1097-0258(19990130)18:2<199::aid-sim
update_date:1999-01-30 00:00:00
abstract::A mediator acts as a third variable in the causal pathway between a risk factor and an outcome. In this paper, we consider the estimation of the mediation effect when the mediator is a binary variable. We give a precise definition of the mediation effect and examine asymptotic properties of five different estimators o...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.2730
update_date:2007-08-15 00:00:00
abstract::Longitudinal studies with repeated measures are often subject to non-response. Methods currently employed to alleviate the difficulties caused by missing data are typically unsatisfactory, especially when the cause of the missingness is related to the outcomes. We present an approach for incomplete categorical data in...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.982
update_date:2002-01-30 00:00:00
abstract::We consider the problem of identifying subgroups of participants in a clinical trial that have enhanced treatment effect. Recursive partitioning methods that recursively partition the covariate space based on some measure of between groups treatment effect difference are popular for such subgroup identification. The m...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.8214
update_date:2019-09-20 00:00:00
abstract::For each of 211 arteriosclerosis obliterans patients, the degree of stenosis of arteries at four sites was examined at Hiroshima University Hospital to analyse the relationship between the degree of stenosis and age, sex and site. The generalized estimating equations using a proportional odds model for the stenosis p...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.1363
update_date:2003-07-15 00:00:00
abstract:BACKGROUND:It has been recommended that onset of antidepressant action be assessed using survival analyses with assessments taken at least twice per week. However, such an assessment schedule is problematic to implement. The present study assessed the feasibility of comparing onset of action between treatments using a ...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.2309
update_date:2006-07-30 00:00:00
abstract::Girardeau, Ravaud and Donner in 2008 presented a formula for sample size calculations for cluster randomised crossover trials, when the intracluster correlation coefficient, interperiod correlation coefficient and mean cluster size are specified in advance. However, in many randomised trials, the number of clusters is...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.8191
update_date:2019-08-15 00:00:00
abstract::We consider counting process methods for analysing time-to-event data with multiple or recurrent outcomes, using the models developed by Andersen and Gill; Wei, Lin and Weissfeld; and Prentice, Williams and Peterson. We compare the methods, and show how to implement them using popular statistical software programs. By ...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/(sici)1097-0258(19970930)16:18<2029::aid-s
update_date:1997-09-30 00:00:00
abstract::If the sample size for a t-test is calculated on the basis of a prior estimate of the variance then the power of the test at the treatment difference of interest is not robust to misspecification of the variance. We propose a t-test for a two-treatment comparison based on Stein's two-stage test which involves the use ...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/(sici)1097-0258(19990715)18:13<1575::aid-s
update_date:1999-07-15 00:00:00
abstract::Data on alcohol availability and problems in all cities in Los Angeles County were collected from several different sources and linked together to form a Local Alcohol Availability Database (LAAD). The two major purposes of the project are to provide a city-level alcohol availability and alcohol-related problems datab...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.4780140517
update_date:1995-03-15 00:00:00
abstract::We develop reinforcement learning trials for discovering individualized treatment regimens for life-threatening diseases such as cancer. A temporal-difference learning method called Q-learning is utilized that involves learning an optimal policy from a single training set of finite longitudinal patient trajectories. A...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.3720
update_date:2009-11-20 00:00:00
abstract::Algorithms for identifying public health threats or disease outbreaks are vulnerable to false alarms arising from sudden shifts in health-care utilization or data participation. This paper describes a method of reducing false alerts in automated public health surveillance algorithms, and in particular, automated syndr...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.4204
update_date:2011-06-30 00:00:00
abstract::Existing methods for power analysis for longitudinal study designs are limited in that they do not adequately address random missing data patterns. Although the pattern of missing data can be assessed during data analysis, it is unknown during the design phase of a study. The random nature of the missing data pattern ...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.2773
update_date:2007-07-10 00:00:00
abstract::We consider recurrent events of the same type that occur during alternating restraint and non-restraint time periods. This research is motivated by a study on juvenile recidivism, where the probationers were followed for re-offenses during alternating placement periods and free-time periods. During the placement perio...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.7150
update_date:2017-02-20 00:00:00
abstract::The case-control study is a simple and useful method to characterize the effect of a gene, the effect of an exposure, as well as the interaction between the two. The control-free case-only study is an even simpler design, if interest is centered on gene-environment interaction only. It requires the sometimes pl...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.4028
update_date:2010-10-30 00:00:00
abstract::Surrogate endpoint validation has been well established by the meta-analytical correlation-based approach as outlined in the seminal work of Buyse et al. (Biostatistics, 2000). Surrogacy can be assumed if strong associations on individual and study levels can be demonstrated. Alternatively, if an effect on a true endp...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.6778
update_date:2016-03-30 00:00:00
abstract::Screening and diagnostic tests are important in disease prevention or control. The predictive values of positive and negative (PPV and NPV) test results are two of four operational characteristics of a screening test. We review an existing method based on the generalized estimating equation (GEE) methodology for compa...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.2332
update_date:2006-07-15 00:00:00
abstract::We describe how trends in the vaccination coverage at 19 months of age vary by race/ethnicity; explore the extent to which data required to evaluate a child's up-to-date vaccination status is missing as a result of the scattering of vaccination records among many vaccination providers; evaluate how the prevalence of t...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.3223
update_date:2008-09-10 00:00:00
abstract::Selection of dose for cancer patients treated with radiation therapy (RT) must balance the increased efficacy with the increased toxicity associated with higher dose. Historically, a single dose has been selected for a population of patients (e.g., all stage III non-small cell lung cancer). However, the availability o...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.6285
update_date:2014-12-30 00:00:00
abstract::The correct identification of change-points during ongoing outbreak investigations of infectious diseases is a matter of paramount importance in epidemiology, with major implications for the management of health care resources, public health and, as the COVID-19 pandemic has shown, social life. Onsets, peaks, and infl...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.8807
update_date:2021-02-20 00:00:00
abstract::We consider several sources of heterogeneity in a clinical trial with patients' survival time as the main response criterion: differences in prognosis which can be attributed to a latent or ignored prognostic factor; differences in treatment efficacy in subgroups of patients, and differences in treatment combinations ...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.4780060708
update_date:1987-10-01 00:00:00
abstract:BACKGROUND:The need to deliver interventions targeting multiple diseases in a cost-effective manner calls for integrated disease control efforts. Consequently, maps are required that show where the risk of co-infection is particularly high. Co-infection risk is preferably estimated via Bayesian geostatistical multinomi...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.4243
update_date:2011-06-30 00:00:00
abstract::We examine the properties of several tests for goodness-of-fit for multinomial logistic regression. One test is based on a strategy of sorting the observations according to the complement of the estimated probability for the reference outcome category and then grouping the subjects into g equal-sized groups. A g x c c...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.3202
update_date:2008-09-20 00:00:00
abstract::Subjective judgements of complex variables are commonly recorded as ordered categorical data. The rank-invariant properties of such data are well known, and there are various statistical approaches to the analysis and modelling of ordinal data. This paper focuses on the non-additive property of ordered categorical dat...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/(sici)1097-0258(19981230)17:24<2923::aid-s
update_date:1998-12-30 00:00:00
abstract::In recent years, studies of hepatitis B (HBV) and hepatitis C virus (HCV) dynamics have drawn great attention as they provide insight into the process of virus elimination/production and of infected cell decay during antiviral treatment. Estimates of viral dynamic parameters may be used to determine the lifetime ...
journal_title:Statistics in medicine
pub_type: Journal article
doi:10.1002/sim.3457
update_date:2008-12-30 00:00:00