Abstract:
:A review is given of different ways of estimating the error rate of a prediction rule based on a statistical model. A distinction is drawn between apparent, optimum and actual error rates. Moreover it is shown how cross-validation can be used to obtain an adjusted predictor with smaller error rate. A detailed discussion is given for ordinary least squares, logistic regression and Cox regression in survival analysis. Finally, the splitsample approach is discussed and demonstrated on two data sets.
journal_name
Stat Medjournal_title
Statistics in medicineauthors
Van Houwelingen JC,Le Cessie Sdoi
10.1002/sim.4780091109subject
Has Abstractpub_date
1990-11-01 00:00:00pages
1303-25issue
11eissn
0277-6715issn
1097-0258journal_volume
9pub_type
杂志文章abstract::In this paper, we consider pooling schemes in which samples are to be tested in two-stages. We show that when batch size is limited as well as pool size, selection schemes tend to be more efficient and flexible. Formulae for the efficiencies of square arrays in all dimensions and for all selection schemes are given in...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3965
更新日期:2010-09-20 00:00:00
abstract::Assessment of equivalence or non-inferiority in accuracy between two diagnostic procedures often involves comparisons of paired areas under the receiver operating characteristic (ROC) curves. With some pre-specified clinically meaningful limits, the current approach to evaluating equivalence is to perform the two one-...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.2358
更新日期:2006-04-15 00:00:00
abstract::Assessing the QT prolongation potential of a drug is typically done based on pivotal safety studies called thorough QT studies. Model-based estimation of the drug-induced QT prolongation at the estimated mean maximum drug concentration could increase efficiency over the currently used intersection-union test. However,...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.7395
更新日期:2017-10-30 00:00:00
abstract::For massive survival data, we propose a subsampling algorithm to efficiently approximate the estimates of regression parameters in the additive hazards model. We establish consistency and asymptotic normality of the subsample-based estimator given the full data. The optimal subsampling probabilities are obtained via m...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.8783
更新日期:2021-01-30 00:00:00
abstract::We propose a new weighted hurdle regression method for modeling count data, with particular interest in modeling cardiovascular events in patients on dialysis. Cardiovascular disease remains one of the leading causes of hospitalization and death in this population. Our aim is to jointly model the relationship/associat...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.6232
更新日期:2014-11-10 00:00:00
abstract::In this paper, we argue that causal effect models for realistic individualized treatment rules represent an attractive tool for analyzing sequentially randomized trials. Unlike a number of methods proposed previously, this approach does not rely on the assumption that intermediate outcomes are discrete or that models ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3268
更新日期:2008-08-30 00:00:00
abstract::I discuss alternatives to the one compartment model, delta Yt = alpha + beta exp(- gamma t). Instead of comparing the one and two compartment models, I derive statistics for testing mixtures of the parameters (beta, gamma) in the one compartment model. I apply the proposed methods to the problem of hydrogen clearance ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780080811
更新日期:1989-08-01 00:00:00
abstract::In clinical research, we are often interested in assessing how a biomarker changes with time, and whether it could be used as a surrogate marker when evaluating the efficacy of a new drug. However, when the longitudinal marker is correlated with survival, linear mixed models for longitudinal data may be inappropriate....
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3142
更新日期:2007-12-30 00:00:00
abstract::A calibration line is used to define the relationship between a new clinical technique and a standard in vitro laboratory methodology. Discrimination intervals quantify the reliability of inverse estimates obtained from the calibration line. Applied to transcutaneous PCO2 monitoring, a new in vivo measurement, discrim...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780050407
更新日期:1986-07-01 00:00:00
abstract::A few large multi-centre male-only heart trials done in the 1970s and 1980s have been seen as ill-conceived because they did not include females. The purpose here is to revisit two of those trials and to consider consequences in terms of cost and power had they been designed to include females. ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/(sici)1097-0258(19990215)18:3<241::aid-sim
更新日期:1999-02-15 00:00:00
abstract::The problem for assessing biosimilarity and drug interchangeability of follow-on biologics (biosimilar products) is studied. Unlike the generic products, the development of biosimilar products is much more complicated because of fundamental differences in functional structures and manufacturing processes. As a result,...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.5571
更新日期:2013-02-10 00:00:00
abstract::Modelling disease clustering over space and time can be helpful in providing indications of possible exposures and planning corresponding public health practices. Though a considerable number of studies focus on modelling spatio-temporal patterns of disease, most of them do not directly model a spatio-temporal cluster...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.2424
更新日期:2006-03-15 00:00:00
abstract::Some clinical trials aim to demonstrate therapeutic equivalence on multiple primary endpoints. For example, therapeutic equivalence studies of agents for the treatment of osteoarthritis use several primary endpoints including investigator's global assessment of disease activity, patient's global assessment of response...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.985
更新日期:2001-11-15 00:00:00
abstract::Maps of estimated disease rates over multiple time periods are useful tools for gaining etiologic insights regarding potential exposures associated with specific locations and times. In this paper, we describe an extension of the Gangnon-Clayton model for spatial clustering to spatio-temporal data. As in the purely sp...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3984
更新日期:2010-09-30 00:00:00
abstract::The 'landmark' and 'Simon and Makuch' non-parametric estimators of the survival function are commonly used to contrast the survival experience of time-dependent treatment groups in applications such as stem cell transplant versus chemotherapy in leukemia. However, the theoretical survival functions corresponding to th...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.6765
更新日期:2016-03-30 00:00:00
abstract::Correlated response data arise often in biomedical studies. The generalized estimation equation (GEE) approach is widely used in regression analysis for such data. However, there are few methods available to check the adequacy of regression models in GEE. In this paper, a graphical method is proposed based on Cook and...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.889
更新日期:2001-10-15 00:00:00
abstract::The application of Bayesian hierarchical models to measure spatial effects in time to event data has not been widely reported. This case study aims to estimate the effect of area of residence on waiting times to coronary artery bypass graft (CABG) and to assess the role of important individual specific covariates (age...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.1535
更新日期:2003-09-30 00:00:00
abstract::Graphical methods are often used to check goodness-of-fit of models to data. It is common to plot residuals against a reference distribution so that when the model fits the data, the configuration should be close to a straight line. Since the resemblance to a straight line is often unclear, it has been suggested to ad...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780141607
更新日期:1995-08-30 00:00:00
abstract::We propose a new, less costly, design to test the equivalence of digital versus analogue mammography in terms of sensitivity and specificity. Because breast cancer is a rare event among asymptomatic women, the sample size for testing equivalence of sensitivity is larger than that for testing equivalence of specificity...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/(sici)1097-0258(19981015)17:19<2219::aid-s
更新日期:1998-10-15 00:00:00
abstract::Frailty models are encountered in many medical applications, yet little research has been devoted to develop measures that quantify the predictive ability of these models. In this paper, we elaborate on the concept of the concordance probability to clustered data, resulting in an 'Overall Conditional C-index' or bfC(O...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4058
更新日期:2010-12-30 00:00:00
abstract::Observational studies provide a rich source of information for assessing effectiveness of treatment interventions in many situations where it is not ethical or practical to perform randomized controlled trials. However, such studies are prone to bias from hidden (unmeasured) confounding. A promising approach to identi...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.7051
更新日期:2016-12-10 00:00:00
abstract::For bivariate meta-analysis of diagnostic studies, likelihood approaches are very popular. However, they often run into numerical problems with possible non-convergence. In addition, the construction of confidence intervals is controversial. Bayesian methods based on Markov chain Monte Carlo (MCMC) sampling could be u...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3858
更新日期:2010-05-30 00:00:00
abstract::In vitro fertilization (IVF) is an increasingly common method of assisted reproductive technology. Because of the careful observation and follow-up required as part of the procedure, IVF studies provide an ideal opportunity to identify and assess clinical and demographic factors along with environmental exposures that...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.6050
更新日期:2014-05-10 00:00:00
abstract::This paper introduces a dynamic clustering methodology based on multi-valued descriptors of dermoscopic images. The main idea is to support medical diagnosis to decide if pigmented skin lesions belonging to an uncertain set are nearer to malignant melanoma or to benign nevi. Melanoma is the most deadly skin cancer, an...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4285
更新日期:2011-09-10 00:00:00
abstract::We present methods for binomial regression when the outcome is determined using the results of a single diagnostic test with imperfect sensitivity and specificity. We present our model, illustrate it with the analysis of real data, and provide an example of WinBUGS program code for performing such an analysis. Conditi...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.1656
更新日期:2004-04-15 00:00:00
abstract::In many biomedical and epidemiological studies, data are often clustered due to longitudinal follow up or repeated sampling. While in some clustered data the cluster size is pre-determined, in others it may be correlated with the outcome of subunits, resulting in informative cluster size. When the cluster size is info...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4239
更新日期:2011-07-10 00:00:00
abstract::We develop analysis methods for clinical trials with time-to-event outcomes which correct for treatment changes during follow-up, yet are based on comparisons of randomized groups and not of selected groups. A causal model relating observed event times to event times that would have been observed under other treatment...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/(sici)1097-0258(19991015)18:19<2617::aid-s
更新日期:1999-10-15 00:00:00
abstract::The problem of testing symmetry about zero has a long and rich history in the statistical literature. We introduce a new test that sequentially discards observations whose absolute value is below increasing thresholds defined by the data. McNemar's statistic is obtained at each threshold and the largest is used as the...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.5384
更新日期:2012-11-20 00:00:00
abstract::In drug-drug interaction (DDI) research, a two drug interaction is usually predicted by individual drug pharmacokinetics (PK). Although subject-specific drug concentration data from clinical PK studies on inhibitor/inducer or substrate's PK are not usually published, sample mean plasma drug concentrations and their st...
journal_title:Statistics in medicine
pub_type: 杂志文章,meta分析
doi:10.1002/sim.2837
更新日期:2007-09-10 00:00:00
abstract::Prognostic models are used in medicine for investigating patient outcome in relation to patient and disease characteristics. Such models do not always work well in practice, so it is widely recommended that they need to be validated. The idea of validating a prognostic model is generally taken to mean establishing tha...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/(sici)1097-0258(20000229)19:4<453::aid-sim
更新日期:2000-02-29 00:00:00