Abstract:
:The receiver operating characteristic (ROC) curve can be utilized to evaluate the performance of diagnostic tests. The area under the ROC curve (AUC) is a widely used summary index for comparing multiple ROC curves. Both parametric and nonparametric methods have been developed to estimate and compare the AUCs. However, these methods are usually only applicable to data collected from simple random samples and not surveys and epidemiologic studies that use complex sample designs such as stratified and/or multistage cluster sampling with sample weighting. Such complex samples can inflate variances from intra-cluster correlation and alter the expectations of test statistics because of the use of sample weights that account for differential sampling rates. In this paper, we modify the nonparametric method to incorporate sampling weights to estimate the AUC and employ leaving-one-out jackknife methods along with the balanced repeated replication method to account for the effects of the complex sampling in the variance estimation of our proposed estimators of the AUC. The finite sample properties of our methods are evaluated using simulations, and our methods are illustrated by comparing the estimated AUC for predicting overweight/obesity using different measures of body weight and adiposity among sampled children and adults in the US Hispanic Health and Nutrition Examination Survey.
journal_name
Stat Medjournal_title
Statistics in medicineauthors
Yao W,Li Z,Graubard BIdoi
10.1002/sim.6405subject
Has Abstractpub_date
2015-04-15 00:00:00pages
1293-303issue
8eissn
0277-6715issn
1097-0258journal_volume
34pub_type
杂志文章abstract::There is considerable public concern about health disparities among different cultural/racial/ethnic groups. Important process measures that might reflect inequities are treated prevalence and the service utilization rate in a defined period of time. We have previously described a method for estimating N, the distinct...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3904
更新日期:2010-07-20 00:00:00
abstract::While genome-wide association studies (GWASs) have been widely used to uncover associations between diseases and genetic variants, standard SNP-level GWASs often lack the power to identify SNPs that individually have a moderate effect size but jointly contribute to the disease. To overcome this problem, pathway-based ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.8442
更新日期:2020-03-15 00:00:00
abstract::In applications such as medical statistics and genetics, we encounter situations where a large number of highly correlated predictors explain a response. For example, the response may be a disease indicator and the predictors may be treatment indicators or single nucleotide polymorphisms (SNPs). Constructing a good pr...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4014
更新日期:2010-10-30 00:00:00
abstract::Estimation and group comparison of survival curves are two very common issues in survival analysis. In practice, the Kaplan-Meier estimates of survival functions may be biased due to unbalanced distribution of confounders. Here we develop an adjusted Kaplan-Meier estimator (AKME) to reduce confounding effects using in...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.2174
更新日期:2005-10-30 00:00:00
abstract::In Part I we presented a covariance structure model for analysing measurement error in the assessment of nitrogen intake. In this paper we include data on urine nitrogen excretion which allows a critical assessment of the model proposed. Inclusion of urine nitrogen data produces more pessimistic estimates of the quali...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780121005
更新日期:1993-05-30 00:00:00
abstract::In medical and health studies, heterogeneities in clustered count data have been traditionally modeled by positive random effects in Poisson mixed models; however, excessive zeros often occur in clustered medical and health count data. In this paper, we consider a three-level random effects zero-inflated Poisson model...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3619
更新日期:2009-08-15 00:00:00
abstract::In a medical study we are often interested in graphically displaying the relationship between continuous variables and clinical events indicating disease progression. Often, it is reasonable to make the minimal assumption that the risk of progression is an arbitrary monotone function of the continuous variable. Someti...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.1561
更新日期:2003-10-30 00:00:00
abstract::Patients who switch treatment groups in randomized clinical trials can cause problems in the interpretation of the results. Although the intention-to-treat method is recognized as being the most reliable analysis, it may result in an underestimate of the treatment effect if there have been patients who switch treatmen...
journal_title:Statistics in medicine
pub_type: 临床试验,杂志文章,随机对照试验
doi:10.1002/(SICI)1097-0258(19961015)15:19<2069::AID-S
更新日期:1996-10-15 00:00:00
abstract::This paper discusses the various philosophies that influence the selection of patients for entry into randomized controlled trials. Although a number of different and often competing issues have to be considered depending upon the trial, keeping entry criteria simple, wide and at times even flexible is usually prefera...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780090114
更新日期:1990-01-01 00:00:00
abstract::In contrast to other models for ordinal data, the continuation ratio model can be fitted with standard statistical software. This makes it particularly appropriate for large clinical trials with ordinal response variables. In addition, when the trials are longitudinal, this model can be applied to individual responses...
journal_title:Statistics in medicine
pub_type: 临床试验,杂志文章,随机对照试验
doi:10.1002/(sici)1097-0258(19971230)16:24<2873::aid-s
更新日期:1997-12-30 00:00:00
abstract::A new method is proposed for the surveillance of Down's syndrome among newborn. Despite the strong dependence of overall risk of Down's syndrome on maternal age, it has been suggested that an environmentally induced increase in risk may be additive over all maternal ages. The surveillance method introduced here is spe...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780120104
更新日期:1993-01-15 00:00:00
abstract::Linear mixed effects (LME) models are increasingly used for analyses of biological and biomedical data. When the multivariate normal assumption is not adequate for an LME model, then a robust estimation approach is preferable to the maximum likelihood one. M-estimators were considered before for robust estimation of t...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4169
更新日期:2011-06-30 00:00:00
abstract::Many different methods have been proposed for the analysis of cluster randomized trials (CRTs) over the last 30 years. However, the evaluation of methods on overdispersed count data has been based mostly on the comparison of results using empiric data; i.e. when the true model parameters are not known. In this study, ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3681
更新日期:2009-10-30 00:00:00
abstract::There is a single-minded focus on events in survival analysis, and we often ignore longitudinal data that are collected together with the event data. This is due to a lack of methodology but also a result of the artificial distinction between survival and longitudinal data analyses. Understanding the dynamics of such ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.5324
更新日期:2012-08-15 00:00:00
abstract::In analysis of longitudinal data, the variance matrix of the parameter estimates is usually estimated by the 'sandwich' method, in which the variance for each subject is estimated by its residual products. We propose smooth bootstrap methods by perturbing the estimating functions to obtain 'bootstrapped' realizations ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3027
更新日期:2008-03-30 00:00:00
abstract::In this paper, we develop methods to combine multiple biomarker trajectories into a composite diagnostic marker using functional data analysis (FDA) to achieve better diagnostic accuracy in monitoring disease recurrence in the setting of a prospective cohort study. In such studies, the disease status is usually verifi...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.8079
更新日期:2019-05-20 00:00:00
abstract::Bioequivalence assessment is an issue of great interest. Development of statistical methods for assessing bioequivalence is an important area of research for statisticians. Bioequivalence is usually determined based on the normal distribution. We relax this assumption and develop a semi-parametric mixed model for bioe...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.2620
更新日期:2007-03-15 00:00:00
abstract::Medical costs data with administratively censored observations often arise in cost-effectiveness studies of treatments for life-threatening diseases. Mean of medical costs incurred from the start of a treatment until death or a certain time point after the implementation of treatment is frequently of interest. In many...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.1556
更新日期:2004-11-15 00:00:00
abstract::The usual methods for analyzing case-cohort studies rely on sometimes not fully efficient weighted estimators. Multiple imputation might be a good alternative because it uses all the data available and approximates the maximum partial likelihood estimator. This method is based on the generation of several plausible co...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4130
更新日期:2011-06-15 00:00:00
abstract::Unmeasured confounding is a common concern when researchers attempt to estimate a treatment effect using observational data or randomized studies with nonperfect compliance. To address this concern, instrumental variable methods, such as 2-stage predictor substitution (2SPS) and 2-stage residual inclusion (2SRI), have...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.7636
更新日期:2018-05-30 00:00:00
abstract::The weighted average treatment effect is a causal measure for the comparison of interventions in a specific target population, which may be different from the population where data are sampled from. For instance, when the goal is to introduce a new treatment to a target population, the question is what efficacy (or ef...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.7980
更新日期:2019-02-10 00:00:00
abstract::Standard approaches to analysis of randomized controlled trials (RCTs) using Markov models make it difficult to generalize treatment effects to new patient groups and synthesize evidence across trials. This paper demonstrates how pair-wise and mixed treatment comparison meta-analysis can be applied to event history da...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4059
更新日期:2011-01-30 00:00:00
abstract::Many critical questions in medicine require the analysis of complex multivariate data, often from large data sets describing numerous variables for numerous subjects. In this paper, we describe CoPlot, a tool for visualizing multivariate data in medicine. CoPlot is an adaptation of multidimensional scaling (MDS) that ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3078
更新日期:2008-05-30 00:00:00
abstract::Individuals may vary in their responses to treatment, and identification of subgroups differentially affected by a treatment is an important issue in medical research. The risk of misleading subgroup analyses has become well known, and some exploratory analyses can be helpful in clarifying how covariates potentially i...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.7970
更新日期:2019-02-10 00:00:00
abstract::Risk prediction procedures can be quite useful for the patient's treatment selection, prevention strategy, or disease management in evidence-based medicine. Often, potentially important new predictors are available in addition to the conventional markers. The question is how to quantify the improvement from the new ma...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.5647
更新日期:2013-06-30 00:00:00
abstract::We develop a simulation-based procedure for determining the required sample size in binomial regression risk assessment studies when response data are subject to misclassification. A Bayesian average power criterion is used to determine a sample size that provides high probability, averaged over the distribution of po...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.3505
更新日期:2009-02-28 00:00:00
abstract::Estimation of genetic and environmental contributions to cancers falls in the framework of generalized linear mixed modelling with several random effect components. Computational challenges remain, however, in dealing with binary or survival phenotypes. In this paper, we consider the analysis of melanoma onset in a po...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.2266
更新日期:2006-09-30 00:00:00
abstract::Case-control studies are usually defined to investigate risk factors for a single disease of interest. However, subsequent to data collection, investigators may wish to examine as an 'outcome' a variable that was an exposure in the original study. A naive analysis that disregards the sampling strategy that gave rise t...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.2398
更新日期:2005-12-30 00:00:00
abstract::Although prostate cancer and benign prostatic hyperplasia are major health problems in U.S. men, little is known about the early stages of the natural history of prostate disease. A molecular biomarker called prostate specific antigen (PSA), together with a unique longitudinal bank of frozen serum, now allows a histor...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/sim.4780130520
更新日期:1994-03-15 00:00:00
abstract::A few large multi-centre male-only heart trials done in the 1970s and 1980s have been seen as ill-conceived because they did not include females. The purpose here is to revisit two of those trials and to consider consequences in terms of cost and power had they been designed to include females. ...
journal_title:Statistics in medicine
pub_type: 杂志文章
doi:10.1002/(sici)1097-0258(19990215)18:3<241::aid-sim
更新日期:1999-02-15 00:00:00