Estimation of ROC curve with complex survey data.

Abstract:

:The receiver operating characteristic (ROC) curve can be utilized to evaluate the performance of diagnostic tests. The area under the ROC curve (AUC) is a widely used summary index for comparing multiple ROC curves. Both parametric and nonparametric methods have been developed to estimate and compare the AUCs. However, these methods are usually only applicable to data collected from simple random samples and not surveys and epidemiologic studies that use complex sample designs such as stratified and/or multistage cluster sampling with sample weighting. Such complex samples can inflate variances from intra-cluster correlation and alter the expectations of test statistics because of the use of sample weights that account for differential sampling rates. In this paper, we modify the nonparametric method to incorporate sampling weights to estimate the AUC and employ leaving-one-out jackknife methods along with the balanced repeated replication method to account for the effects of the complex sampling in the variance estimation of our proposed estimators of the AUC. The finite sample properties of our methods are evaluated using simulations, and our methods are illustrated by comparing the estimated AUC for predicting overweight/obesity using different measures of body weight and adiposity among sampled children and adults in the US Hispanic Health and Nutrition Examination Survey.

journal_name

Stat Med

journal_title

Statistics in medicine

authors

Yao W,Li Z,Graubard BI

doi

10.1002/sim.6405

subject

Has Abstract

pub_date

2015-04-15 00:00:00

pages

1293-303

issue

8

eissn

0277-6715

issn

1097-0258

journal_volume

34

pub_type

杂志文章
  • Estimating treated prevalence and service utilization rates: assessing disparities in mental health.

    abstract::There is considerable public concern about health disparities among different cultural/racial/ethnic groups. Important process measures that might reflect inequities are treated prevalence and the service utilization rate in a defined period of time. We have previously described a method for estimating N, the distinct...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3904

    authors: Laska EM,Meisner M,Wanderling J,Siegel C

    更新日期:2010-07-20 00:00:00

  • A Bayesian hierarchical variable selection prior for pathway-based GWAS using summary statistics.

    abstract::While genome-wide association studies (GWASs) have been widely used to uncover associations between diseases and genetic variants, standard SNP-level GWASs often lack the power to identify SNPs that individually have a moderate effect size but jointly contribute to the disease. To overcome this problem, pathway-based ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8442

    authors: Yang Y,Basu S,Zhang L

    更新日期:2020-03-15 00:00:00

  • Direct effects testing: a two-stage procedure to test for effect size and variable importance for correlated binary predictors and a binary response.

    abstract::In applications such as medical statistics and genetics, we encounter situations where a large number of highly correlated predictors explain a response. For example, the response may be a disease indicator and the predictors may be treatment indicators or single nucleotide polymorphisms (SNPs). Constructing a good pr...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4014

    authors: Sperrin M,Jaki T

    更新日期:2010-10-30 00:00:00

  • Adjusted Kaplan-Meier estimator and log-rank test with inverse probability of treatment weighting for survival data.

    abstract::Estimation and group comparison of survival curves are two very common issues in survival analysis. In practice, the Kaplan-Meier estimates of survival functions may be biased due to unbalanced distribution of confounders. Here we develop an adjusted Kaplan-Meier estimator (AKME) to reduce confounding effects using in...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2174

    authors: Xie J,Liu C

    更新日期:2005-10-30 00:00:00

  • Measurement error in dietary assessment: an investigation using covariance structure models. Part II.

    abstract::In Part I we presented a covariance structure model for analysing measurement error in the assessment of nitrogen intake. In this paper we include data on urine nitrogen excretion which allows a critical assessment of the model proposed. Inclusion of urine nitrogen data produces more pessimistic estimates of the quali...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780121005

    authors: Plummer M,Clayton D

    更新日期:1993-05-30 00:00:00

  • Modelling heterogeneity in clustered count data with extra zeros using compound Poisson random effect.

    abstract::In medical and health studies, heterogeneities in clustered count data have been traditionally modeled by positive random effects in Poisson mixed models; however, excessive zeros often occur in clustered medical and health count data. In this paper, we consider a three-level random effects zero-inflated Poisson model...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3619

    authors: Ma R,Hasan MT,Sneddon G

    更新日期:2009-08-15 00:00:00

  • Modelling the relationship between continuous covariates and clinical events using isotonic regression.

    abstract::In a medical study we are often interested in graphically displaying the relationship between continuous variables and clinical events indicating disease progression. Often, it is reasonable to make the minimal assumption that the risk of progression is an arbitrary monotone function of the continuous variable. Someti...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1561

    authors: Ancukiewicz M,Finkelstein DM,Schoenfeld DA

    更新日期:2003-10-30 00:00:00

  • Survival analyses of randomized clinical trials adjusted for patients who switch treatments.

    abstract::Patients who switch treatment groups in randomized clinical trials can cause problems in the interpretation of the results. Although the intention-to-treat method is recognized as being the most reliable analysis, it may result in an underestimate of the treatment effect if there have been patients who switch treatmen...

    journal_title:Statistics in medicine

    pub_type: 临床试验,杂志文章,随机对照试验

    doi:10.1002/(SICI)1097-0258(19961015)15:19<2069::AID-S

    authors: Law MG,Kaldor JM

    更新日期:1996-10-15 00:00:00

  • Selection of patients for randomized controlled trials: implications of wide or narrow eligibility criteria.

    abstract::This paper discusses the various philosophies that influence the selection of patients for entry into randomized controlled trials. Although a number of different and often competing issues have to be considered depending upon the trial, keeping entry criteria simple, wide and at times even flexible is usually prefera...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780090114

    authors: Yusuf S,Held P,Teo KK,Toretsky ER

    更新日期:1990-01-01 00:00:00

  • Simple models for repeated ordinal responses with an application to a seasonal rhinitis clinical trial.

    abstract::In contrast to other models for ordinal data, the continuation ratio model can be fitted with standard statistical software. This makes it particularly appropriate for large clinical trials with ordinal response variables. In addition, when the trials are longitudinal, this model can be applied to individual responses...

    journal_title:Statistics in medicine

    pub_type: 临床试验,杂志文章,随机对照试验

    doi:10.1002/(sici)1097-0258(19971230)16:24<2873::aid-s

    authors: Lindsey JK,Jones B,Ebbutt AF

    更新日期:1997-12-30 00:00:00

  • A new sequential procedure for surveillance of Down's syndrome.

    abstract::A new method is proposed for the surveillance of Down's syndrome among newborn. Despite the strong dependence of overall risk of Down's syndrome on maternal age, it has been suggested that an environmentally induced increase in risk may be additive over all maternal ages. The surveillance method introduced here is spe...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780120104

    authors: Lie RT,Heuch I,Irgens LM

    更新日期:1993-01-15 00:00:00

  • Constrained S-estimators for linear mixed effects models with covariance components.

    abstract::Linear mixed effects (LME) models are increasingly used for analyses of biological and biomedical data. When the multivariate normal assumption is not adequate for an LME model, then a robust estimation approach is preferable to the maximum likelihood one. M-estimators were considered before for robust estimation of t...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4169

    authors: Chervoneva I,Vishnyakov M

    更新日期:2011-06-30 00:00:00

  • Performance of analytical methods for overdispersed counts in cluster randomized trials: sample size, degree of clustering and imbalance.

    abstract::Many different methods have been proposed for the analysis of cluster randomized trials (CRTs) over the last 30 years. However, the evaluation of methods on overdispersed count data has been based mostly on the comparison of results using empiric data; i.e. when the true model parameters are not known. In this study, ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3681

    authors: Durán Pacheco G,Hattendorf J,Colford JM Jr,Mäusezahl D,Smith T

    更新日期:2009-10-30 00:00:00

  • Armitage lecture 2010: Understanding treatment effects: the value of integrating longitudinal data and survival analysis.

    abstract::There is a single-minded focus on events in survival analysis, and we often ignore longitudinal data that are collected together with the event data. This is due to a lack of methodology but also a result of the artificial distinction between survival and longitudinal data analyses. Understanding the dynamics of such ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.5324

    authors: Aalen OO

    更新日期:2012-08-15 00:00:00

  • Smooth bootstrap methods for analysis of longitudinal data.

    abstract::In analysis of longitudinal data, the variance matrix of the parameter estimates is usually estimated by the 'sandwich' method, in which the variance for each subject is estimated by its residual products. We propose smooth bootstrap methods by perturbing the estimating functions to obtain 'bootstrapped' realizations ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3027

    authors: Li Y,Wang YG

    更新日期:2008-03-30 00:00:00

  • Combining biomarker trajectories to improve diagnostic accuracy in prospective cohort studies with verification bias.

    abstract::In this paper, we develop methods to combine multiple biomarker trajectories into a composite diagnostic marker using functional data analysis (FDA) to achieve better diagnostic accuracy in monitoring disease recurrence in the setting of a prospective cohort study. In such studies, the disease status is usually verifi...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.8079

    authors: Li H,Gatsonis C

    更新日期:2019-05-20 00:00:00

  • A semi-parametric Bayesian approach to average bioequivalence.

    abstract::Bioequivalence assessment is an issue of great interest. Development of statistical methods for assessing bioequivalence is an important area of research for statisticians. Bioequivalence is usually determined based on the normal distribution. We relax this assumption and develop a semi-parametric mixed model for bioe...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2620

    authors: Ghosh P,Rosner GL

    更新日期:2007-03-15 00:00:00

  • Bootstrap confidence intervals for medical costs with censored observations.

    abstract::Medical costs data with administratively censored observations often arise in cost-effectiveness studies of treatments for life-threatening diseases. Mean of medical costs incurred from the start of a treatment until death or a certain time point after the implementation of treatment is frequently of interest. In many...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.1556

    authors: Jiang H,Zhou XH

    更新日期:2004-11-15 00:00:00

  • Multiple imputation analysis of case-cohort studies.

    abstract::The usual methods for analyzing case-cohort studies rely on sometimes not fully efficient weighted estimators. Multiple imputation might be a good alternative because it uses all the data available and approximates the maximum partial likelihood estimator. This method is based on the generation of several plausible co...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4130

    authors: Marti H,Chavance M

    更新日期:2011-06-15 00:00:00

  • A general approach to evaluating the bias of 2-stage instrumental variable estimators.

    abstract::Unmeasured confounding is a common concern when researchers attempt to estimate a treatment effect using observational data or randomized studies with nonperfect compliance. To address this concern, instrumental variable methods, such as 2-stage predictor substitution (2SPS) and 2-stage residual inclusion (2SRI), have...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.7636

    authors: Wan F,Small D,Mitra N

    更新日期:2018-05-30 00:00:00

  • Doubly robust estimation of the weighted average treatment effect for a target population.

    abstract::The weighted average treatment effect is a causal measure for the comparison of interventions in a specific target population, which may be different from the population where data are sampled from. For instance, when the goal is to introduce a new treatment to a target population, the question is what efficacy (or ef...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.7980

    authors: Tao Y,Fu H

    更新日期:2019-02-10 00:00:00

  • Parameterization of treatment effects for meta-analysis in multi-state Markov models.

    abstract::Standard approaches to analysis of randomized controlled trials (RCTs) using Markov models make it difficult to generalize treatment effects to new patient groups and synthesize evidence across trials. This paper demonstrates how pair-wise and mixed treatment comparison meta-analysis can be applied to event history da...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4059

    authors: Price MJ,Welton NJ,Ades AE

    更新日期:2011-01-30 00:00:00

  • CoPlot: a tool for visualizing multivariate data in medicine.

    abstract::Many critical questions in medicine require the analysis of complex multivariate data, often from large data sets describing numerous variables for numerous subjects. In this paper, we describe CoPlot, a tool for visualizing multivariate data in medicine. CoPlot is an adaptation of multidimensional scaling (MDS) that ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3078

    authors: Bravata DM,Shojania KG,Olkin I,Raveh A

    更新日期:2008-05-30 00:00:00

  • Estimating heterogeneous treatment effects for latent subgroups in observational studies.

    abstract::Individuals may vary in their responses to treatment, and identification of subgroups differentially affected by a treatment is an important issue in medical research. The risk of misleading subgroup analyses has become well known, and some exploratory analyses can be helpful in clarifying how covariates potentially i...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.7970

    authors: Kim HJ,Lu B,Nehus EJ,Kim MO

    更新日期:2019-02-10 00:00:00

  • A unified inference procedure for a class of measures to assess improvement in risk prediction systems with survival data.

    abstract::Risk prediction procedures can be quite useful for the patient's treatment selection, prevention strategy, or disease management in evidence-based medicine. Often, potentially important new predictors are available in addition to the conventional markers. The question is how to quantify the improvement from the new ma...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.5647

    authors: Uno H,Tian L,Cai T,Kohane IS,Wei LJ

    更新日期:2013-06-30 00:00:00

  • Bayesian approach to average power calculations for binary regression models with misclassified outcomes.

    abstract::We develop a simulation-based procedure for determining the required sample size in binomial regression risk assessment studies when response data are subject to misclassification. A Bayesian average power criterion is used to determine a sample size that provides high probability, averaged over the distribution of po...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.3505

    authors: Cheng D,Stamey JD,Branscum AJ

    更新日期:2009-02-28 00:00:00

  • Estimation of genetic and environmental factors for melanoma onset using population-based family data.

    abstract::Estimation of genetic and environmental contributions to cancers falls in the framework of generalized linear mixed modelling with several random effect components. Computational challenges remain, however, in dealing with binary or survival phenotypes. In this paper, we consider the analysis of melanoma onset in a po...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2266

    authors: Lindström L,Pawitan Y,Reilly M,Hemminki K,Lichtenstein P,Czene K

    更新日期:2006-09-30 00:00:00

  • Re-use of case-control data for analysis of new outcome variables.

    abstract::Case-control studies are usually defined to investigate risk factors for a single disease of interest. However, subsequent to data collection, investigators may wish to examine as an 'outcome' a variable that was an exposure in the original study. A naive analysis that disregards the sampling strategy that gave rise t...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.2398

    authors: Reilly M,Torrång A,Klint A

    更新日期:2005-12-30 00:00:00

  • Mixed-effects regression models for studying the natural history of prostate disease.

    abstract::Although prostate cancer and benign prostatic hyperplasia are major health problems in U.S. men, little is known about the early stages of the natural history of prostate disease. A molecular biomarker called prostate specific antigen (PSA), together with a unique longitudinal bank of frozen serum, now allows a histor...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/sim.4780130520

    authors: Pearson JD,Morrell CH,Landis PK,Carter HB,Brant LJ

    更新日期:1994-03-15 00:00:00

  • Redesign of trials under different enrollment mixes.

    abstract::A few large multi-centre male-only heart trials done in the 1970s and 1980s have been seen as ill-conceived because they did not include females. The purpose here is to revisit two of those trials and to consider consequences in terms of cost and power had they been designed to include females. ...

    journal_title:Statistics in medicine

    pub_type: 杂志文章

    doi:10.1002/(sici)1097-0258(19990215)18:3<241::aid-sim

    authors: Meinert CL

    更新日期:1999-02-15 00:00:00