Receiver operating characteristic curves and confidence bands for support vector machines.

Abstract:

:Many problems that appear in biomedical decision-making, such as diagnosing disease and predicting response to treatment, can be expressed as binary classification problems. The support vector machine (SVM) is a popular classification technique that is robust to model misspecification and effectively handles high-dimensional data. The relative costs of false positives and false negatives can vary across application domains. The receiving operating characteristic (ROC) curve provides a visual representation of the trade-off between these two types of errors. Because the SVM does not produce a predicted probability, an ROC curve cannot be constructed in the traditional way of thresholding a predicted probability. However, a sequence of weighted SVMs can be used to construct an ROC curve. Although ROC curves constructed using weighted SVMs have great potential for allowing ROC curves analyses that cannot be done by thresholding predicted probabilities, their theoretical properties have heretofore been underdeveloped. We propose a method for constructing confidence bands for the SVM ROC curve and provide the theoretical justification for the SVM ROC curve by showing that the risk function of the estimated decision rule is uniformly consistent across the weight parameter. We demonstrate the proposed confidence band method using simulation studies. We present a predictive model for treatment response in breast cancer as an illustrative example.

journal_name

Biometrics

journal_title

Biometrics

authors

Luckett DJ,Laber EB,El-Kamary SS,Fan C,Jhaveri R,Perou CM,Shebl FM,Kosorok MR

doi

10.1111/biom.13365

subject

Has Abstract

pub_date

2020-08-31 00:00:00

eissn

0006-341X

issn

1541-0420

pub_type

杂志文章
  • A model-based approach for making ecological inference from distance sampling data.

    abstract::We consider a fully model-based approach for the analysis of distance sampling data. Distance sampling has been widely used to estimate abundance (or density) of animals or plants in a spatially explicit study area. There is, however, no readily available method of making statistical inference on the relationships bet...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2009.01265.x

    authors: Johnson DS,Laake JL,Ver Hoef JM

    更新日期:2010-03-01 00:00:00

  • Bayesian inferences in the Cox model for order-restricted hypotheses.

    abstract::In studying the relationship between an ordered categorical predictor and an event time, it is standard practice to include dichotomous indicators of the different levels of the predictor in a Cox model. One can then use a multiple degree-of-freedom score or partial likelihood ratio test for hypothesis testing. Often,...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2003.00106.x

    authors: Dunson DB,Herring AH

    更新日期:2003-12-01 00:00:00

  • A Bayesian approach to the analysis of quantal bioassay studies using nonparametric mixture models.

    abstract::We develop a Bayesian nonparametric mixture modeling framework for quantal bioassay settings. The approach is built upon modeling dose-dependent response distributions. We adopt a structured nonparametric prior mixture model, which induces a monotonicity restriction for the dose-response curve. Particular emphasis is ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12120

    authors: Fronczyk K,Kottas A

    更新日期:2014-03-01 00:00:00

  • Unbiased and locally efficient estimation of genetic effect on quantitative trait in the presence of population admixture.

    abstract::Population admixture can be a confounding factor in genetic association studies. Family-based methods (Rabinowitz and Larid, 2000, Human Heredity 50, 211-223) have been proposed in both testing and estimation settings to adjust for this confounding, especially in case-only association studies. The family-based methods...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2010.01454.x

    authors: Wang Y,Yang Q,Rabinowitz D

    更新日期:2011-06-01 00:00:00

  • On symmetric semiparametric two-sample problem.

    abstract::We consider a two-sample problem where data come from symmetric distributions. Usual two-sample data with only magnitudes recorded, arising from case-control studies or logistic discriminant analyses, may constitute a symmetric two-sample problem. We propose a semiparametric model such that, in addition to symmetry, t...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.13233

    authors: Li M,Diao G,Qin J

    更新日期:2020-12-01 00:00:00

  • FPCA-based method to select optimal sampling schedules that capture between-subject variability in longitudinal studies.

    abstract::A critical component of longitudinal study design involves determining the sampling schedule. Criteria for optimal design often focus on accurate estimation of the mean profile, although capturing the between-subject variance of the longitudinal process is also important since variance patterns may be associated with ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12714

    authors: Wu M,Diez-Roux A,Raghunathan TE,Sánchez BN

    更新日期:2018-03-01 00:00:00

  • UMPU and alternative tests for association in 2 x 2 tables.

    abstract::The use of the uniformly most powerful among the unbiased (UMPU) test was recently suggested for the study of gametic association between two polymorphic loci as an alternative to the Fisher's exact test (Zapata and Alvarez, 1997, Annals of Human Genetics 61, 71-77). However, the proposed test is not UMPU for two-side...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2001.00535.x

    authors: Fuchs C

    更新日期:2001-06-01 00:00:00

  • A proportional hazards model for multivariate interval-censored failure time data.

    abstract::This paper focuses on the methodology developed for analyzing a multivariate interval-censored data set from an AIDS observational study. A purpose of the study was to determine the natural history of the opportunistic infection cytomeglovirus (CMV) in an HIV-infected individual. For this observational study, laborato...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2000.00940.x

    authors: Goggins WB,Finkelstein DM

    更新日期:2000-09-01 00:00:00

  • A note on the conditional approach to interval estimation in the calibration problem.

    abstract::In the calibration problem, the need to construct a confidence interval to estimate the unknown chi 0 arises when the null hypothesis of zero slope is rejected. Otherwise, the resulting confidence interval will be infinite to reflect the fact that the slope of the regression line may be zero. Under the condition of re...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Lee JJ

    更新日期:1991-12-01 00:00:00

  • Regression estimator in ranked set sampling.

    abstract::Ranked set sampling (RSS) utilizes inexpensive auxiliary information about the ranking of the units in a sample to provide a more precise estimator of the population mean of the variable of interest Y, which is either difficult or expensive to measure. However, the ranking may not be perfect in most situations. In thi...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Yu PL,Lam K

    更新日期:1997-09-01 00:00:00

  • Bias in estimating association parameters for longitudinal binary responses with drop-outs.

    abstract::This paper considers the impact of bias in the estimation of the association parameters for longitudinal binary responses when there are drop-outs. A number of different estimating equation approaches are considered for the case where drop-out cannot be assumed to be a completely random process. In particular, standar...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2001.00015.x

    authors: Fitzmaurice GM,Lipsitz SR,Molenberghs G,Ibrahim JG

    更新日期:2001-03-01 00:00:00

  • Sample size methods for estimating HIV incidence from cross-sectional surveys.

    abstract::Understanding HIV incidence, the rate at which new infections occur in populations, is critical for tracking and surveillance of the epidemic. In this article, we derive methods for determining sample sizes for cross-sectional surveys to estimate incidence with sufficient precision. We further show how to specify samp...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12336

    authors: Konikoff J,Brookmeyer R

    更新日期:2015-12-01 00:00:00

  • Validity of tests under covariate-adaptive biased coin randomization and generalized linear models.

    abstract::Some covariate-adaptive randomization methods have been used in clinical trials for a long time, but little theoretical work has been done about testing hypotheses under covariate-adaptive randomization until Shao et al. (2010) who provided a theory with detailed discussion for responses under linear models. In this a...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12062

    authors: Shao J,Yu X

    更新日期:2013-12-01 00:00:00

  • Bootstrap confidence intervals for adaptive cluster sampling.

    abstract::Consider a collection of spatially clustered objects where the clusters are geographically rare. Of interest is estimation of the total number of objects on the site from a sample of plots of equal size. Under these spatial conditions, adaptive cluster sampling of plots is generally useful in improving efficiency in e...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2000.00503.x

    authors: Christman MC,Pontius JS

    更新日期:2000-06-01 00:00:00

  • A two-stage experimental design for dilution assays.

    abstract::Dilution assays to determine solute concentration have found wide use in biomedical research. Many dilution assays return imprecise concentration estimates because they are only done to orders of magnitude. Previous statistical work has focused on how to design efficient experiments that can return more precise estima...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.13032

    authors: Ferguson JM,Miura TA,Miller CR

    更新日期:2019-09-01 00:00:00

  • Line-segment confidence bands for repeated measures.

    abstract::For the case of repeated measures on Y with mean values linear in a concomitant variable Z in [a, b], a straight-line confidence band over [a, b] is given with width linear in Z. Graphical presentation of such line-segment confidence bands can help emphasize that appropriate inferences are limited to the range of the ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Stewart PW

    更新日期:1987-09-01 00:00:00

  • Estimating differences in restricted mean lifetime using observational data subject to dependent censoring.

    abstract::In epidemiologic studies of time to an event, mean lifetime is often of direct interest. We propose methods to estimate group- (e.g., treatment-) specific differences in restricted mean lifetime for studies where treatment is not randomized and lifetimes are subject to both dependent and independent censoring. The pro...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2010.01503.x

    authors: Zhang M,Schaubel DE

    更新日期:2011-09-01 00:00:00

  • Fisher's contributions to genetics and heredity, with special emphasis on the Gregor Mendel controversy.

    abstract::R. A. Fisher is widely respected for his contributions to both statistics and genetics. For instance, his 1930 text on The Genetical Theory of Natural Selection remains a watershed contribution in that area. Fisher's subsequent research led him to study the work of (Johann) Gregor Mendel, the 19th century monk who fir...

    journal_title:Biometrics

    pub_type: 传,历史文章,杂志文章,评审

    doi:

    authors: Piegorsch WW

    更新日期:1990-12-01 00:00:00

  • Adaptive decision making in a lymphocyte infusion trial.

    abstract::We describe an adaptive Bayesian design for a clinical trial of an experimental treatment for patients with hematologic malignancies who initially received an allogeneic bone marrow transplant but subsequently suffered a disease recurrence. Treatment consists of up to two courses of targeted immunotherapy followed by ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2002.00560.x

    authors: Thall PF,Inoue LY,Martin TG

    更新日期:2002-09-01 00:00:00

  • Attributable effects in case2-studies.

    abstract::In an effort to determine whether a particular treatment causes a particular outcome event, data are obtained from a database system that records events when they occur, and for such events, the system records exposure to the treatment. That is, the system records information about cases. The system provides no inform...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341X.2005.030920.x

    authors: Rosenbaum PR

    更新日期:2005-03-01 00:00:00

  • Statistical inference for serial dilution assay data.

    abstract::Serial dilution assays are widely employed for estimating substance concentrations and minimum inhibitory concentrations. The Poisson-Bernoulli model for such assays is appropriate for count data but not for continuous measurements that are encountered in applications involving substance concentrations. This paper pre...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.1999.01215.x

    authors: Lee ML,Whitmore GA

    更新日期:1999-12-01 00:00:00

  • Statistical modelling of the AIDS epidemic for forecasting health care needs.

    abstract::The objective of this paper is to develop statistical methods for estimating current and future numbers of individuals in different stages of the natural history of the human immunodeficiency (AIDS) virus infection and to evaluate the impact of therapeutic advances on these numbers. The approach is to extend the metho...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Brookmeyer R,Liao JG

    更新日期:1990-12-01 00:00:00

  • A partially linear tree-based regression model for multivariate outcomes.

    abstract::In the genetic study of complex traits, especially behavior related ones, such as smoking and alcoholism, usually several phenotypic measurements are obtained for the description of the complex trait, but no single measurement can quantify fully the complicated characteristics of the symptom because of our lack of und...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2009.01235.x

    authors: Yu K,Wheeler W,Li Q,Bergen AW,Caporaso N,Chatterjee N,Chen J

    更新日期:2010-03-01 00:00:00

  • A unified parametric regression model for recapture studies with random removals in continuous time.

    abstract::Conditional likelihood based on counting processes are combined with a Horvitz-Thompson estimator to yield a population size estimator that is more efficient than the existing ones. Random removals are allowed in the recapturing process. Simulation studies are shown to assess the performance of the proposed estimators...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2002.00192.x

    authors: Yip PS,Wang Y

    更新日期:2002-03-01 00:00:00

  • Comparing the performances of Diggle's tests of spatial randomness for small samples with and without edge-effect correction: application to ecological data.

    abstract::Diggle's tests of spatial randomness based on empirical distributions of interpoint distances can be performed with and without edge-effect correction. We present here numerical results illustrating that tests without the edge-effect correction proposed by Diggle (1979, Biometrics 35, 87-101) have a higher power for s...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.1999.00156.x

    authors: Gignoux J,Duby C,Barot S

    更新日期:1999-03-01 00:00:00

  • Bayesian dose-finding in phase I/II clinical trials using toxicity and efficacy odds ratios.

    abstract::A Bayesian adaptive design is proposed for dose-finding in phase I/II clinical trials to incorporate the bivariate outcomes, toxicity and efficacy, of a new treatment. Without specifying any parametric functional form for the drug dose-response curve, we jointly model the bivariate binary data to account for the corre...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2006.00534.x

    authors: Yin G,Li Y,Ji Y

    更新日期:2006-09-01 00:00:00

  • Additive gamma frailty models with applications to competing risks in related individuals.

    abstract::Epidemiological studies of related individuals are often complicated by the fact that follow-up on the event type of interest is incomplete due to the occurrence of other events. We suggest a class of frailty models with cause-specific hazards for correlated competing events in related individuals. The frailties are b...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12326

    authors: Eriksson F,Scheike T

    更新日期:2015-09-01 00:00:00

  • Pleiotropy informed adaptive association test of multiple traits using genome-wide association study summary data.

    abstract::Genetic variants associated with disease outcomes can be used to develop personalized treatment. To reach this precision medicine goal, hundreds of large-scale genome-wide association studies (GWAS) have been conducted in the past decade to search for promising genetic variants associated with various traits. They hav...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.13076

    authors: Masotti M,Guo B,Wu B

    更新日期:2019-12-01 00:00:00

  • Growth curve models of repeated binary response.

    abstract::Experimental designs that include repeated measures of binary response variables over time and under different conditions are common in biology. In such settings, it is often desirable to characterize the response pattern over time. When response variables are continuous, this characterization can be made in terms of ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Stanek EJ 3rd,Diehl SR

    更新日期:1988-12-01 00:00:00

  • Inference for reaction networks using the linear noise approximation.

    abstract::We consider inference for the reaction rates in discretely observed networks such as those found in models for systems biology, population ecology, and epidemics. Most such networks are neither slow enough nor small enough for inference via the true state-dependent Markov jump process to be feasible. Typically, infere...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12152

    authors: Fearnhead P,Giagos V,Sherlock C

    更新日期:2014-06-01 00:00:00