Receiver operating characteristic curves and confidence bands for support vector machines.

Abstract:

:Many problems that appear in biomedical decision-making, such as diagnosing disease and predicting response to treatment, can be expressed as binary classification problems. The support vector machine (SVM) is a popular classification technique that is robust to model misspecification and effectively handles high-dimensional data. The relative costs of false positives and false negatives can vary across application domains. The receiving operating characteristic (ROC) curve provides a visual representation of the trade-off between these two types of errors. Because the SVM does not produce a predicted probability, an ROC curve cannot be constructed in the traditional way of thresholding a predicted probability. However, a sequence of weighted SVMs can be used to construct an ROC curve. Although ROC curves constructed using weighted SVMs have great potential for allowing ROC curves analyses that cannot be done by thresholding predicted probabilities, their theoretical properties have heretofore been underdeveloped. We propose a method for constructing confidence bands for the SVM ROC curve and provide the theoretical justification for the SVM ROC curve by showing that the risk function of the estimated decision rule is uniformly consistent across the weight parameter. We demonstrate the proposed confidence band method using simulation studies. We present a predictive model for treatment response in breast cancer as an illustrative example.

journal_name

Biometrics

journal_title

Biometrics

authors

Luckett DJ,Laber EB,El-Kamary SS,Fan C,Jhaveri R,Perou CM,Shebl FM,Kosorok MR

doi

10.1111/biom.13365

subject

Has Abstract

pub_date

2020-08-31 00:00:00

eissn

0006-341X

issn

1541-0420

pub_type

杂志文章
  • A Markov model for analysing cancer markers and disease states in survival studies.

    abstract::In studies of serial cancer markers or disease states and their relation to survival, data on the marker or state are usually obtained at infrequent time points during follow-up. A Markov model is developed to assess the dependence of risk of death on marker level or disease state and inferences within this model are ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Kay R

    更新日期:1986-12-01 00:00:00

  • Order-restricted tests for stratified comparisons of binomial proportions.

    abstract::The data set presented relates a binomial response to ordered levels of an explanatory variable, representing doses of a drug, with data collected at several centers. A study goal is to test independence of the response and the ordinal factor, assuming under the alternative only that the binomial parameter is a monoto...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Agresti A,Coull BA

    更新日期:1996-09-01 00:00:00

  • Time-varying functional regression for predicting remaining lifetime distributions from longitudinal trajectories.

    abstract::A recurring objective in longitudinal studies on aging and longevity has been the investigation of the relationship between age-at-death and current values of a longitudinal covariate trajectory that quantifies reproductive or other behavioral activity. We propose a novel technique for predicting age-at-death distribu...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2005.00378.x

    authors: Müller HG,Zhang Y

    更新日期:2005-12-01 00:00:00

  • Simple test for the Hardy-Weinberg law for HLA data with no observed double blanks.

    abstract::Eguchi and Matsuura (1990, Biometrics 46, 415-426) noted that the generalized Stevens test statistic for the Hardy-Weinberg law for human leukocyte antigen (HLA) data yields an excessively large value when no double blanks are observed. In this paper, we investigated this aberrant case. The inflated value of the test ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Nam J

    更新日期:1995-03-01 00:00:00

  • The effect of conditional dependence on the evaluation of diagnostic tests.

    abstract::The accuracy of a new diagnostic test is often determined by comparison with a reference test which also has unknown error rates. Maximum likelihood estimation of the error rates of both tests is possible if they are simultaneously applied to two populations with different disease prevalences. The estimation procedure...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Vacek PM

    更新日期:1985-12-01 00:00:00

  • Composite sampling.

    abstract::In sampling certain types of materials, such as bags of fertilizer, or subsampling large quantities of water, as might be done in investigating the density of plankton in certain environmental situations, it is customary to composite the samples. That is, several samples are drawn, mixed into a composite sample, and a...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Rohde CA

    更新日期:1976-06-01 00:00:00

  • On the use of the variogram in checking for independence in spatial data.

    abstract::The variogram is a standard tool in the analysis of spatial data, and its shape provides useful information on the form of spatial correlation that may be present. However, it is also useful to be able to assess the evidence for the presence of any spatial correlation. A method of doing this, based on an assessment of...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2001.00211.x

    authors: Diblasi A,Bowman AW

    更新日期:2001-03-01 00:00:00

  • A semiparametric empirical likelihood method for biased sampling schemes with auxiliary covariates.

    abstract::We consider a semiparametric inference procedure for data from epidemiologic studies conducted with a two-component sampling scheme where both a simple random sample and multiple outcome- or outcome-/auxiliary-dependent samples are observed. This sampling scheme allows the investigators to oversample certain subpopula...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2006.00612.x

    authors: Wang X,Zhou H

    更新日期:2006-12-01 00:00:00

  • Cohort case-control design and analysis for clustered failure-time data.

    abstract::Cohort case-control design is an efficient and economical design to study risk factors for disease incidence or mortality in a large cohort. In the last few decades, a variety of cohort case-control designs have been developed and theoretically justified. These designs have been exclusively applied to the analysis of ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2002.00764.x

    authors: Lu SE,Wang MC

    更新日期:2002-12-01 00:00:00

  • A Bayesian approach to jointly modeling toxicity and biomarker expression in a phase I/II dose-finding trial.

    abstract::In this article, we propose a Bayesian approach to phase I/II dose-finding oncology trials by jointly modeling a binary toxicity outcome and a continuous biomarker expression outcome. We apply our method to a clinical trial of a new gene therapy for bladder cancer patients. In this trial, the biomarker expression indi...

    journal_title:Biometrics

    pub_type: 临床试验,杂志文章

    doi:10.1111/j.1541-0420.2005.00314.x

    authors: Bekele BN,Shen Y

    更新日期:2005-06-01 00:00:00

  • Group sequential tests for bivariate response: interim analyses of clinical trials with both efficacy and safety endpoints.

    abstract::We describe group sequential tests for a bivariate response. The tests are defined in terms of the two response components jointly, rather than through a single summary statistic. Such methods are appropriate when the two responses concern different aspects of a treatment; for example, one might wish to show that a ne...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Jennison C,Turnbull BW

    更新日期:1993-09-01 00:00:00

  • Sparse generalized eigenvalue problem with application to canonical correlation analysis for integrative analysis of methylation and gene expression data.

    abstract::We present a method for individual and integrative analysis of high dimension, low sample size data that capitalizes on the recurring theme in multivariate analysis of projecting higher dimensional data onto a few meaningful directions that are solutions to a generalized eigenvalue problem. We propose a general framew...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12886

    authors: Safo SE,Ahn J,Jeon Y,Jung S

    更新日期:2018-12-01 00:00:00

  • Analysis of ordered categorical data: two score-independent approaches.

    abstract:SUMMARY:A trend test is often employed to analyze ordered categorical data, in which a set of increasing scores is assigned a priori. There is a drawback in this approach, because how to choose a set of scores is not clear. There have been debates on which scores should be used (e.g., Graubard and Korn, 1987, Biometric...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2008.00992.x

    authors: Zheng G

    更新日期:2008-12-01 00:00:00

  • Dose-finding designs for HIV studies.

    abstract::We present a class of simple designs that can be used in early dose-finding studies in HIV. Such designs, in contrast with Phase I designs in cancer, have a lot of the Phase II flavor about them. Information on efficacy is obtained during the trial and is as important as that relating to toxicity. The designs proposed...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2001.01018.x

    authors: O'Quigley J,Hughes MD,Fenton T

    更新日期:2001-12-01 00:00:00

  • Locally efficient estimation of the quality-adjusted lifetime distribution with right-censored data and covariates.

    abstract::Zhao and Tsiatis (1997) consider the problem of estimation of the distribution of the quality-adjusted lifetime when the chronological survival time is subject to right censoring. The quality-adjusted lifetime is typically defined as a weighted sum of the times spent in certain states up until death or some other fail...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.1999.00530.x

    authors: van der Laan MJ,Hubbard A

    更新日期:1999-06-01 00:00:00

  • Testing for cubic smoothing splines under dependent data.

    abstract::In most research on smoothing splines the focus has been on estimation, while inference, especially hypothesis testing, has received less attention. By defining design matrices for fixed and random effects and the structure of the covariance matrices of random errors in an appropriate way, the cubic smoothing spline a...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2010.01537.x

    authors: Nummi T,Pan J,Siren T,Liu K

    更新日期:2011-09-01 00:00:00

  • Combining multivariate bioassays.

    abstract::Linear multivariate theory is applied to the problem of combining several multivariate bioassays. Results are an asymptotic test of the hypothesis of a common log relative potency; the maximum likelihood estimator of the common log relative potency; and an exact and asymptotic confidence interval estimator for log rel...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Meisner M,Kushner HB,Laska EM

    更新日期:1986-06-01 00:00:00

  • Capture-recapture estimation using finite mixtures of arbitrary dimension.

    abstract::Reversible jump Markov chain Monte Carlo (RJMCMC) methods are used to fit Bayesian capture-recapture models incorporating heterogeneity in individuals and samples. Heterogeneity in capture probabilities comes from finite mixtures and/or fixed sample effects allowing for interactions. Estimation by RJMCMC allows automa...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2009.01289.x

    authors: Arnold R,Hayakawa Y,Yip P

    更新日期:2010-06-01 00:00:00

  • Some distribution properties of the sample species-diversity indices and their applications.

    abstract::In the area of ecological research the study of species diversity of a community or population seems to have been fully developed. However, the problem of how the distributions and expectations of the sample diversity indices are affected by the population diversity has received little attention. In this paper we show...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Tong YL

    更新日期:1983-12-01 00:00:00

  • Regression analysis of multivariate grouped survival data.

    abstract::Multivariate failure time data arise when each study subject may experience several types of event or when there are clusterings of observational units such that failure times within the same cluster are correlated. The failure times are often subject to interval grouping or have truly discrete measurements. In this p...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Guo SW,Lin DY

    更新日期:1994-09-01 00:00:00

  • Estimating the average treatment effect on survival based on observational data and using partly conditional modeling.

    abstract::Treatments are frequently evaluated in terms of their effect on patient survival. In settings where randomization of treatment is not feasible, observational data are employed, necessitating correction for covariate imbalances. Treatments are usually compared using a hazard ratio. Most existing methods which quantify ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12542

    authors: Gong Q,Schaubel DE

    更新日期:2017-03-01 00:00:00

  • A semiparametric joint model for longitudinal and survival data with application to hemodialysis study.

    abstract::In many longitudinal clinical studies, the level and progression rate of repeatedly measured biomarkers on each subject quantify the severity of the disease and that subject's susceptibility to progression of the disease. It is of scientific and clinical interest to relate such quantities to a later time-to-event clin...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2008.01168.x

    authors: Li L,Hu B,Greene T

    更新日期:2009-09-01 00:00:00

  • Summary rates.

    abstract::The use of summary rates of reporting health data is discussed. Directly and indirectly adjusted rates are compared under two models of no interaction. A method of analysis is described whereby a model is first fitted to data in order to investigate assumptions, and subsequently the smoothed rates based on the model a...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Freeman DH Jr,Holford TR

    更新日期:1980-06-01 00:00:00

  • Testing equality of survival functions based on both paired and unpaired censored data.

    abstract::We introduce two test procedures for comparing two survival distributions on the basis of randomly right-censored data consisting of both paired and unpaired observations. Our procedures are based on generalizations of a pooled rank test statistic previously proposed for uncensored data. One generalization adapts the ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2000.00154.x

    authors: Dallas MJ,Rao PV

    更新日期:2000-03-01 00:00:00

  • Estimating differences in restricted mean lifetime using observational data subject to dependent censoring.

    abstract::In epidemiologic studies of time to an event, mean lifetime is often of direct interest. We propose methods to estimate group- (e.g., treatment-) specific differences in restricted mean lifetime for studies where treatment is not randomized and lifetimes are subject to both dependent and independent censoring. The pro...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2010.01503.x

    authors: Zhang M,Schaubel DE

    更新日期:2011-09-01 00:00:00

  • Semiparametric estimation of proportional mean residual life model in presence of censoring.

    abstract::A mean residual life function is the average remaining life of a surviving subject, as it varies with time. The proportional mean residual life model was proposed by Oakes and Dasu (1990, Biometrika77, 409-410) in regression analysis to study its association with related covariates in absence of censoring. In this art...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341X.2005.030224.x

    authors: Chen YQ,Jewell NP,Lei X,Cheng SC

    更新日期:2005-03-01 00:00:00

  • Estimation in a Cox proportional hazards cure model.

    abstract::Some failure time data come from a population that consists of some subjects who are susceptible to and others who are nonsusceptible to the event of interest. The data typically have heavy censoring at the end of the follow-up period, and a standard survival analysis would not always be appropriate. In such situation...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2000.00227.x

    authors: Sy JP,Taylor JM

    更新日期:2000-03-01 00:00:00

  • Calculating sample size for studies with expected all-or-none nonadherence and selection bias.

    abstract:SUMMARY:We develop sample size formulas for studies aiming to test mean differences between a treatment and control group when all-or-none nonadherence (noncompliance) and selection bias are expected. Recent work by Fay, Halloran, and Follmann (2007, Biometrics 63, 465-474) addressed the increased variances within grou...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2008.01114.x

    authors: Shardell MD,El-Kamary SS

    更新日期:2009-06-01 00:00:00

  • A Monte Carlo investigation of homogeneity tests of the odds ratio under various sample size configurations.

    abstract::Epidemiologic data for case-control studies are often summarized into K 2 x 2 tables. Given a fixed number of cases and controls, the degree of sparseness in the data depends on the number of strata, K. The effect of increasing stratification on size and power of seven tests of homogeneity of the odds ratio is studied...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Jones MP,O'Gorman TW,Lemke JH,Woolson RF

    更新日期:1989-03-01 00:00:00

  • Case-control analysis with partial knowledge of exposure misclassification probabilities.

    abstract::Consider case control analysis with a dichotomous exposure variable that is subject to misclassification. If the classification probabilities are known, then methods are available to adjust odds-ratio estimates in light of the misclassification. We study the realistic scenario where reasonable guesses, but not exact v...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2001.00598.x

    authors: Gustafson P,Le ND,Saskin R

    更新日期:2001-06-01 00:00:00