Data-adaptive additive modeling.

Abstract:

In this paper, we consider fitting a flexible and interpretable additive regression model in a data-rich setting. We wish to avoid pre-specifying the functional form of the conditional association between each covariate and the response, while still retaining interpretability of the fitted functions. A number of recent proposals in the literature for nonparametric additive modeling are data adaptive, in the sense that they can adjust the level of flexibility in the functional fits to the data at hand. For instance, the sparse additive model makes it possible to adaptively determine which features should be included in the fitted model, the sparse partially linear additive model allows each feature in the fitted model to take either a linear or a nonlinear functional form, and the recent fused lasso additive model and additive trend filtering proposals allow the knots in each nonlinear function fit to be selected from the data. In this paper, we combine the strengths of each of these recent proposals into a single proposal that uses the data to determine which features to include in the model, whether to model each feature linearly or nonlinearly, and what form to use for the nonlinear functions. We establish connections between our approach and recent proposals from the literature, and we demonstrate its strengths in a simulation study.

journal_name

Stat Med

journal_title

Statistics in medicine

authors

Petersen A,Witten D

doi

10.1002/sim.7859

subject

Has Abstract

pub_date

2019-02-20 00:00:00

pages

583-600

issue

4

eissn

1097-0258

issn

0277-6715

journal_volume

38

pub_type

Journal Article
  • Hierarchical nested trial design (HNTD) for demonstrating treatment efficacy of new antibacterial drugs in patient populations with emerging bacterial resistance.

    abstract::In the last decade or so, pharmaceutical drug development activities in the area of new antibacterial drugs for treating serious bacterial diseases have declined, and at the same time, there are worries that the increased prevalence of antibiotic-resistant bacterial infections, especially the increase in drug-resistan...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6233

    authors: Huque MF,Valappil T,Soon GG

    update_date:2014-11-10 00:00:00

  • A new approach to designing phase I-II cancer trials for cytotoxic chemotherapies.

    abstract::Recently, there has been much work on early phase cancer designs that incorporate both toxicity and efficacy data, called phase I-II designs because they combine elements of both phases. However, they do not explicitly address the phase II hypothesis test of H0 : p ≤ p0 , where p is the probability of efficacy at the ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6124

    authors: Bartroff J,Lai TL,Narasimhan B

    update_date:2014-07-20 00:00:00

  • Statistical issues related to dietary intake as the response variable in intervention trials.

    abstract::The focus of this paper is dietary intervention trials. We explore the statistical issues involved when the response variable, intake of a food or nutrient, is based on self-report data that are subject to inherent measurement error. There has been little work on handling error in this context. A particular feature of...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7011

    authors: Keogh RH,Carroll RJ,Tooze JA,Kirkpatrick SI,Freedman LS

    update_date:2016-11-10 00:00:00

  • Methods for assessing reliability and validity for a measurement tool: a case study and critique using the WHO haemoglobin colour scale.

    abstract::Before introducing a new measurement tool it is necessary to evaluate its performance. Several statistical methods have been developed, or used, to evaluate the reliability and validity of a new assessment method in such circumstances. In this paper we review some commonly used methods. Data from a study that was cond...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1804

    authors: White SA,van den Broek NR

    update_date:2004-05-30 00:00:00

  • Cutoff designs for community-based intervention studies.

    abstract::Public health interventions are often designed to target communities defined either geographically (e.g. cities, counties) or socially (e.g. schools or workplaces). The group randomized trial (GRT) is regarded as the gold standard for evaluating these interventions. However, community leaders may object to randomizati...

    journal_title:Statistics in medicine

    pub_type: Journal Article, Randomized Controlled Trial

    doi:10.1002/sim.4237

    authors: Pennell ML,Hade EM,Murray DM,Rhoda DA

    update_date:2011-07-10 00:00:00

  • Estimation of death rates in US states with small subpopulations.

    abstract::In US states with small subpopulations, the observed mortality rates are often zero, particularly among young ages. Because in life tables, death rates are reported mostly on a log scale, zero mortality rates are problematic. To overcome the observed zero death rates problem, appropriate probability models are used. U...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6385

    authors: Voulgaraki A,Wei R,Kedem B

    update_date:2015-05-20 00:00:00

  • Methodological considerations on the design and analysis of an equivalence stratified cluster randomization trial.

    abstract::The World Health Organization and collaborating institutions in four developing countries have conducted a multi-centre randomized controlled trial, in which clinics were allocated at random to two antenatal care (ANC) models. These were the standard 'Western' ANC model and a 'new' ANC model consisting of tests, clini...

    journal_title:Statistics in medicine

    pub_type: Clinical Trial, Journal Article, Randomized Controlled Trial

    doi:10.1002/1097-0258(20010215)20:3<401::aid-sim801>3.

    authors: Piaggio G,Carroli G,Villar J,Pinol A,Bakketeig L,Lumbiganon P,Bergsjø P,Al-Mazrou Y,Ba'aqeel H,Belizán JM,Farnot U,Berendes H,WHO Antenatal Care Trial Research Group.

    update_date:2001-02-15 00:00:00

  • Measurement in clinical trials: a neglected issue for statisticians?

    abstract::Biostatisticians have frequently uncritically accepted the measurements provided by their medical colleagues engaged in clinical research. Such measures often involve considerable loss of information. Particularly, unfortunate is the widespread use of the so-called 'responder analysis', which may involve not only a lo...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3603

    authors: Senn S,Julious S

    update_date:2009-11-20 00:00:00

  • Testing the equality of two Poisson means using the rate ratio.

    abstract::In this article, we investigate procedures for comparing two independent Poisson variates that are observed over unequal sampling frames (i.e. time intervals, populations, areas or any combination thereof). We consider two statistics (with and without the logarithmic transformation) for testing the equality of two Poi...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1949

    authors: Ng HK,Tang ML

    update_date:2005-03-30 00:00:00

  • Complete imputation of missing repeated categorical data: one-sample applications.

    abstract::Longitudinal studies with repeated measures are often subject to non-response. Methods currently employed to alleviate the difficulties caused by missing data are typically unsatisfactory, especially when the cause of the missingness is related to the outcomes. We present an approach for incomplete categorical data in...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.982

    authors: West CP,Dawson JD

    update_date:2002-01-30 00:00:00

  • Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond.

    abstract::Identification of key factors associated with the risk of developing cardiovascular disease and quantification of this risk using multivariable prediction algorithms are among the major advances made in preventive cardiology and cardiovascular epidemiology in the 20th century. The ongoing discovery of new risk markers...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.2929

    authors: Pencina MJ,D'Agostino RB Sr,D'Agostino RB Jr,Vasan RS

    update_date:2008-01-30 00:00:00

  • Semiparametric ROC surfaces for continuous diagnostic tests based on two test measurements.

    abstract::We propose a semiparametric method for estimating ROC surfaces for continuous diagnostic tests based on two test measurements. Such a three-class diagnostic problem based on two test measurements arises naturally from some DNA amplification-related diagnostic scenarios. Simulation results show that our proposed semipa...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3625

    authors: Wan S,Zhang B

    update_date:2009-08-15 00:00:00

  • Measuring vaccine efficacy from epidemics of acute infectious agents.

    abstract::A good measure of field vaccine efficacy should evaluate the direct protective effect of vaccination on the person who receives the vaccine. The conventional estimator for vaccine efficacy depends on population level factors that are either unrelated or indirectly related to the direct biological action of the vaccine...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780120309

    authors: Longini IM Jr,Halloran ME,Haber M,Chen RT

    update_date:1993-02-01 00:00:00

  • Methods for proper handling of overrunning and underrunning in phase II designs for oncology trials.

    abstract::Phase II studies in oncology are frequently conducted as two-stage single-arm trials with a binary endpoint indicating tumor response. As a common feature of these designs, the sample sizes of the two stages and the decision rules for the interim and the final analysis have to be pre-specified and adhered to strictly ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6479

    authors: Englert S,Kieser M

    update_date:2015-06-15 00:00:00

  • Prevalence-dependent diagnostic accuracy measures.

    abstract::We study prevalence-dependent diagnostic accuracy measures, specifically, positive and negative predictive values. These measures permit an assessment of the clinical utility of diagnostic tests across populations with different disease prevalences. In many cases, prevalence may not be known with certainty and the eva...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.2812

    authors: Li J,Fine JP,Safdar N

    update_date:2007-07-30 00:00:00

  • Multistate models and lifetime risk estimation: Application to Alzheimer's disease.

    abstract::The lifetime risk of a clinical condition is the probability of onset of the condition during one's lifespan. Recent advances in Alzheimer's disease (AD) research have identified screening tests for biomarkers that can identify persons who are in the earliest stages of the AD process but who do not yet have any clinic...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.8056

    authors: Brookmeyer R,Abdalla N

    update_date:2019-04-30 00:00:00

  • Assurance calculations for planning clinical trials with time-to-event outcomes.

    abstract::We consider the use of the assurance method in clinical trial planning. In the assurance method, which is an alternative to a power calculation, we calculate the probability of a clinical trial resulting in a successful outcome, via eliciting a prior probability distribution about the relevant treatment effect. This i...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.5916

    authors: Ren S,Oakley JE

    update_date:2014-01-15 00:00:00

  • Association models for periodontal disease progression: a comparison of methods for clustered binary data.

    abstract::We investigate population-averaged (PA) and cluster-specific (CS) associations for clustered binary logistic regression in the context of a longitudinal clinical trial that investigated the association between tooth-specific visual elastase kit results and periodontal disease progression within 26 weeks of follow-up. ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780140407

    authors: Ten Have TR,Landis JR,Weaver SL

    update_date:1995-02-28 00:00:00

  • A practical approach for the assessment of bioequivalence under selected higher-order cross-over designs.

    abstract::The two-period cross-over design with two sequences of drug administration is a standard experimental design when bioequivalence of one test formulation is to be assessed in comparison with a reference formulation. Previously, an approach based on Fieller's confidence interval has been presented for the assessment of ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/(sici)1097-0258(19971015)16:19<2229::aid-s

    authors: Vuorinen J

    update_date:1997-10-15 00:00:00

  • A mechanistic breast cancer survival modelling through the axillary lymph node chain.

    abstract::In this paper, we proposed a mechanistic breast cancer survival model based on the axillary lymph node chain structure, considering lymph nodes as a potential dissemination arrangement. We assume a naive breast cancer treatment protocol consisting of exposing patients first to a chemotherapy treatment on r intervals a...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.5576

    authors: Cobre J,Castro Perdoná GS,Peria FM,Louzada F

    update_date:2013-04-30 00:00:00

  • Assessing surrogates as trial endpoints using mixed models.

    abstract::Having a surrogate for a definitive endpoint in a clinical trial can sometimes be useful when it is impractical, invasive or very time consuming to obtain the definitive endpoint. This paper discusses methods for assessing whether the surrogate-endpoint results of a trial can be used in place of definitive-endpoint re...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1779

    authors: Korn EL,Albert PS,McShane LM

    update_date:2005-01-30 00:00:00

  • British 1990 growth reference centiles for weight, height, body mass index and head circumference fitted by maximum penalized likelihood.

    abstract::To update the British growth reference, anthropometric data for weight, height, body mass index (weight/height2) and head circumference from 17 distinct surveys representative of England, Scotland and Wales (37,700 children, age range 23 weeks gestation to 23 years) were analysed by maximum penalized likelihood using ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:

    authors: Cole TJ,Freeman JV,Preece MA

    update_date:1998-02-28 00:00:00

  • Assessing time-by-covariate interactions in proportional hazards regression models using cubic spline functions.

    abstract::Proportional hazards (or Cox) regression is a popular method for modelling the effects of prognostic factors on survival. Use of cubic spline functions to model time-by-covariate interactions in Cox regression allows investigation of the shape of a possible covariate-time dependence without having to specify a specifi...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780131007

    authors: Hess KR

    update_date:1994-05-30 00:00:00

  • On Bayesian methods of exploring qualitative interactions for targeted treatment.

    abstract::Providing personalized treatments designed to maximize benefits and minimizing harms is of tremendous current medical interest. One problem in this area is the evaluation of the interaction between the treatment and other predictor variables. Treatment effects in subgroups having the same direction but different magni...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.5429

    authors: Chen W,Ghosh D,Raghunathan TE,Norkin M,Sargent DJ,Bepler G

    update_date:2012-12-10 00:00:00

  • The SAEM algorithm for group comparison tests in longitudinal data analysis based on non-linear mixed-effects model.

    abstract::Non-linear mixed-effects models (NLMEMs) are used to improve information gathering from longitudinal studies and are applied to treatment evaluation in disease-evolution studies, such as human immunodeficiency virus (HIV) infection. The estimation of parameters and the statistical tests are critical issues in NLMEMs s...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.2950

    authors: Samson A,Lavielle M,Mentré F

    update_date:2007-11-30 00:00:00

  • On tests of the overall treatment effect in meta-analysis with normally distributed responses.

    abstract::For the meta-analysis of controlled clinical trials or epidemiological studies, in which the responses are at least approximately normally distributed, a refined test for the hypothesis of no overall treatment effect is proposed. The test statistic is based on a direct estimation function for the variance of the overa...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.791

    authors: Hartung J,Knapp G

    update_date:2001-06-30 00:00:00

  • Estimating the stage-specific numbers of HIV infection using a Markov model and back-calculation.

    abstract::The back-calculation method has been used to estimate the number of HIV infections from AIDS incidence data in a particular population. We present an extension of back calculation that provides estimates of the numbers of HIV infectives in different stages of infection. We model the staging process with a time-depende...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780110612

    authors: Longini IM Jr,Byers RH,Hessol NA,Tan WY

    update_date:1992-04-01 00:00:00

  • Play the winner for phase II/III clinical trials.

    abstract::In comparing two treatments under a typical sequential clinical trial setting, a 50-50 randomization design generates reliable data for making efficient inferences about the treatment difference for the benefit of patients in the general population. However, if the treatment difference is large and the endpoint of the...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/(sici)1097-0258(19961130)15:22<2413::aid-s

    authors: Yao Q,Wei LJ

    update_date:1996-11-15 00:00:00

  • A general frailty model to accommodate individual heterogeneity in the acquisition of multiple infections: An application to bivariate current status data.

    abstract::The analysis of multivariate time-to-event (TTE) data can become complicated due to the presence of clustering, leading to dependence between multiple event times. For a long time, (conditional) frailty models and (marginal) copula models have been used to analyze clustered TTE data. In this article, we propose a gene...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.8506

    authors: Tran TMP,Abrams S,Braekers R

    update_date:2020-05-30 00:00:00

  • Discriminant analysis using a multivariate linear mixed model with a normal mixture in the random effects distribution.

    abstract::We have developed a method to longitudinally classify subjects into two or more prognostic groups using longitudinally observed values of markers related to the prognosis. We assume the availability of a training data set where the subjects' allocation into the prognostic group is known. The proposed method proceeds i...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3849

    authors: Komárek A,Hansen BE,Kuiper EM,van Buuren HR,Lesaffre E

    update_date:2010-12-30 00:00:00