Bayesian variable selection in the accelerated failure time model with an application to the surveillance, epidemiology, and end results breast cancer data.

Abstract:

:Accelerated failure time model is a popular model to analyze censored time-to-event data. Analysis of this model without assuming any parametric distribution for the model error is challenging, and the model complexity is enhanced in the presence of large number of covariates. We developed a nonparametric Bayesian method for regularized estimation of the regression parameters in a flexible accelerated failure time model. The novelties of our method lie in modeling the error distribution of the accelerated failure time nonparametrically, modeling the variance as a function of the mean, and adopting a variable selection technique in modeling the mean. The proposed method allowed for identifying a set of important regression parameters, estimating survival probabilities, and constructing credible intervals of the survival probabilities. We evaluated operating characteristics of the proposed method via simulation studies. Finally, we apply our new comprehensive method to analyze the motivating breast cancer data from the Surveillance, Epidemiology, and End Results Program, and estimate the five-year survival probabilities for women included in the Surveillance, Epidemiology, and End Results database who were diagnosed with breast cancer between 1990 and 2000.

journal_name

Stat Methods Med Res

authors

Zhang Z,Sinha S,Maiti T,Shipp E

doi

10.1177/0962280215626947

subject

Has Abstract

pub_date

2018-04-01 00:00:00

pages

971-990

issue

4

eissn

0962-2802

issn

1477-0334

journal_volume

27

pub_type

杂志文章
  • Forensic inference from genetic markers.

    abstract::This review provides an overview of forensic inference from genetic markers. Because the judge and jurors are charged with decision-making, the forensic expert's job is to provide a useful summary of the evidence to the court. Hence, this review focuses on the likelihood ratio as a means of summarizing the genetic dat...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章,评审

    doi:10.1177/096228029300200304

    authors: Devlin B

    更新日期:1993-01-01 00:00:00

  • Receiver operating characteristic curve estimation for time to event with semicompeting risks and interval censoring.

    abstract::Semicompeting risks and interval censoring are frequent in medical studies, for instance when a disease may be diagnosed only at times of visit and disease onset is in competition with death. To evaluate the ability of markers to predict disease onset in this context, estimators of discrimination measures must account...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280214531691

    authors: Jacqmin-Gadda H,Blanche P,Chary E,Touraine C,Dartigues JF

    更新日期:2016-12-01 00:00:00

  • armDNA: A functional beta model for detecting age-related genomewide DNA methylation marks.

    abstract::DNA methylation has been shown to play an important role in many complex diseases. The rapid development of high-throughput DNA methylation scan technologies provides great opportunities for genomewide DNA methylation-disease association studies. As methylation is a dynamic process involving time, it is quite plausibl...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280216683571

    authors: Wang C,Shen Q,Du L,Xu J,Zhang H

    更新日期:2018-09-01 00:00:00

  • Bayesian nonparametric mixed-effects joint model for longitudinal-competing risks data analysis in presence of multiple data features.

    abstract::Recently, the joint analysis of longitudinal and survival data has been an active research area. Most joint models focus on survival data with only one type of failure. The research on joint modeling of longitudinal and competing risks survival data is sparse. Even so, many joint models for this type of data assume pa...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280215597939

    authors: Lu T

    更新日期:2017-10-01 00:00:00

  • A test of inflated zeros for Poisson regression models.

    abstract::Excessive zeros are common in practice and may cause overdispersion and invalidate inference when fitting Poisson regression models. There is a large body of literature on zero-inflated Poisson models. However, methods for testing whether there are excessive zeros are less well developed. The Vuong test comparing a Po...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280217749991

    authors: He H,Zhang H,Ye P,Tang W

    更新日期:2019-04-01 00:00:00

  • An ad hoc method for dual adjusting for measurement errors and nonresponse bias for estimating prevalence in survey data: Application to Iranian mental health survey on any illicit drug use.

    abstract::Purpose The prevalence estimates of binary variables in sample surveys are often subject to two systematic errors: measurement error and nonresponse bias. A multiple-bias analysis is essential to adjust for both biases. Methods In this paper, we linked the latent class log-linear and proxy pattern-mixture models to ad...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280217690939

    authors: Khalagi K,Mansournia MA,Motevalian SA,Nourijelyani K,Rahimi-Movaghar A,Bakhtiyari M

    更新日期:2018-10-01 00:00:00

  • Accurate quantification of uncertainty in epidemic parameter estimates and predictions using stochastic compartmental models.

    abstract::Stochastic transmission dynamic models are needed to quantify the uncertainty in estimates and predictions during outbreaks of infectious diseases. We previously developed a calibration method for stochastic epidemic compartmental models, called Multiple Shooting for Stochastic Systems (MSS), and demonstrated its comp...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280218805780

    authors: Zimmer C,Leuba SI,Cohen T,Yaesoubi R

    更新日期:2019-12-01 00:00:00

  • Interval estimation of a population mean using existing knowledge or data on effect sizes.

    abstract::Bayes or empirical Bayes methods to improve inferential accuracy for a population mean has been widely adopted in medical research. As the joint prior distribution of both the mean and variance parameters can be difficult to specify or estimate, most of these methods have relied on certain level of simplifications of ...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280218773537

    authors: Shen C

    更新日期:2019-06-01 00:00:00

  • A comparison of machine learning methods for classification using simulation with multiple real data examples from mental health studies.

    abstract:BACKGROUND:Recent literature on the comparison of machine learning methods has raised questions about the neutrality, unbiasedness and utility of many comparative studies. Reporting of results on favourable datasets and sampling error in the estimated performance measures based on single samples are thought to be the m...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280213502437

    authors: Khondoker M,Dobson R,Skirrow C,Simmons A,Stahl D

    更新日期:2016-10-01 00:00:00

  • A quick and accurate method for the estimation of covariate effects based on empirical Bayes estimates in mixed-effects modeling: Correction of bias due to shrinkage.

    abstract::Nonlinear mixed-effects modeling is a popular approach to describe the temporal trajectory of repeated measurements of clinical endpoints collected over time in clinical trials, to distinguish the within-subject and the between-subject variabilities, and to investigate clinically important risk factors (covariates) th...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280218812595

    authors: Yuan M,Xu XS,Yang Y,Xu J,Huang X,Tao F,Zhao L,Zhang L,Pinheiro J

    更新日期:2019-12-01 00:00:00

  • Prediction intervals with random forests.

    abstract::The classical and most commonly used approach to building prediction intervals is the parametric approach. However, its main drawback is that its validity and performance highly depend on the assumed functional link between the covariates and the response. This research investigates new methods that improve the perfor...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280219829885

    authors: Roy MH,Larocque D

    更新日期:2020-01-01 00:00:00

  • Interpretation of mixed models and marginal models with cohort attrition due to death and drop-out.

    abstract::Mixed models estimated by maximum likelihood and marginal models estimated by generalized estimating equations are the standard methods for the analysis of longitudinal data. However, their use is highly debated when attrition may be due to death. While some authors consider that mixed model estimates are interpretabl...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280217723675

    authors: Rouanet A,Helmer C,Dartigues JF,Jacqmin-Gadda H

    更新日期:2019-02-01 00:00:00

  • Gene selection for survival data under dependent censoring: A copula-based approach.

    abstract::Dependent censoring arises in biomedical studies when the survival outcome of interest is censored by competing risks. In survival data with microarray gene expressions, gene selection based on the univariate Cox regression analyses has been used extensively in medical research, which however, is only valid under the ...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280214533378

    authors: Emura T,Chen YH

    更新日期:2016-12-01 00:00:00

  • Modelling of zero-inflation improves inference of metagenomic gene count data.

    abstract::Metagenomics enables the study of gene abundances in complex mixtures of microorganisms and has become a standard methodology for the analysis of the human microbiome. However, gene abundance data is inherently noisy and contains high levels of biological and technical variability as well as an excess of zeros due to ...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280218811354

    authors: Jonsson V,Österlund T,Nerman O,Kristiansson E

    更新日期:2019-12-01 00:00:00

  • Estimating the average treatment effects of nutritional label use using subclassification with regression adjustment.

    abstract::Propensity score methods are common for estimating a binary treatment effect when treatment assignment is not randomized. When exposure is measured on an ordinal scale (i.e. low-medium-high), however, propensity score inference requires extensions which have received limited attention. Estimands of possible interest w...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280214560046

    authors: Lopez MJ,Gutman R

    更新日期:2017-04-01 00:00:00

  • Measuring agreement in method comparison studies.

    abstract::Agreement between two methods of clinical measurement can be quantified using the differences between observations made using the two methods on the same subjects. The 95% limits of agreement, estimated by mean difference +/- 1.96 standard deviation of the differences, provide an interval within which 95% of differenc...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章,评审

    doi:10.1177/096228029900800204

    authors: Bland JM,Altman DG

    更新日期:1999-06-01 00:00:00

  • Bayesian sample size calculation for estimation of the difference between two binomial proportions.

    abstract::In this study, we discuss a decision theoretic or fully Bayesian approach to the sample size question in clinical trials with binary responses. Data are assumed to come from two binomial distributions. A Dirichlet distribution is assumed to describe prior knowledge of the two success probabilities p1 and p2. The param...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280211399562

    authors: Pezeshk H,Nematollahi N,Maroufy V,Marriott P,Gittins J

    更新日期:2013-12-01 00:00:00

  • Predicting brain activity using a Bayesian spatial model.

    abstract::Increasing the clinical applicability of functional neuroimaging technology is an emerging objective, e.g. for diagnostic and treatment purposes. We propose a novel Bayesian spatial hierarchical framework for predicting follow-up neural activity based on an individual's baseline functional neuroimaging data. Our appro...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280212448972

    authors: Derado G,Bowman FD,Zhang L,Alzheimer's Disease Neuroimaging Initiative Investigators.

    更新日期:2013-08-01 00:00:00

  • The EM algorithm in medical imaging.

    abstract::This article outlines the statistical developments that have taken place in the use of the EM algorithm in emission and transmission tomography during the past decade or so. We discuss the statistical aspects of the modelling of the projection data for both the emission and transmission cases and define the relevant p...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/096228029700600105

    authors: Kay J

    更新日期:1997-03-01 00:00:00

  • Bayesian latent structure modeling of walking behavior in a physical activity intervention.

    abstract::The analysis of walking behavior in a physical activity intervention is considered. A Bayesian latent structure modeling approach is proposed whereby the ability and willingness of participants is modeled via latent effects. The dropout process is jointly modeled via a linked survival model. Computational issues are a...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280214529932

    authors: Lawson AB,Ellerbe C,Carroll R,Alia K,Coulon S,Wilson DK,VanHorn ML,George SM

    更新日期:2016-12-01 00:00:00

  • Efficient two-stage sequential arrays of proof of concept studies for pharmaceutical portfolios.

    abstract::Previous work has shown that individual randomized "proof-of-concept" (PoC) studies may be designed to maximize cost-effectiveness, subject to an overall PoC budget constraint. Maximizing cost-effectiveness has also been considered for arrays of simultaneously executed PoC studies. Defining Type III error as the oppor...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280220958177

    authors: He L,Du L,Antonijevic Z,Posch M,Korostyshevskiy VR,Beckman RA

    更新日期:2020-09-21 00:00:00

  • Multilevel models for censored and latent responses.

    abstract::Multilevel models were originally developed to allow linear regression or ANOVA models to be applied to observations that are not mutually independent. This lack of independence commonly arises due to clustering of the units of observations into 'higher level units' such as patients in hospitals. In linear mixed model...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/096228020101000604

    authors: Rabe-Hesketh S,Yang S,Pickles A

    更新日期:2001-12-01 00:00:00

  • Joint nested frailty models for clustered recurrent and terminal events: An application to colonoscopy screening visits and colorectal cancer risks in Lynch Syndrome families.

    abstract::Joint models for recurrent and terminal events have not been yet developed for clustered data. The goals of our study are to develop a statistical framework for modelling clustered recurrent and terminal events and to perform dynamic predictions of the terminal event in family studies. We propose a joint nested frailt...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280219863076

    authors: Choi YH,Jacqmin-Gadda H,Król A,Parfrey P,Briollais L,Rondeau V

    更新日期:2020-05-01 00:00:00

  • A generalization of functional clustering for discrete multivariate longitudinal data.

    abstract::This paper presents a new model-based generalized functional clustering method for discrete longitudinal data, such as multivariate binomial and Poisson distributed data. For this purpose, we propose a multivariate functional principal component analysis (MFPCA)-based clustering procedure for a latent multivariate Gau...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280220921912

    authors: Lim Y,Cheung YK,Oh HS

    更新日期:2020-11-01 00:00:00

  • Model selection in multivariate semiparametric regression.

    abstract::Variable selection in semiparametric mixed models for longitudinal data remains a challenge, especially in the presence of multiple correlated outcomes. In this paper, we propose a model selection procedure that simultaneously selects fixed and random effects using a maximum penalized likelihood method with the adapti...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280217690769

    authors: Li Z,Liu H,Tu W

    更新日期:2018-10-01 00:00:00

  • The application of multidimensional scaling methods to epidemiological data.

    abstract::This paper illustrates the use of multidimensional scaling methods (MDS) to examine space-time patterns in epidemic data. The paper begins by outlining the principles of MDS. The model is then formally specified and illustrated by application to two data sets. The first is partly a tutorial example. It uses monthly re...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/096228029500400202

    authors: Cliff AD,Haggett P,Smallman-Raynor MR,Stroup DF,Williamson GD

    更新日期:1995-06-01 00:00:00

  • Combining estimates of the odds ratio: the state of the art.

    abstract::Medical research commonly relies on the combination of 2 x 2 tables of counted data for making inferences about treatment effects or about the causes of disease. This article reviews point estimation and interval estimation for a common odds ratio. Traditional methods for providing these estimates face special challen...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章,评审

    doi:10.1177/096228029400300204

    authors: Emerson JD

    更新日期:1994-01-01 00:00:00

  • Inferring the direction of a causal link and estimating its effect via a Bayesian Mendelian randomization approach.

    abstract::The use of genetic variants as instrumental variables - an approach known as Mendelian randomization - is a popular epidemiological method for estimating the causal effect of an exposure (phenotype, biomarker, risk factor) on a disease or health-related outcome from observational data. Instrumental variables must sati...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280219851817

    authors: Bucur IG,Claassen T,Heskes T

    更新日期:2020-04-01 00:00:00

  • Promoting structural effects of covariates in the cure rate model with penalization.

    abstract::Cure rate models have been widely adopted for characterizing survival data that have long-term survivors. Under a mixture cure rate model where the population is a mixture of cured and susceptible subjects, a primary goal is to study covariate effects on the cure probability and survival function of the susceptible su...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280217708684

    authors: Fan X,Liu M,Fang K,Huang Y,Ma S

    更新日期:2017-10-01 00:00:00

  • A comparison of power analysis methods for evaluating effects of a predictor on slopes in longitudinal designs with missing data.

    abstract::In many longitudinal studies, evaluating the effect of a binary or continuous predictor variable on the rate of change of the outcome, i.e. slope, is often of primary interest. Sample size determination of these studies, however, is complicated by the expectation that missing data will occur due to missed visits, earl...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280212437452

    authors: Wang C,Hall CB,Kim M

    更新日期:2015-12-01 00:00:00