Pattern discovery of health curves using an ordered probit model with Bayesian smoothing and functional principal component analysis.

Abstract:

:This article is motivated by the need for discovering patterns of patients' health based on their daily settings of care to aid the health policy-makers to improve the effectiveness of distributing funding for health services. The hidden process of one's health status is assumed to be a continuous smooth function, called the health curve, ranging from perfectly healthy to dead. The health curves are linked to the categorical setting of care using an ordered probit model and are inferred through Bayesian smoothing. The challenges include the nontrivial constraints on the lower bound of the health status (death) and on the model parameters to ensure model identifiability. We use the Markov chain Monte Carlo method to estimate the parameters and health curves. The functional principal component analysis is applied to the patients' estimated health curves to discover common health patterns. The proposed method is demonstrated through an application to patients hospitalized from strokes in Ontario. Whilst this paper focuses on the method's application to a health care problem, the proposed model and its implementation have the potential to be applied to many application domains in which the response variable is ordinal and there is a hidden process. Our implementation is available at https://github.com/liangliangwangsfu/healthCurveCode.

journal_name

Stat Methods Med Res

authors

Wang S,Nie Y,Sutherland JM,Wang L

doi

10.1177/0962280220951834

subject

Has Abstract

pub_date

2020-09-25 00:00:00

pages

962280220951834

eissn

0962-2802

issn

1477-0334

pub_type

杂志文章
  • Letter to the editor: Fitting truncated normal distributions.

    abstract::I comment here on a recent paper in this journal, on the fitting of truncated normal distributions by the EM algorithm. I show that the fitting of such distributions by direct numerical maximization of likelihood (rather than EM) is straightforward, contrary to an assertion made by the authors of that paper. ...

    journal_title:Statistical methods in medical research

    pub_type: 评论,信件

    doi:10.1177/0962280217712089

    authors: MacDonald IL

    更新日期:2018-12-01 00:00:00

  • A monotone data augmentation algorithm for longitudinal data analysis via multivariate skew-t, skew-normal or t distributions.

    abstract::The mixed effects model for repeated measures has been widely used for the analysis of longitudinal clinical data collected at a number of fixed time points. We propose a robust extension of the mixed effects model for repeated measures for skewed and heavy-tailed data on basis of the multivariate skew-t distribution,...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280219865579

    authors: Tang Y

    更新日期:2020-06-01 00:00:00

  • Bayesian variable selection in the accelerated failure time model with an application to the surveillance, epidemiology, and end results breast cancer data.

    abstract::Accelerated failure time model is a popular model to analyze censored time-to-event data. Analysis of this model without assuming any parametric distribution for the model error is challenging, and the model complexity is enhanced in the presence of large number of covariates. We developed a nonparametric Bayesian met...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280215626947

    authors: Zhang Z,Sinha S,Maiti T,Shipp E

    更新日期:2018-04-01 00:00:00

  • Linear time-dependent reference intervals where there is measurement error in the time variable-a parametric approach.

    abstract::This article re-examines parametric methods for the calculation of time specific reference intervals where there is measurement error present in the time covariate. Previous published work has commonly been based on the standard ordinary least squares approach, weighted where appropriate. In fact, this is an incorrect...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280211426617

    authors: Gillard J

    更新日期:2015-12-01 00:00:00

  • Inferring the direction of a causal link and estimating its effect via a Bayesian Mendelian randomization approach.

    abstract::The use of genetic variants as instrumental variables - an approach known as Mendelian randomization - is a popular epidemiological method for estimating the causal effect of an exposure (phenotype, biomarker, risk factor) on a disease or health-related outcome from observational data. Instrumental variables must sati...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280219851817

    authors: Bucur IG,Claassen T,Heskes T

    更新日期:2020-04-01 00:00:00

  • The asymptotic maximal procedure for subject randomization in clinical trials.

    abstract::The maximal procedure is a restricted randomization method that maximizes the number of feasible allocation sequences under the constraints of the maximum tolerated imbalance and the allocation sequence length. It assigns an equal probability to all feasible sequences. However, its implementation is not easy due to th...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280216677107

    authors: Zhao W,Berger VW,Yu Z

    更新日期:2018-07-01 00:00:00

  • Hierarchical mixture models for longitudinal immunologic data with heterogeneity, non-normality, and missingness.

    abstract::It is a common practice to analyze longitudinal data frequently arisen in medical studies using various mixed-effects models in the literature. However, the following issues may standout in longitudinal data analysis: (i) In clinical practice, the profile of each subject's response from a longitudinal study may follow...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280214544207

    authors: Huang Y,Chen J,Yin P

    更新日期:2017-02-01 00:00:00

  • Towards joint disease mapping.

    abstract::This article discusses and extends statistical models to jointly analyse the spatial variation of rates of several diseases with common risk factors. We start with a review of methods for separate analyses of diseases, then move to ecological regression approaches, where the rates from one of the diseases enter as sur...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章,评审

    doi:10.1191/0962280205sm389oa

    authors: Held L,Natário I,Fenton SE,Rue H,Becker N

    更新日期:2005-02-01 00:00:00

  • Promoting structural effects of covariates in the cure rate model with penalization.

    abstract::Cure rate models have been widely adopted for characterizing survival data that have long-term survivors. Under a mixture cure rate model where the population is a mixture of cured and susceptible subjects, a primary goal is to study covariate effects on the cure probability and survival function of the susceptible su...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280217708684

    authors: Fan X,Liu M,Fang K,Huang Y,Ma S

    更新日期:2017-10-01 00:00:00

  • Sample size calculation for treatment effects in randomized trials with fixed cluster sizes and heterogeneous intraclass correlations and variances.

    abstract::When comparing two different kinds of group therapy or two individual treatments where patients within each arm are nested within care providers, clustering of observations may occur in both arms. The arms may differ in terms of (a) the intraclass correlation, (b) the outcome variance, (c) the cluster size, and (d) th...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280214563100

    authors: Candel MJ,van Breukelen GJ

    更新日期:2015-10-01 00:00:00

  • Expected p-values in light of an ROC curve analysis applied to optimal multiple testing procedures.

    abstract::Many statistical studies report p-values for inferential purposes. In several scenarios, the stochastic aspect of p-values is neglected, which may contribute to drawing wrong conclusions in real data experiments. The stochastic nature of p-values makes their use to examine the performance of given testing procedures o...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280217704451

    authors: Vexler A,Yu J,Zhao Y,Hutson AD,Gurevich G

    更新日期:2018-12-01 00:00:00

  • Measuring agreement in method comparison studies.

    abstract::Agreement between two methods of clinical measurement can be quantified using the differences between observations made using the two methods on the same subjects. The 95% limits of agreement, estimated by mean difference +/- 1.96 standard deviation of the differences, provide an interval within which 95% of differenc...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章,评审

    doi:10.1177/096228029900800204

    authors: Bland JM,Altman DG

    更新日期:1999-06-01 00:00:00

  • Semi-supervised identification of cancer subgroups using survival outcomes and overlapping grouping information.

    abstract::Identification of cancer patient subgroups using high throughput genomic data is of critical importance to clinicians and scientists because it can offer opportunities for more personalized treatment and overlapping treatments of cancers. In spite of tremendous efforts, this problem still remains challenging because o...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280217752980

    authors: Wei W,Sun Z,da Silveira WA,Yu Z,Lawson A,Hardiman G,Kelemen LE,Chung D

    更新日期:2019-07-01 00:00:00

  • Maximum likelihood estimation of time to first event in the presence of data gaps and multiple events.

    abstract::We propose a novel likelihood method for analyzing time-to-event data when multiple events and multiple missing data intervals are possible prior to the first observed event for a given subject. This research is motivated by data obtained from a heart monitor used to track the recovery process of subjects experiencing...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280212466089

    authors: Green CL,Brownie C,Boos DD,Lu JC,Krucoff MW

    更新日期:2016-04-01 00:00:00

  • Probability intervals of toxicity and efficacy design for dose-finding clinical trials in oncology.

    abstract::Immunotherapy, gene therapy or adoptive cell therapies, such as the chimeric antigen receptor+ T-cell therapies, have demonstrated promising therapeutic effects in oncology patients. We consider statistical designs for dose-finding adoptive cell therapy trials, in which the monotonic dose-response relationship assumed...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280220977009

    authors: Lin X,Ji Y

    更新日期:2020-12-16 00:00:00

  • Nonparametric estimation of risk tracking indices for longitudinal studies.

    abstract::Tracking a subject's risk factors or health status over time is an important objective in long-term epidemiological studies with repeated measurements. An important issue of time-trend tracking is to define appropriate statistical indices to quantitatively measure the tracking abilities of the targeted risk factors or...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280219839427

    authors: Wu CO,Tian X,Tian L,Reis JP,Zhao L,Allen NB,Bae S,Liu K

    更新日期:2020-02-01 00:00:00

  • Testing hypotheses under adaptive randomization with continuous covariates in clinical trials.

    abstract::Covariate-adaptive designs are widely used to balance covariates and maintain randomization in clinical trials. Adaptive designs for discrete covariates and their asymptotic properties have been well studied in the literature. However, important continuous covariates are often involved in clinical studies. Simply disc...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280218770231

    authors: Li X,Zhou J,Hu F

    更新日期:2019-06-01 00:00:00

  • Optimal scheduling of post-therapeutic follow-up of patients treated for cancer for early detection of relapses.

    abstract::Post-therapeutic surveillance is one important component of cancer care. However, there still is no evidence-based strategies to schedule patients' follow-up examinations. Our approach is based on the modeling of the probability of the onset of relapse at an early asymptotic or preclinical stage and its transition to ...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280214524178

    authors: Somda SM,Leconte E,Boher JM,Asselain B,Kramar A,Filleron T

    更新日期:2016-12-01 00:00:00

  • Joint modelling for organ transplantation outcomes for patients with diabetes and the end-stage renal disease.

    abstract::This article is motivated by jointly modelling longitudinal and time-to-event clinical data of patients with diabetes and end-stage renal disease. All patients are on the waiting list for the pancreas transplant after kidney transplant, and some of them have a pancreas transplant before kidney transplant failure or de...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280218786980

    authors: Dong JJ,Wang S,Wang L,Gill J,Cao J

    更新日期:2019-09-01 00:00:00

  • Power and sample size for multivariate logistic modeling of unmatched case-control studies.

    abstract::Sample size calculations are needed to design and assess the feasibility of case-control studies. Although such calculations are readily available for simple case-control designs and univariate analyses, there is limited theory and software for multivariate unconditional logistic analysis of case-control data. Here we...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280217737157

    authors: Gail MH,Haneuse S

    更新日期:2019-03-01 00:00:00

  • Joint nested frailty models for clustered recurrent and terminal events: An application to colonoscopy screening visits and colorectal cancer risks in Lynch Syndrome families.

    abstract::Joint models for recurrent and terminal events have not been yet developed for clustered data. The goals of our study are to develop a statistical framework for modelling clustered recurrent and terminal events and to perform dynamic predictions of the terminal event in family studies. We propose a joint nested frailt...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280219863076

    authors: Choi YH,Jacqmin-Gadda H,Król A,Parfrey P,Briollais L,Rondeau V

    更新日期:2020-05-01 00:00:00

  • A comparison of machine learning methods for classification using simulation with multiple real data examples from mental health studies.

    abstract:BACKGROUND:Recent literature on the comparison of machine learning methods has raised questions about the neutrality, unbiasedness and utility of many comparative studies. Reporting of results on favourable datasets and sampling error in the estimated performance measures based on single samples are thought to be the m...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280213502437

    authors: Khondoker M,Dobson R,Skirrow C,Simmons A,Stahl D

    更新日期:2016-10-01 00:00:00

  • Efficient estimation of a linear transformation model for current status data via penalized splines.

    abstract::We propose a flexible and computationally efficient penalized estimation method for a semi-parametric linear transformation model with current status data. To facilitate model fitting, the unknown monotone function is approximated by monotone B-splines, and a computationally efficient hybrid algorithm involving the Fi...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280218820406

    authors: Lu M,Liu Y,Li CS

    更新日期:2020-01-01 00:00:00

  • A transformation class for spatio-temporal survival data with a cure fraction.

    abstract::We propose a hierarchical Bayesian methodology to model spatially or spatio-temporal clustered survival data with possibility of cure. A flexible continuous transformation class of survival curves indexed by a single parameter is used. This transformation model is a larger class of models containing two special cases ...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280212445658

    authors: Hurtado Rúa SM,Dey DK

    更新日期:2016-02-01 00:00:00

  • Semiparametric models for multilevel overdispersed count data with extra zeros.

    abstract::This study proposes semiparametric models for analysis of hierarchical count data containing excess zeros and overdispersion simultaneously. The methods discussed in this paper handle nonlinear covariate effects through flexible semiparametric multilevel regression techniques. This is performed by providing a comprehe...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280216657376

    authors: Mahmoodi M,Moghimbeigi A,Mohammad K,Faradmal J

    更新日期:2018-04-01 00:00:00

  • Obtaining evidence by a single well-powered trial or several modestly powered trials.

    abstract::There is debate whether clinical trials with suboptimal power are justified and whether results from large studies are more reliable than the (combined) results of smaller trials. We quantified the error rates for evaluations based on single conventionally powered trials (80% or 90% power) versus evaluations based on ...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280212461098

    authors: IntHout J,Ioannidis JP,Borm GF

    更新日期:2016-04-01 00:00:00

  • The cross-validated AUC for MCP-logistic regression with high-dimensional data.

    abstract::We propose a cross-validated area under the receiving operator characteristic (ROC) curve (CV-AUC) criterion for tuning parameter selection for penalized methods in sparse, high-dimensional logistic regression models. We use this criterion in combination with the minimax concave penalty (MCP) method for variable selec...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280211428385

    authors: Jiang D,Huang J,Zhang Y

    更新日期:2013-10-01 00:00:00

  • Estimation of regression quantiles in complex surveys with data missing at random: An application to birthweight determinants.

    abstract::The estimation of population parameters using complex survey data requires careful statistical modelling to account for the design features. This is further complicated by unit and item nonresponse for which a number of methods have been developed in order to reduce estimation bias. In this paper, we address some issu...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280213484401

    authors: Geraci M

    更新日期:2016-08-01 00:00:00

  • Checking linearity of non-parametric component in partially linear models with an application in systemic inflammatory response syndrome study.

    abstract::Two tests are proposed for checking the linearity of nonparametric function in partially linear models. The first one is based on a Crámer-von Mises statistic. This test can detect the local alternative converging to the null at the parametric rate 1/square root n. A bootstrap resample technique is provided to calcula...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1191/0962280206sm440oa

    authors: Liang H

    更新日期:2006-06-01 00:00:00

  • Analysis of clustered competing risks data using subdistribution hazard models with multivariate frailties.

    abstract::Competing risks data often exist within a center in multi-center randomized clinical trials where the treatment effects or baseline risks may vary among centers. In this paper, we propose a subdistribution hazard regression model with multivariate frailty to investigate heterogeneity in treatment effects among centers...

    journal_title:Statistical methods in medical research

    pub_type: 杂志文章

    doi:10.1177/0962280214526193

    authors: Ha ID,Christian NJ,Jeong JH,Park J,Lee Y

    更新日期:2016-12-01 00:00:00