Mixture modelling for cluster analysis.

Abstract:

Cluster analysis via a finite mixture model approach is considered. With this approach to clustering, the data can be partitioned into a specified number of clusters g by first fitting a mixture model with g components. An outright clustering of the data is then obtained by assigning an observation to the component to which it has the highest estimated posterior probability of belonging; that is, the ith cluster consists of those observations assigned to the ith component (i = 1,..., g). The focus is on the use of mixtures of normal components for the cluster analysis of data that can be regarded as being continuous. But attention is also given to the case of mixed data, where the observations consist of both continuous and discrete variables.
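The two-step procedure in the abstract (fit a g-component normal mixture, then assign each observation to the component with the highest estimated posterior probability) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes univariate data and fits the mixture with a plain EM algorithm, a fitting method the abstract does not specify. The function names `fit_normal_mixture`, `posterior_probabilities` and `assign_clusters`, the quantile-based initialisation and the variance floor are all hypothetical choices made for illustration.

```python
import math
import random


def normal_pdf(x, mean, var):
    """Density of a univariate normal distribution with the given mean and variance."""
    return math.exp(-(x - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def posterior_probabilities(x, weights, means, variances):
    """Estimated posterior probability that x belongs to each component."""
    dens = [w * normal_pdf(x, m, v) for w, m, v in zip(weights, means, variances)]
    total = sum(dens)
    return [d / total for d in dens]


def fit_normal_mixture(data, g, n_iter=200):
    """Fit a g-component univariate normal mixture by EM (illustrative sketch only)."""
    n = len(data)
    ordered = sorted(data)
    # Assumed initialisation: starting means at evenly spaced sample quantiles,
    # equal mixing proportions, and the overall sample variance for each component.
    means = [ordered[(2 * i + 1) * n // (2 * g)] for i in range(g)]
    overall_mean = sum(data) / n
    overall_var = sum((x - overall_mean) ** 2 for x in data) / n
    variances = [overall_var] * g
    weights = [1.0 / g] * g
    for _ in range(n_iter):
        # E-step: posterior probabilities of component membership.
        post = [posterior_probabilities(x, weights, means, variances) for x in data]
        # M-step: update mixing proportions, means and variances.
        for i in range(g):
            n_i = sum(p[i] for p in post)
            weights[i] = n_i / n
            means[i] = sum(p[i] * x for p, x in zip(post, data)) / n_i
            var_i = sum(p[i] * (x - means[i]) ** 2 for p, x in zip(post, data)) / n_i
            variances[i] = max(var_i, 1e-6)  # assumed floor to avoid degenerate components
    return weights, means, variances


def assign_clusters(data, weights, means, variances):
    """Outright clustering: index of the component with the highest posterior probability."""
    labels = []
    for x in data:
        post = posterior_probabilities(x, weights, means, variances)
        labels.append(max(range(len(post)), key=post.__getitem__))
    return labels


if __name__ == "__main__":
    # Simulated data from two well-separated normal components (illustration only).
    rng = random.Random(1)
    sample = [rng.gauss(0.0, 1.0) for _ in range(100)]
    sample += [rng.gauss(6.0, 1.0) for _ in range(100)]
    w, m, v = fit_normal_mixture(sample, g=2)
    clusters = assign_clusters(sample, w, m, v)
    print("mixing proportions:", w)
    print("means:", m)
    print("cluster sizes:", [clusters.count(i) for i in range(2)])
```

The sketch handles only continuous univariate data. Multivariate normal components and the mixed continuous and discrete case mentioned in the abstract would need component densities beyond this example.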

journal_name

Stat Methods Med Res

authors

McLachlan GJ,Chang SU

doi

10.1191/0962280204sm372ra

subject

Has Abstract

pub_date

2004-10-01 00:00:00

pages

347-61

issue

5

eissn

1477-0334

issn

0962-2802

journal_volume

13

pub_type

Journal article
  • Estimating the dependence of mixed sensitive response types in randomized response technique.

    abstract::Sensitive questions are often involved in healthcare or medical survey research. Much empirical evidence has shown that the randomized response technique is useful for the collection of truthful responses. However, few studies have discussed methods to estimate the dependence of sensitive responses of multiple types. ...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280219847492

    authors: Chu AM,So MK,Chan TW,Tiwari A

    update_date:2020-03-01 00:00:00

  • A unified approach for assessing heterogeneity in age-period-cohort model parameters using random effects.

    abstract::Age-period-cohort models are a popular tool for studying population-level rates; for example, trends in cancer incidence and mortality. Age-period-cohort models decompose observed trends into age effects that correlate with natural history, period effects that reveal factors impacting all ages simultaneously (e.g. inn...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280217713033

    authors: Chernyavskiy P,Little MP,Rosenberg PS

    update_date:2019-01-01 00:00:00

  • Long-term frailty modeling using a non-proportional hazards model: Application with a melanoma dataset.

    abstract::The semiparametric Cox regression model is often fitted in the modeling of survival data. One of its main advantages is the ease of interpretation, as long as the hazards rates for two individuals do not vary over time. In practice the proportionality assumption of the hazards may not be true in some situations. In ad...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280219883905

    authors: Calsavara VF,Milani EA,Bertolli E,Tomazella V

    update_date:2020-08-01 00:00:00

  • Sample size calculation for treatment effects in randomized trials with fixed cluster sizes and heterogeneous intraclass correlations and variances.

    abstract::When comparing two different kinds of group therapy or two individual treatments where patients within each arm are nested within care providers, clustering of observations may occur in both arms. The arms may differ in terms of (a) the intraclass correlation, (b) the outcome variance, (c) the cluster size, and (d) th...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280214563100

    authors: Candel MJ,van Breukelen GJ

    update_date:2015-10-01 00:00:00

  • Prospective analysis of infectious disease surveillance data using syndromic information.

    abstract::In this paper, we describe a Bayesian hierarchical Poisson model for the prospective analysis of data for infectious diseases. The proposed model consists of two components. The first component describes the behavior of disease during nonepidemic periods and the second component represents the increase in disease coun...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280214527385

    authors: Corberán-Vallet A,Lawson AB

    update_date:2014-12-01 00:00:00

  • Adjustment for treatment changes in epilepsy trials: A comparison of causal methods for time-to-event outcomes.

    abstract:BACKGROUND:When trials are subject to departures from randomised treatment, simple statistical methods that aim to estimate treatment efficacy, such as per protocol or as treated analyses, typically introduce selection bias. More appropriate methods to adjust for departure from randomised treatment are rarely employed,...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280217735560

    authors: Dodd S,Williamson P,White IR

    update_date:2019-03-01 00:00:00

  • A test of inflated zeros for Poisson regression models.

    abstract::Excessive zeros are common in practice and may cause overdispersion and invalidate inference when fitting Poisson regression models. There is a large body of literature on zero-inflated Poisson models. However, methods for testing whether there are excessive zeros are less well developed. The Vuong test comparing a Po...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280217749991

    authors: He H,Zhang H,Ye P,Tang W

    update_date:2019-04-01 00:00:00

  • Joint nested frailty models for clustered recurrent and terminal events: An application to colonoscopy screening visits and colorectal cancer risks in Lynch Syndrome families.

    abstract::Joint models for recurrent and terminal events have not been yet developed for clustered data. The goals of our study are to develop a statistical framework for modelling clustered recurrent and terminal events and to perform dynamic predictions of the terminal event in family studies. We propose a joint nested frailt...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280219863076

    authors: Choi YH,Jacqmin-Gadda H,Król A,Parfrey P,Briollais L,Rondeau V

    update_date:2020-05-01 00:00:00

  • Fitting mechanistic epidemic models to data: A comparison of simple Markov chain Monte Carlo approaches.

    abstract::Simple mechanistic epidemic models are widely used for forecasting and parameter estimation of infectious diseases based on noisy case reporting data. Despite the widespread application of models to emerging infectious diseases, we know little about the comparative performance of standard computational-statistical fra...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280217747054

    authors: Li M,Dushoff J,Bolker BM

    update_date:2018-07-01 00:00:00

  • Estimating the personal cure rate of cancer patients using population-based grouped cancer survival data.

    abstract::Cancer patients are subject to multiple competing risks of death and may die from causes other than the cancer diagnosed. The probability of not dying from the cancer diagnosed, which is one of the patients' main concerns, is sometimes called the 'personal cure' rate. Two approaches of modelling competing-risk surviva...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280209347046

    authors: Binbing Yu,Tiwari RC,Feuer EJ

    update_date:2011-06-01 00:00:00

  • Maximum likelihood estimation of time to first event in the presence of data gaps and multiple events.

    abstract::We propose a novel likelihood method for analyzing time-to-event data when multiple events and multiple missing data intervals are possible prior to the first observed event for a given subject. This research is motivated by data obtained from a heart monitor used to track the recovery process of subjects experiencing...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280212466089

    authors: Green CL,Brownie C,Boos DD,Lu JC,Krucoff MW

    update_date:2016-04-01 00:00:00

  • Measuring agreement in method comparison studies.

    abstract::Agreement between two methods of clinical measurement can be quantified using the differences between observations made using the two methods on the same subjects. The 95% limits of agreement, estimated by mean difference +/- 1.96 standard deviation of the differences, provide an interval within which 95% of differenc...

    journal_title:Statistical methods in medical research

    pub_type: Journal article, Review

    doi:10.1177/096228029900800204

    authors: Bland JM,Altman DG

    update_date:1999-06-01 00:00:00

  • Exact one-sided confidence limits for Cohen's kappa as a measurement of agreement.

    abstract::Cohen's kappa coefficient, κ, is a statistical measure of inter-rater agreement or inter-annotator agreement for qualitative items. In this paper, we focus on interval estimation of κ in the case of two raters and binary items. So far, only asymptotic and bootstrap intervals are available for κ due to its complexity. ...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280214552881

    authors: Shan G,Wang W

    update_date:2017-04-01 00:00:00

  • Designs in partially controlled studies: messages from a review.

    abstract::The ability to evaluate effects of factors on outcomes is increasingly important for studies that control some but not all of the factors. Although important advances have been made in methods of analysis for such partially controlled studies, work on designs has been limited. To help understand why, we review the mai...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1191/0962280205sm405oa

    authors: Li F,Frangakis CE

    update_date:2005-08-01 00:00:00

  • Letter to the editor: Fitting truncated normal distributions.

    abstract::I comment here on a recent paper in this journal, on the fitting of truncated normal distributions by the EM algorithm. I show that the fitting of such distributions by direct numerical maximization of likelihood (rather than EM) is straightforward, contrary to an assertion made by the authors of that paper. ...

    journal_title:Statistical methods in medical research

    pub_type: Comment, Letter

    doi:10.1177/0962280217712089

    authors: MacDonald IL

    update_date:2018-12-01 00:00:00

  • A curvilinear bivariate random changepoint model to assess temporal order of markers.

    abstract::In biomedical research, various longitudinal markers measuring different quantities are often collected over time. For example, repeated measures of psychometric scores are very informative about the degradation process toward dementia. These trajectories are generally nonlinear with an acceleration of the decline a f...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280219898719

    authors: Segalas C,Helmer C,Jacqmin-Gadda H

    update_date:2020-09-01 00:00:00

  • Increasing efficiency from censored survival data by using random effects to model longitudinal covariates.

    abstract::When estimating a survival time distribution, the loss of information due to right censoring results in a loss of efficiency in the estimator. In many circumstances, however, repeated measurements on a longitudinal process which is associated with survival time are made throughout the observation time, and these measu...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/096228029800700104

    authors: Hogan JW,Laird NM

    update_date:1998-03-01 00:00:00

  • A generalization of functional clustering for discrete multivariate longitudinal data.

    abstract::This paper presents a new model-based generalized functional clustering method for discrete longitudinal data, such as multivariate binomial and Poisson distributed data. For this purpose, we propose a multivariate functional principal component analysis (MFPCA)-based clustering procedure for a latent multivariate Gau...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280220921912

    authors: Lim Y,Cheung YK,Oh HS

    update_date:2020-11-01 00:00:00

  • Study design for epidemiologic studies with measurement error.

    abstract::Exposure measurement error in epidemiological studies is recognized as a feature that must be considered because of the potential bias that can result in estimates of the exposure-disease association. Most of the work to date has focused on methods of analysis that adjust for the resultant bias, but the implications o...

    journal_title:Statistical methods in medical research

    pub_type: Journal article, Review

    doi:10.1177/096228029500400405

    authors: Holford TR,Stack C

    update_date:1995-12-01 00:00:00

  • A kernel-based spatio-temporal surveillance system for monitoring influenza-like illness incidence.

    abstract::The threat of pandemics has made influenza surveillance systems a priority in epidemiology services around the world. The emergence of A-H1N1 influenza has required accurate surveillance systems in order to undertake specific actions only when and where they are necessary. In that sense, the main goal of this article ...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280210370265

    authors: Martinez-Beneito MA,Botella-Rocamora P,Zurriaga O

    update_date:2011-04-01 00:00:00

  • The cross-validated AUC for MCP-logistic regression with high-dimensional data.

    abstract::We propose a cross-validated area under the receiving operator characteristic (ROC) curve (CV-AUC) criterion for tuning parameter selection for penalized methods in sparse, high-dimensional logistic regression models. We use this criterion in combination with the minimax concave penalty (MCP) method for variable selec...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280211428385

    authors: Jiang D,Huang J,Zhang Y

    update_date:2013-10-01 00:00:00

  • Relative efficiency of unequal cluster sizes for variance component estimation in cluster randomized and multicentre trials.

    abstract::Cluster randomized and multicentre trials evaluate the effect of a treatment on persons nested within clusters, for instance patients within clinics or pupils within schools. Although equal sample sizes per cluster are generally optimal for parameter estimation, they are rarely feasible. This paper addresses the relat...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280206079018

    authors: van Breukelen GJ,Candel MJ,Berger MP

    update_date:2008-08-01 00:00:00

  • Efficient estimation of a linear transformation model for current status data via penalized splines.

    abstract::We propose a flexible and computationally efficient penalized estimation method for a semi-parametric linear transformation model with current status data. To facilitate model fitting, the unknown monotone function is approximated by monotone B-splines, and a computationally efficient hybrid algorithm involving the Fi...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280218820406

    authors: Lu M,Liu Y,Li CS

    update_date:2020-01-01 00:00:00

  • Bayesian nonparametric mixed-effects joint model for longitudinal-competing risks data analysis in presence of multiple data features.

    abstract::Recently, the joint analysis of longitudinal and survival data has been an active research area. Most joint models focus on survival data with only one type of failure. The research on joint modeling of longitudinal and competing risks survival data is sparse. Even so, many joint models for this type of data assume pa...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280215597939

    authors: Lu T

    update_date:2017-10-01 00:00:00

  • Correcting for non-participation bias in health surveys using record-linkage, synthetic observations and pattern mixture modelling.

    abstract::Surveys are key means of obtaining policy-relevant information not available from routine sources. Bias arising from non-participation is typically handled by applying weights derived from limited socio-demographic characteristics. This approach neither captures nor adjusts for differences in health and related behavi...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280219854482

    authors: Gray L,Gorman E,White IR,Katikireddi SV,McCartney G,Rutherford L,Leyland AH

    update_date:2020-04-01 00:00:00

  • Separating variability in healthcare practice patterns from random error.

    abstract::Improving the quality of care that patients receive is a major focus of clinical research, particularly in the setting of cardiovascular hospitalization. Quality improvement studies seek to estimate and visualize the degree of variability in dichotomous treatment patterns and outcomes across different providers, where...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280217754230

    authors: Thomas LE,Schulte PJ

    update_date:2019-04-01 00:00:00

  • Power and sample size for multivariate logistic modeling of unmatched case-control studies.

    abstract::Sample size calculations are needed to design and assess the feasibility of case-control studies. Although such calculations are readily available for simple case-control designs and univariate analyses, there is limited theory and software for multivariate unconditional logistic analysis of case-control data. Here we...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280217737157

    authors: Gail MH,Haneuse S

    update_date:2019-03-01 00:00:00

  • A mixed-effects, spatially varying coefficients model with application to multi-resolution functional magnetic resonance imaging data.

    abstract::Spatial resolution plays an important role in functional magnetic resonance imaging studies as the signal-to-noise ratio increases linearly with voxel volume. In scientific studies, where functional magnetic resonance imaging is widely used, the standard spatial resolution typically used is relatively low which ensure...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280217752378

    authors: Liu Z,Bartsch AJ,Berrocal VJ,Johnson TD

    update_date:2019-04-01 00:00:00

  • Estimating the average treatment effects of nutritional label use using subclassification with regression adjustment.

    abstract::Propensity score methods are common for estimating a binary treatment effect when treatment assignment is not randomized. When exposure is measured on an ordinal scale (i.e. low-medium-high), however, propensity score inference requires extensions which have received limited attention. Estimands of possible interest w...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280214560046

    authors: Lopez MJ,Gutman R

    update_date:2017-04-01 00:00:00

  • Stochastic models of sequence evolution including insertion-deletion events.

    abstract::Comparison of sequences that have descended from a common ancestor based on an explicit stochastic model of substitutions, insertions and deletions has risen to prominence in the last decade. Making statements about the positions of insertions-deletions (abbr. indels) is central in sequence and genome analysis and is ...

    journal_title:Statistical methods in medical research

    pub_type: Journal article

    doi:10.1177/0962280208099500

    authors: Miklós I,Novák A,Satija R,Lyngsø R,Hein J

    update_date:2009-10-01 00:00:00