Assessing the goodness-of-fit of hidden Markov models.

Abstract:

:In this article, we propose a graphical technique for assessing the goodness-of-fit of a stationary hidden Markov model (HMM). We show that plots of the estimated distribution against the empirical distribution detect lack of fit with high probability for large sample sizes. By considering plots of the univariate and multidimensional distributions, we are able to examine the fit of both the assumed marginal distribution and the correlation structure of the observed data. We provide general conditions for the convergence of the empirical distribution to the true distribution, and demonstrate that these conditions hold for a wide variety of time-series models. Thus, our method allows us to compare not only the fit of different HMMs, but also that of other models as well. We illustrate our technique using a multiple sclerosis data set.

journal_name

Biometrics

journal_title

Biometrics

authors

MacKay Altman R

doi

10.1111/j.0006-341X.2004.00189.x

subject

Has Abstract

pub_date

2004-06-01 00:00:00

pages

444-50

issue

2

eissn

0006-341X

issn

1541-0420

pii

BIOM189

journal_volume

60

pub_type

杂志文章
  • Procedures for comparing samples with multiple endpoints.

    abstract::Five procedures are considered for the comparison of two or more multivariate samples. These procedures include a newly proposed nonparametric rank-sum test and a generalized least squares test. Also considered are the following tests: ordinary least squares, Hotelling's T2, and a Bonferroni per-experiment error-rate ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: O'Brien PC

    更新日期:1984-12-01 00:00:00

  • Statistical significance for hierarchical clustering.

    abstract::Cluster analysis has proved to be an invaluable tool for the exploratory and unsupervised analysis of high-dimensional datasets. Among methods for clustering, hierarchical approaches have enjoyed substantial popularity in genomics and other fields for their ability to simultaneously uncover multiple layers of clusteri...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12647

    authors: Kimes PK,Liu Y,Neil Hayes D,Marron JS

    更新日期:2017-09-01 00:00:00

  • The gamma-frailty Poisson model for the nonparametric estimation of panel count data.

    abstract::In this article, we study nonparametric estimation of the mean function of a counting process with panel observations. We introduce the gamma frailty variable to account for the intracorrelation between the panel counts of the counting process and construct a maximum pseudo-likelihood estimate with the frailty variabl...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2003.00126.x

    authors: Zhang Y,Jamshidian M

    更新日期:2003-12-01 00:00:00

  • Bayesian partitioning for modeling and mapping spatial case-control data.

    abstract::Methods for modeling and mapping spatial variation in disease risk continue to motivate much research. In particular, spatial analyses provide a useful tool for exploring geographical heterogeneity in health outcomes, and consequently can yield clues as to disease etiology, direct public health management, and generat...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2008.01193.x

    authors: Costain DA

    更新日期:2009-12-01 00:00:00

  • Pedigree analysis for quantitative traits: variance components without matrix inversion.

    abstract::Recent developments in the animal breeding literature facilitate estimation of the variance components in quantitative genetic models. However, computation remains intensive, and many of the procedures are restricted to specialized designs and models, unsuited to data arising from studies of natural populations. We de...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Thompson EA,Shaw RG

    更新日期:1990-06-01 00:00:00

  • Maximum likelihood estimation in dynamical models of HIV.

    abstract::The study of dynamical models of HIV infection, based on a system of nonlinear ordinary differential equations (ODE), has considerably improved the knowledge of its pathogenesis. While the first models used simplified ODE systems and analyzed each patient separately, recent works dealt with inference in non-simplified...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2007.00812.x

    authors: Guedj J,Thiébaut R,Commenges D

    更新日期:2007-12-01 00:00:00

  • A comparison of several point estimators of the odds ratio in a single 2 x 2 contingency table.

    abstract::The relative performance of the unconditioned maximum likelihood estimators (UMLEs), conditional MLEs (CMLEs), and Jewell-type estimators of the odds ratio (OR) and its logarithm were investigated in sets of single 2 x 2 contingency tables. The tables were generated by complete enumeration of all possible cell frequen...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Walter SD,Cook RJ

    更新日期:1991-09-01 00:00:00

  • A century of biometrical genetics.

    abstract::We briefly review the major contribution of biometrics to genetics over the last century (population genetic models, familial correlations, segregation analysis, and gene mapping) and current areas of active research and then speculate about what problems will be tackled in the next century. ...

    journal_title:Biometrics

    pub_type: 社论,历史文章,评审

    doi:10.1111/j.0006-341x.2000.00659.x

    authors: Elston RC,Thompson EA

    更新日期:2000-09-01 00:00:00

  • Outlier reduction by an option-3 measurement scheme.

    abstract::Detecting changes in longitudinal data is important in medical research. However, the existence of measurement outliers can cause an unexpected increase in the false alarm rate in claiming changes. To reduce the outliers, a new method has been developed. In this scheme, two measures are initially taken and, if they ar...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Namgung YY,Yang MC

    更新日期:1994-03-01 00:00:00

  • Maximum likelihood estimation for incomplete repeated-measures experiments under an ARMA covariance structure.

    abstract::A stochastic model is presented for the analysis of incomplete repeated-measures experiments. The general linear model is used to relate the response measures to other variables which are thought to account for inherent variation; an autoregressive moving average (ARMA) time series representation is used to model dist...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Rochon J,Helms RW

    更新日期:1989-03-01 00:00:00

  • Testing for Hardy-Weinberg equilibrium.

    abstract::The class of admissible tests for Hardy-Weinberg equilibrium in a multi-allelic system is characterized. The standard goodness-of-fit chi-square tests is shown to be admissible for systems of two or more alleles. The conditional probability distribution required to determine the exact significance level of this test i...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Ledwina T,Gnot S

    更新日期:1980-03-01 00:00:00

  • Tests for monotone mean residual life, using randomly censored data.

    abstract::At any age the mean residual life function gives the expected remaining life at that age. Reliabilists and biometricians have found it useful to categorize failure distributions by the monotonicity properties of the mean residual life function. Hollander and Proschan (1975, Biometrika 62, 585-593) have derived tests o...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Chen YY,Hollander M,Langberg NA

    更新日期:1983-03-01 00:00:00

  • Randomization inference with general interference and censoring.

    abstract::Interference occurs between individuals when the treatment (or exposure) of one individual affects the outcome of another individual. Previous work on causal inference methods in the presence of interference has focused on the setting where it is a priori assumed that there is "partial interference," in the sense that...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.13125

    authors: Loh WW,Hudgens MG,Clemens JD,Ali M,Emch ME

    更新日期:2020-03-01 00:00:00

  • Semiparametric regression of multidimensional genetic pathway data: least-squares kernel machines and linear mixed models.

    abstract::We consider a semiparametric regression model that relates a normal outcome to covariates and a genetic pathway, where the covariate effects are modeled parametrically and the pathway effect of multiple gene expressions is modeled parametrically or nonparametrically using least-squares kernel machines (LSKMs). This un...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2007.00799.x

    authors: Liu D,Lin X,Ghosh D

    更新日期:2007-12-01 00:00:00

  • Fitting mixture models to grouped and truncated data via the EM algorithm.

    abstract::The fitting of finite mixture models via the EM algorithm is considered for data which are available only in grouped form and which may also be truncated. A practical example is presented where a mixture of two doubly truncated log-normal distributions is adopted to model the distribution of the volume of red blood ce...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: McLachlan GJ,Jones PN

    更新日期:1988-06-01 00:00:00

  • Asynchronous distance between homologous DNA sequences.

    abstract::The distance between homologous DNA sequences of two species is proposed to be -1/4 ln[det(P)], where P is the conditional probability matrix specifying the proportions of the various nucleotides in the second sequence, corresponding to each of the four nucleotides in the first sequence. A probability model is describ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Barry D,Hartigan JA

    更新日期:1987-06-01 00:00:00

  • An alternative approach to confidence interval estimation for the win ratio statistic.

    abstract::Pocock et al. (2012, European Heart Journal 33, 176-182) proposed a win ratio approach to analyzing composite endpoints comprised of outcomes with different clinical priorities. In this article, we establish a statistical framework for this approach. We derive the null hypothesis and propose a closed-form variance est...

    journal_title:Biometrics

    pub_type: 杂志文章,随机对照试验

    doi:10.1111/biom.12225

    authors: Luo X,Tian H,Mohanty S,Tsai WY

    更新日期:2015-03-01 00:00:00

  • A Bayesian goodness of fit test and semiparametric generalization of logistic regression with measurement data.

    abstract::Logistic regression is a popular tool for risk analysis in medical and population health science. With continuous response data, it is common to create a dichotomous outcome for logistic regression analysis by specifying a threshold for positivity. Fitting a linear regression to the nondichotomized response variable a...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12007

    authors: Schörgendorfer A,Branscum AJ,Hanson TE

    更新日期:2013-06-01 00:00:00

  • A logistic-bivariate normal model for overdispersed two-state Markov processes.

    abstract::We describe a logistic-bivariate normal mixture model for a two-state Markov chain in which each individual makes transitions between states according to a subject-specific transition probability matrix. The use of the bivariate normal mixing distribution facilitates inferences regarding the correlation of the random ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Cook RJ,Ng ET

    更新日期:1997-03-01 00:00:00

  • Prediction in the presence of measurement error: general discussion and an example predicting defoliation.

    abstract::Motivated by the particular problem of predicting defoliation based on a measure of gypsy moth egg mass density, prediction in the presence of measurement error is discussed. The measurement error variances and covariances are allowed to vary from unit to unit and are estimated by some type of within unit sampling. A ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Buonaccorsi JP

    更新日期:1995-12-01 00:00:00

  • A model-based approach for making ecological inference from distance sampling data.

    abstract::We consider a fully model-based approach for the analysis of distance sampling data. Distance sampling has been widely used to estimate abundance (or density) of animals or plants in a spatially explicit study area. There is, however, no readily available method of making statistical inference on the relationships bet...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2009.01265.x

    authors: Johnson DS,Laake JL,Ver Hoef JM

    更新日期:2010-03-01 00:00:00

  • Nonparametric discrete survival function estimation with uncertain endpoints using an internal validation subsample.

    abstract::When a true survival endpoint cannot be assessed for some subjects, an alternative endpoint that measures the true endpoint with error may be collected, which often occurs when obtaining the true endpoint is too invasive or costly. We develop an estimated likelihood function for the situation where we have both uncert...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12316

    authors: Zee J,Xie SX

    更新日期:2015-09-01 00:00:00

  • Hypothesis testing under mixture models: application to genetic linkage analysis.

    abstract::In this paper we propose a new class of statistics to test a simple hypothesis against a family of alternatives characterized by a mixture model. Unlike the likelihood ratio statistic, whose large sample distribution is still unknown in this situation, these new statistics have a simple asymptotic distribution to whic...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.1999.00065.x

    authors: Liang KY,Rathouz PJ

    更新日期:1999-03-01 00:00:00

  • Estimation and interpretation of heterogeneous vaccine efficacy against recurrent infections.

    abstract::Vaccine-induced protection may not be homogeneous across individuals. It is possible that a vaccine gives complete protection for a portion of individuals, while the rest acquire only incomplete (leaky) protection of varying magnitude. If vaccine efficacy is estimated under wrong assumptions about such individual leve...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12473

    authors: Mehtälä J,Dagan R,Auranen K

    更新日期:2016-09-01 00:00:00

  • Spatial-temporal modeling of the association between air pollution exposure and preterm birth: identifying critical windows of exposure.

    abstract::Exposure to high levels of air pollution during the pregnancy is associated with increased probability of preterm birth (PTB), a major cause of infant morbidity and mortality. New statistical methodology is required to specifically determine when a particular pollutant impacts the PTB outcome, to determine the role of...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2012.01774.x

    authors: Warren J,Fuentes M,Herring A,Langlois P

    更新日期:2012-12-01 00:00:00

  • Improved estimation of the noncentrality parameter distribution from a large number of t-statistics, with applications to false discovery rate estimation in microarray data analysis.

    abstract::Given a large number of t-statistics, we consider the problem of approximating the distribution of noncentrality parameters (NCPs) by a continuous density. This problem is closely related to the control of false discovery rates (FDR) in massive hypothesis testing applications, e.g., microarray gene expression analysis...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2012.01764.x

    authors: Qu L,Nettleton D,Dekkers JC

    更新日期:2012-12-01 00:00:00

  • Statistical methods for classification of human chromosomes.

    abstract::The basic technical facts of human cytogenetics and the laboratory methods employed in chromosome research are explained in simple terms. The main variables used to describe chromosome images are defined and discussed. Three discriminant analysis models for chromosome classification are developed: one in which each ch...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Habbema JD

    更新日期:1979-03-01 00:00:00

  • Heterogeneous capture-recapture models with covariates: a partial likelihood approach for closed populations.

    abstract::In practice, when analyzing data from a capture-recapture experiment it is tempting to apply modern advanced statistical methods to the observed capture histories. However, unless the analysis takes into account that the data have only been collected from individuals who have been captured at least once, the results m...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2011.01596.x

    authors: Stoklosa J,Hwang WH,Wu SH,Huggins R

    更新日期:2011-12-01 00:00:00

  • On pooling across strata when frequency matching has been followed in a cohort study.

    abstract::In a study designed to assess the relationship between a dichotomous exposure and the eventual occurrence of a dichotomous outcome, frequency matching has been proposed as a way to balance the exposure cohorts with respect to the sampling distribution of potential confounding factors. This paper discusses the pooled e...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Weinberg CR

    更新日期:1985-03-01 00:00:00

  • Sequential model selection-based segmentation to detect DNA copy number variation.

    abstract::Array-based CGH experiments are designed to detect genomic aberrations or regions of DNA copy-number variation that are associated with an outcome, typically a state of disease. Most of the existing statistical methods target on detecting DNA copy number variations in a single sample or array. We focus on the detectio...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12478

    authors: Hu J,Zhang L,Wang HJ

    更新日期:2016-09-01 00:00:00