Two-group Poisson-Dirichlet mixtures for multiple testing.

Abstract:

:The simultaneous testing of multiple hypotheses is common to the analysis of high-dimensional data sets. The two-group model, first proposed by Efron, identifies significant comparisons by allocating observations to a mixture of an empirical null and an alternative distribution. In the Bayesian nonparametrics literature, many approaches have suggested using mixtures of Dirichlet Processes in the two-group model framework. Here, we investigate employing mixtures of two-parameter Poisson-Dirichlet Processes instead, and show how they provide a more flexible and effective tool for large-scale hypothesis testing. Our model further employs nonlocal prior densities to allow separation between the two mixture components. We obtain a closed-form expression for the exchangeable partition probability function of the two-group model, which leads to a straightforward Markov Chain Monte Carlo implementation. We compare the performance of our method for large-scale inference in a simulation study and illustrate its use on both a prostate cancer data set and a case-control microbiome study of the gastrointestinal tracts in children from underdeveloped countries who have been recently diagnosed with moderate-to-severe diarrhea.

journal_name

Biometrics

journal_title

Biometrics

authors

Denti F,Guindani M,Leisen F,Lijoi A,Wadsworth WD,Vannucci M

doi

10.1111/biom.13314

subject

Has Abstract

pub_date

2020-06-14 00:00:00

eissn

0006-341X

issn

1541-0420

pub_type

杂志文章
  • On identifiability in capture-recapture models.

    abstract::We study the issue of identifiability of mixture models in the context of capture-recapture abundance estimation for closed populations. Such models are used to take account of individual heterogeneity in capture probabilities, but their validity was recently questioned by Link (2003, Biometrics 59, 1123-1130) on the ...

    journal_title:Biometrics

    pub_type: 评论,杂志文章

    doi:10.1111/j.1541-0420.2006.00637_1.x

    authors: Holzmann H,Munk A,Zucchini W

    更新日期:2006-09-01 00:00:00

  • Efficient analysis of Weibull survival data from experiments on heterogeneous patient populations.

    abstract::An efficient method is presented for analyses of death rated in one-way or cross-classified experiments where expected survival time for a patient at time of entry on trial is a function of observable covariates. The survival-time distribution used is a Weibull form of Cox's (1972) model. The analysis proceeds in two ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Williams JS

    更新日期:1978-06-01 00:00:00

  • G-estimation and artificial censoring: problems, challenges, and applications.

    abstract::In principle, G-estimation is an attractive approach for dealing with confounding by variables affected by treatment. It has rarely been applied for estimation of the effects of treatment on failure-time outcomes. Part of this is due to artificial censoring, an analytic device which considers some subjects who actuall...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2011.01656.x

    authors: Joffe MM,Yang WP,Feldman H

    更新日期:2012-03-01 00:00:00

  • Optimum experimental designs for properties of a compartmental model.

    abstract::Three properties of interest in bioavailability studies using compartmental models are the area under the concentration curve, the maximum concentration, and the time to maximum concentration. Methods are described for finding designs that minimize the variance of the estimates of these quantities in such a model. The...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Atkinson AC,Chaloner K,Herzberg AM,Juritz J

    更新日期:1993-06-01 00:00:00

  • Regional spatial modeling of topsoil geochemistry.

    abstract::Geographic information about the levels of toxics in environmental media is commonly used in regional environmental health studies when direct measurements of personal exposure is limited or unavailable. In this article, we propose a statistical framework for analyzing the spatial distribution of topsoil geochemical p...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2008.01041.x

    authors: Calder CA,Craigmile PF,Zhang J

    更新日期:2009-03-01 00:00:00

  • A Markov model for analysing cancer markers and disease states in survival studies.

    abstract::In studies of serial cancer markers or disease states and their relation to survival, data on the marker or state are usually obtained at infrequent time points during follow-up. A Markov model is developed to assess the dependence of risk of death on marker level or disease state and inferences within this model are ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Kay R

    更新日期:1986-12-01 00:00:00

  • Marginal analysis of correlated failure time data with informative cluster sizes.

    abstract::We consider modeling correlated survival data when cluster sizes may be informative to the outcome of interest based on a within-cluster resampling (WCR) approach and a weighted score function (WSF) method. We derive the large sample properties for the WCR estimators under the Cox proportional hazards model. We establ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2006.00730.x

    authors: Cong XJ,Yin G,Shen Y

    更新日期:2007-09-01 00:00:00

  • On the use of the variogram in checking for independence in spatial data.

    abstract::The variogram is a standard tool in the analysis of spatial data, and its shape provides useful information on the form of spatial correlation that may be present. However, it is also useful to be able to assess the evidence for the presence of any spatial correlation. A method of doing this, based on an assessment of...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2001.00211.x

    authors: Diblasi A,Bowman AW

    更新日期:2001-03-01 00:00:00

  • Accelerated hazards model based on parametric families generalized with Bernstein polynomials.

    abstract::A transformed Bernstein polynomial that is centered at standard parametric families, such as Weibull or log-logistic, is proposed for use in the accelerated hazards model. This class provides a convenient way towards creating a Bayesian nonparametric prior for smooth densities, blending the merits of parametric and no...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12104

    authors: Chen Y,Hanson T,Zhang J

    更新日期:2014-03-01 00:00:00

  • Biometry and medical statistics.

    abstract::The "biometric school" founded by K. Pearson, F. Galton, and W. F. R. Weldon was concerned especially with heredity and variation, and between the wars "biometry" was not widely used as a general term for quantitative biology. The foundation of the Biometric Society encouraged this wider usage, and medical and biologi...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Armitage P

    更新日期:1985-12-01 00:00:00

  • Statistical methods in ophthalmology: an adjusted chi-square approach.

    abstract::Ophthalmologic studies often compare several groups of subjects for the presence or absence of some ocular finding, where each subject may contribute two eyes to the analysis, the values from the two eyes being highly correlated. Rosner (1982, Biometrics 38, 105-114) and Dallal (1988, Biometrics 44, 253-257) proposed ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Donner A

    更新日期:1989-06-01 00:00:00

  • A latent model to detect multiple clusters of varying sizes.

    abstract::This article develops a latent model and likelihood-based inference to detect temporal clustering of events. The model mimics typical processes generating the observed data. We apply model selection techniques to determine the number of clusters, and develop likelihood inference and a Monte Carlo expectation-maximizat...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2009.01197.x

    authors: Xie M,Sun Q,Naus J

    更新日期:2009-12-01 00:00:00

  • Prediction of random effects in linear and generalized linear models under model misspecification.

    abstract::Statistical models that include random effects are commonly used to analyze longitudinal and correlated data, often with the assumption that the random effects follow a Gaussian distribution. Via theoretical and numerical calculations and simulation, we investigate the impact of misspecification of this distribution o...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2010.01435.x

    authors: McCulloch CE,Neuhaus JM

    更新日期:2011-03-01 00:00:00

  • Estimating diagnostic accuracy of raters without a gold standard by exploiting a group of experts.

    abstract::In diagnostic medicine, estimating the diagnostic accuracy of a group of raters or medical tests relative to the gold standard is often the primary goal. When a gold standard is absent, latent class models where the unknown gold standard test is treated as a latent variable are often used. However, these models have b...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2012.01789.x

    authors: Zhang B,Chen Z,Albert PS

    更新日期:2012-12-01 00:00:00

  • First passage times as environmental safety indicators: carboxyhemoglobin from cigarette smoke.

    abstract::The concentration of carbon monoxide in the blood of a cigarette smoker varies in response to the frequency and dose of CO delivered by the cigarettes he smokes and by the rate at which CO washes out of his blood. Moments of first passage times or exit times above a nominal threshold can be calculated using a stochast...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Marcus AH,Czajkowski S Jr

    更新日期:1979-09-01 00:00:00

  • Latent variable models for longitudinal data with multiple continuous outcomes.

    abstract::Multiple outcomes are often used to properly characterize an effect of interest. This paper proposes a latent variable model for the situation where repeated measures over time are obtained on each outcome. These outcomes are assumed to measure an underlying quantity of main interest from different perspectives. We re...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2000.01047.x

    authors: Roy J,Lin X

    更新日期:2000-12-01 00:00:00

  • N-mixture models for estimating population size from spatially replicated counts.

    abstract::Spatial replication is a common theme in count surveys of animals. Such surveys often generate sparse count data from which it is difficult to estimate population size while formally accounting for detection probability. In this article, I describe a class of models (N-mixture models) which allow for estimation of pop...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341X.2004.00142.x

    authors: Royle JA

    更新日期:2004-03-01 00:00:00

  • An adaptive independence test for microbiome community data.

    abstract::Advances in sequencing technologies and bioinformatics tools have vastly improved our ability to collect and analyze data from complex microbial communities. A major goal of microbiome studies is to correlate the overall microbiome composition with clinical or environmental variables. La Rosa et al. recently proposed ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.13154

    authors: Song Y,Zhao H,Wang T

    更新日期:2020-06-01 00:00:00

  • Connecting the latent multinomial.

    abstract::Link et al. (2010, Biometrics 66, 178-185) define a general framework for analyzing capture-recapture data with potential misidentifications. In this framework, the observed vector of counts, y, is considered as a linear function of a vector of latent counts, x, such that y=Ax, with x assumed to follow a multinomial d...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/biom.12333

    authors: Schofield MR,Bonner SJ

    更新日期:2015-12-01 00:00:00

  • A method for estimating incidence rates of onchocerciasis from skin-snip biopsies with consideration of false negatives.

    abstract::The aim of this study is to estimate incidence rates of onchocerciasis from skin-snip biopsies, based on incomplete data obtained in field surveys, with consideration of false negatives. The method of maximum likelihood is employed and the effect of false negatives on the incidence rates is discussed. ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Yanagawa T,Kasagi F,Yoshimura T

    更新日期:1984-06-01 00:00:00

  • On Bayesian methods for bioequivalence.

    abstract::Bayesian methods are presented for assessing bioequivalence for studies in which a new formulation and a standard are administered simultaneously, and for Latin square designs which compare two or more new formulations to a standard. Two examples illustrate the application of the methods. ...

    journal_title:Biometrics

    pub_type: 临床试验,杂志文章

    doi:

    authors: Selwyn MR,Hall NR

    更新日期:1984-12-01 00:00:00

  • A network-based analysis of the 1861 Hagelloch measles data.

    abstract::In this article, we demonstrate a statistical method for fitting the parameters of a sophisticated network and epidemic model to disease data. The pattern of contacts between hosts is described by a class of dyadic independence exponential-family random graph models (ERGMs), whereas the transmission process that runs ...

    journal_title:Biometrics

    pub_type: 历史文章,杂志文章

    doi:10.1111/j.1541-0420.2012.01748.x

    authors: Groendyke C,Welch D,Hunter DR

    更新日期:2012-09-01 00:00:00

  • Joint modeling of progression of HIV resistance mutations measured with uncertainty and failure time data.

    abstract::Development of HIV resistance mutations is a major cause for failure of antiretroviral treatment. This article proposes a method for jointly modeling the processes of viral genetic changes and treatment failure. Because the viral genome is measured with uncertainty, a hidden Markov model is used to fit the viral genet...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2006.00635.x

    authors: Hu C,De Gruttola V

    更新日期:2007-03-01 00:00:00

  • Analysis of ordered categorical data: two score-independent approaches.

    abstract:SUMMARY:A trend test is often employed to analyze ordered categorical data, in which a set of increasing scores is assigned a priori. There is a drawback in this approach, because how to choose a set of scores is not clear. There have been debates on which scores should be used (e.g., Graubard and Korn, 1987, Biometric...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2008.00992.x

    authors: Zheng G

    更新日期:2008-12-01 00:00:00

  • Some scale estimators and lack-of-fit tests for the censored two-sample accelerated life model.

    abstract::Some new scale estimators for the censored two-sample accelerated life model are introduced. They are zeros of some integrated weighted difference between the two cumulative hazard estimators. These estimators are asymptotically normal. The weight is chosen to result in estimators whose asymptotic variances do not inv...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Yang S

    更新日期:1998-09-01 00:00:00

  • Linear mixed models with flexible distributions of random effects for longitudinal data.

    abstract::Normality of random effects is a routine assumption for the linear mixed model, but it may be unrealistic, obscuring important features of among-individual variation. We relax this assumption by approximating the random effects density by the seminonparameteric (SNP) representation of Gallant and Nychka (1987, Econome...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2001.00795.x

    authors: Zhang D,Davidian M

    更新日期:2001-09-01 00:00:00

  • A general methodology for the analysis of experiments with repeated measurement of categorical data.

    abstract::This paper is concerned with the analysis of multivariate categorical data which are obtained from repeated measurement experiments. An expository discussion of pertinent hypotheses for such situations is given, and appropriate test statistics are developed through the application of weighted least squares methods. Sp...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:

    authors: Koch GG,Landis JR,Freeman JL,Freeman DH Jr,Lehnen RC

    更新日期:1977-03-01 00:00:00

  • Coregionalized single- and multiresolution spatially varying growth curve modeling with application to weed growth.

    abstract::Modeling of longitudinal data from agricultural experiments using growth curves helps understand conditions conducive or unconducive to crop growth. Recent advances in Geographical Information Systems (GIS) now allow geocoding of agricultural data that help understand spatial patterns. A particularly common problem is...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2006.00535.x

    authors: Banerjee S,Johnson GA

    更新日期:2006-09-01 00:00:00

  • A score regression approach to assess calibration of continuous probabilistic predictions.

    abstract::Calibration, the statistical consistency of forecast distributions and the observations, is a central requirement for probabilistic predictions. Calibration of continuous forecasts is typically assessed using the probability integral transform histogram. In this article, we propose significance tests based on scoring ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.1541-0420.2010.01406.x

    authors: Held L,Rufibach K,Balabdaoui F

    更新日期:2010-12-01 00:00:00

  • The use of frailty hazard models for unrecognized heterogeneity that interacts with treatment: considerations of efficiency and power.

    abstract::Increasingly, genetic studies of tumors of the same histologic diagnosis are elucidating subtypes that are distinct with respect to clinical endpoints such as response to treatment and survival. This raises concerns about the efficiency of using the simple log-rank test for analysis of treatment effect on survival in ...

    journal_title:Biometrics

    pub_type: 杂志文章

    doi:10.1111/j.0006-341x.2002.00232.x

    authors: Li Y,Betensky RA,Louis DN,Cairncross JG

    更新日期:2002-03-01 00:00:00