Large scale maximum average power multiple inference on time-course count data with application to RNA-seq analysis.

Abstract:

Experiments that longitudinally collect RNA sequencing (RNA-seq) data can provide transformative insights in biology research by revealing the dynamic patterns of genes. Such experiments create a great demand for new analytic approaches to identify differentially expressed (DE) genes based on large-scale time-course count data. Existing methods, however, are suboptimal with respect to power and may lack theoretical justification. Furthermore, most existing tests are designed to distinguish among conditions based on overall differential patterns across time, though in practice, a variety of composite hypotheses are of more scientific interest. Finally, some current methods may fail to control the false discovery rate. In this paper, we propose a new model and testing procedure to address the above issues simultaneously. Specifically, conditional on a latent Gaussian mixture with evolving means, we model the data by negative binomial distributions. Motivated by Storey (2007) and Hwang and Liu (2010), we introduce a general testing framework based on the proposed model and show that the proposed test enjoys the optimality property of maximum average power. The test allows not only identification of traditional DE genes but also testing of a variety of composite hypotheses of biological interest. We establish the identifiability of the proposed model, implement the proposed method via efficient algorithms, and demonstrate its good performance via simulation studies. The procedure reveals interesting biological insights when applied to data from an experiment that examines the effect of varying light environments on the fundamental physiology of the marine diatom Phaeodactylum tricornutum.

journal_name

Biometrics

journal_title

Biometrics

authors

Cao M,Zhou W,Breidt FJ,Peers G

doi

10.1111/biom.13144

subject

Has Abstract

pub_date

2020-03-01 00:00:00

pages

9-22

issue

1

eissn

1541-0420

issn

0006-341X

journal_volume

76

pub_type

Journal Article
  • Valid inference in random effects meta-analysis.

    abstract::The standard approach to inference for random effects meta-analysis relies on approximating the null distribution of a test statistic by a standard normal distribution. This approximation is asymptotic on k, the number of studies, and can be substantially in error in medical meta-analyses, which often have only a few ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.1999.00732.x

    authors: Follmann DA,Proschan MA

    updated:1999-09-01 00:00:00

  • Correcting for the effect of misclassification bias in a case-control study using data from two different questionnaires.

    abstract::In an epidemiological study of risk factors in breast cancer, data are available on confirmed cases from a diagnostic clinic and on controls from a screening clinic that sampled the general population. Relative risk estimation is complicated by differences in the interviewing environment and in the wording and order o...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Elton RA,Duffy SW

    updated:1983-09-01 00:00:00

  • Bayesian inference for two-phase studies with categorical covariates.

    abstract::In this article, we consider two-phase sampling in the situation in which all covariates are categorical. Two-phase designs are appealing from an efficiency perspective since they allow sampling to be concentrated in informative cells. A number of likelihood-based methods have been developed for the analysis of two-ph...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12019

    authors: Ross M,Wakefield J

    updated:2013-06-01 00:00:00

  • Semiparametric estimation of the covariate-specific ROC curve in presence of ignorable verification bias.

    abstract::Covariate-specific receiver operating characteristic (ROC) curves are often used to evaluate the classification accuracy of a medical diagnostic test or a biomarker, when the accuracy of the test is associated with certain covariates. In many large-scale screening tests, the gold standard is subject to missingness due...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2011.01562.x

    authors: Liu D,Zhou XH

    updated:2011-09-01 00:00:00

  • A study of deleterious gene structure in plants using Markov chain Monte Carlo.

    abstract::The characteristics of deleterious genes have been of great interest in both theory and practice in genetics. Because of the complex genetic mechanism of these deleterious genes, most current studies try to estimate the overall magnitude of mortality effects on a population, which is characterized classically by the n...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.1999.00376.x

    authors: Lee JK,Lascoux M,Newton MA,Nordheim EV

    updated:1999-06-01 00:00:00

  • A generalized concordance correlation coefficient based on the variance components generalized linear mixed models for overdispersed count data.

    abstract::The classical concordance correlation coefficient (CCC) to measure agreement among a set of observers assumes data to be distributed as normal and a linear relationship between the mean and the subject and observer effects. Here, the CCC is generalized to afford any distribution from the exponential family by means of...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2009.01335.x

    authors: Carrasco JL

    updated:2010-09-01 00:00:00

  • Sequential equivalence testing and repeated confidence intervals, with applications to normal and binary responses.

    abstract::We propose group sequential tests of the equivalence of two treatments based on ideas related to repeated confidence intervals. These tests adapt readily to unpredictable group sizes, to the possibility of continuing even though a boundary has been crossed, and to nonnormal observations. In comparing two binomial dist...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Jennison C,Turnbull BW

    updated:1993-03-01 00:00:00

  • Bayesian partitioning for modeling and mapping spatial case-control data.

    abstract::Methods for modeling and mapping spatial variation in disease risk continue to motivate much research. In particular, spatial analyses provide a useful tool for exploring geographical heterogeneity in health outcomes, and consequently can yield clues as to disease etiology, direct public health management, and generat...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2008.01193.x

    authors: Costain DA

    updated:2009-12-01 00:00:00

  • Estimating treatment effect in a proportional hazards model in randomized clinical trials with all-or-nothing compliance.

    abstract::We consider methods for estimating the treatment effect and/or the covariate by treatment interaction effect in a randomized clinical trial under noncompliance with time-to-event outcome. As in Cuzick et al. (2007), assuming that the patient population consists of three (possibly latent) subgroups based on treatment p...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12472

    authors: Li S,Gray RJ

    updated:2016-09-01 00:00:00

  • Improved doubly robust estimation when data are monotonely coarsened, with application to longitudinal studies with dropout.

    abstract::A routine challenge is that of making inference on parameters in a statistical model of interest from longitudinal data subject to dropout, which are a special case of the more general setting of monotonely coarsened data. Considerable recent attention has focused on doubly robust (DR) estimators, which in this contex...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2010.01476.x

    authors: Tsiatis AA,Davidian M,Cao W

    updated:2011-06-01 00:00:00

  • On the near-singularity of models for animal recovery data.

    abstract::Certain probability models sometimes provide poor descriptions when fitted to data by maximum likelihood. We examine one such model for the survival of wild animals, which is fitted to two sets of data. When the model behaves poorly, its expected information matrix, evaluated at the maximum likelihood estimate of para...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2001.00720.x

    authors: Catchpole EA,Kgosi PM,Morgan BJ

    updated:2001-09-01 00:00:00

  • Statistical modelling of the AIDS epidemic for forecasting health care needs.

    abstract::The objective of this paper is to develop statistical methods for estimating current and future numbers of individuals in different stages of the natural history of the human immunodeficiency (AIDS) virus infection and to evaluate the impact of therapeutic advances on these numbers. The approach is to extend the metho...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Brookmeyer R,Liao JG

    updated:1990-12-01 00:00:00

  • Approximate Bayesian inference for discretely observed continuous-time multi-state models.

    abstract::Inference for continuous time multi-state models presents considerable computational difficulties when the process is only observed at discrete time points with no additional information about the state transitions. In fact, for a general multi-state Markov model, evaluation of the likelihood function is possible only v...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.13019

    authors: Tancredi A

    updated:2019-09-01 00:00:00

  • Order-restricted inference for means with missing values.

    abstract::Missing values appear very often in many applications, but the problem of missing values has not received much attention in testing order-restricted alternatives. Under the missing at random (MAR) assumption, we impute the missing values nonparametrically using kernel regression. For data with imputation, the classica...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12658

    authors: Wang H,Zhong PS

    updated:2017-09-01 00:00:00

  • Ranked set sampling with unequal samples.

    abstract::A ranked set sampling procedure with unequal samples (RSSU) is proposed and used to estimate the population mean. This estimator is then compared with the estimators based on the ranked set sampling (RSS) and median ranked set sampling (MRSS) procedures. It is shown that the relative precisions of the estimator based ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2001.00957.x

    authors: Bhoj DS

    updated:2001-09-01 00:00:00

  • Partially supervised learning using an EM-boosting algorithm.

    abstract::Training data in a supervised learning problem consist of the class label and its potential predictors for a set of observations. Constructing effective classifiers from training data is the goal of supervised learning. In biomedical sciences and other scientific applications, class labels may be subject to errors. We...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341X.2004.00156.x

    authors: Yasui Y,Pepe M,Hsu L,Adam BL,Feng Z

    updated:2004-03-01 00:00:00

  • Adaptive decision making in a lymphocyte infusion trial.

    abstract::We describe an adaptive Bayesian design for a clinical trial of an experimental treatment for patients with hematologic malignancies who initially received an allogeneic bone marrow transplant but subsequently suffered a disease recurrence. Treatment consists of up to two courses of targeted immunotherapy followed by ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2002.00560.x

    authors: Thall PF,Inoue LY,Martin TG

    updated:2002-09-01 00:00:00

  • Use of historical marker data for assessing treatment effects in phase I/II trials when subject selection is determined by baseline marker level.

    abstract::Although the primary focus of Phase I clinical trials is to assess clinical pharmacology and possible toxicities, any information on the potential effect of treatment would be useful in helping to determine priorities between treatments for further study. We consider the scenario where data are routinely collected on ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Lin HM,Hughes MD

    updated:1995-09-01 00:00:00

  • Heterogeneous capture-recapture models with covariates: a partial likelihood approach for closed populations.

    abstract::In practice, when analyzing data from a capture-recapture experiment it is tempting to apply modern advanced statistical methods to the observed capture histories. However, unless the analysis takes into account that the data have only been collected from individuals who have been captured at least once, the results m...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2011.01596.x

    authors: Stoklosa J,Hwang WH,Wu SH,Huggins R

    updated:2011-12-01 00:00:00

  • Efficient experimental designs for the estimation of genetic parameters in plant populations.

    abstract::Procedures for estimating the genetic parameters of plant populations frequently employ progeny testing to ascertain the genotype of maternal plants. However, when experimental resources are limited (e.g., electrophoretic markers), the large progeny sizes required for accurate typing severely restricts the numbers of ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Brown AH

    updated:1975-03-01 00:00:00

  • Aberrant crypt foci and semiparametric modeling of correlated binary data.

    abstract::Motivated by the spatial modeling of aberrant crypt foci (ACF) in colon carcinogenesis, we consider binary data with probabilities modeled as the sum of a nonparametric mean plus a latent Gaussian spatial process that accounts for short-range dependencies. The mean is modeled in a general way using regression splines....

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2007.00892.x

    authors: Apanasovich TV,Ruppert D,Lupton JR,Popovic N,Turner ND,Chapkin RS,Carroll RJ

    updated:2008-06-01 00:00:00

  • Nonparametric discrete survival function estimation with uncertain endpoints using an internal validation subsample.

    abstract::When a true survival endpoint cannot be assessed for some subjects, an alternative endpoint that measures the true endpoint with error may be collected, which often occurs when obtaining the true endpoint is too invasive or costly. We develop an estimated likelihood function for the situation where we have both uncert...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12316

    authors: Zee J,Xie SX

    updated:2015-09-01 00:00:00

  • Bayesian modeling of multiple lesion onset and growth from interval-censored data.

    abstract::In studying rates of occurrence and progression of lesions (or tumors), it is typically not possible to obtain exact onset times for each lesion. Instead, data consist of the number of lesions that reach a detectable size between screening examinations, along with measures of the size/severity of individual lesions at...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341X.2004.00217.x

    authors: Dunson DB,Holloman C,Calder C,Gunn LH

    updated:2004-09-01 00:00:00

  • Discriminant diagnostics.

    abstract::I discuss diagnostic methods for discriminant analysis. The equivalence with linear regression is noted and regression diagnostics are considered. The leverage is a function of the linear discriminant function and the Mahalanobis distance of the observation from the group mean. The distribution of this distance is app...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Lachenbruch PA

    updated:1997-12-01 00:00:00

  • Extraction of food consumption systems by nonnegative matrix factorization (NMF) for the assessment of food choices.

    abstract::In Western countries where food supply is satisfactory, consumers organize their diets around a large combination of foods. It is the purpose of this article to examine how recent nonnegative matrix factorization (NMF) techniques can be applied to food consumption data to understand these combinations. Such data are n...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.1541-0420.2011.01588.x

    authors: Zetlaoui M,Feinberg M,Verger P,Clémençon S

    updated:2011-12-01 00:00:00

  • Instrumental variable method for time-to-event data using a pseudo-observation approach.

    abstract::Observational studies are often in peril of unmeasured confounding. Instrumental variable analysis is a method for controlling for unmeasured confounding. As yet, theory on instrumental variable analysis of censored time-to-event data is scarce. We propose a pseudo-observation approach to instrumental variable analysi...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12451

    authors: Kjaersgaard MI,Parner ET

    updated:2016-06-01 00:00:00

  • Linear mixed models with flexible distributions of random effects for longitudinal data.

    abstract::Normality of random effects is a routine assumption for the linear mixed model, but it may be unrealistic, obscuring important features of among-individual variation. We relax this assumption by approximating the random effects density by the seminonparametric (SNP) representation of Gallant and Nychka (1987, Econome...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341x.2001.00795.x

    authors: Zhang D,Davidian M

    updated:2001-09-01 00:00:00

  • On longitudinal prediction with time-to-event outcome: Comparison of modeling options.

    abstract::Long-term follow-up is common in many medical investigations where the interest lies in predicting patients' risks for a future adverse outcome using repeatedly measured predictors over time. A key quantity is the likelihood of developing an adverse outcome among individuals who survived up to time s given their covar...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/biom.12562

    authors: Maziarz M,Heagerty P,Cai T,Zheng Y

    updated:2017-03-01 00:00:00

  • Efficient analysis of Weibull survival data from experiments on heterogeneous patient populations.

    abstract::An efficient method is presented for analyses of death rates in one-way or cross-classified experiments where expected survival time for a patient at time of entry on trial is a function of observable covariates. The survival-time distribution used is a Weibull form of Cox's (1972) model. The analysis proceeds in two ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:

    authors: Williams JS

    updated:1978-06-01 00:00:00

  • A general model for the analysis of mark-resight, mark-recapture, and band-recovery data under tag loss.

    abstract::Estimates of waterfowl demographic parameters often come from resighting studies where birds fit with individually identifiable neck collars are resighted at a distance. Concerns have been raised about the effects of collar loss on parameter estimates, and the reliability of extrapolating from collared individuals to ...

    journal_title:Biometrics

    pub_type: Journal Article

    doi:10.1111/j.0006-341X.2004.00245.x

    authors: Conn PB,Kendall WL,Samuel MD

    updated:2004-12-01 00:00:00