Abstract:
:Metagenomics enables the study of gene abundances in complex mixtures of microorganisms and has become a standard methodology for the analysis of the human microbiome. However, gene abundance data is inherently noisy and contains high levels of biological and technical variability as well as an excess of zeros due to non-detected genes. This makes the statistical analysis challenging. In this study, we present a new hierarchical Bayesian model for inference of metagenomic gene abundance data. The model uses a zero-inflated overdispersed Poisson distribution which is able to simultaneously capture the high gene-specific variability as well as zero observations in the data. By analysis of three comprehensive datasets, we show that zero-inflation is common in metagenomic data from the human gut and, if not correctly modelled, it can lead to substantial reductions in statistical power. We also show, by using resampled metagenomic data, that our model has, compared to other methods, a higher and more stable performance for detecting differentially abundant genes. We conclude that proper modelling of the gene-specific variability, including the excess of zeros, is necessary to accurately describe gene abundances in metagenomic data. The proposed model will thus pave the way for new biological insights into the structure of microbial communities.
journal_name
Stat Methods Med Resjournal_title
Statistical methods in medical researchauthors
Jonsson V,Österlund T,Nerman O,Kristiansson Edoi
10.1177/0962280218811354subject
Has Abstractpub_date
2019-12-01 00:00:00pages
3712-3728issue
12eissn
0962-2802issn
1477-0334journal_volume
28pub_type
杂志文章abstract::Pattern-mixture model (PMM)-based controlled imputations have become a popular tool to assess the sensitivity of primary analysis inference to different post-dropout assumptions or to estimate treatment effectiveness. The methodology is well established for continuous responses but less well established for binary res...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280220941880
更新日期:2020-12-01 00:00:00
abstract::Immunotherapy, gene therapy or adoptive cell therapies, such as the chimeric antigen receptor+ T-cell therapies, have demonstrated promising therapeutic effects in oncology patients. We consider statistical designs for dose-finding adoptive cell therapy trials, in which the monotonic dose-response relationship assumed...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280220977009
更新日期:2020-12-16 00:00:00
abstract::Sensitive questions are often involved in healthcare or medical survey research. Much empirical evidence has shown that the randomized response technique is useful for the collection of truthful responses. However, few studies have discussed methods to estimate the dependence of sensitive responses of multiple types. ...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280219847492
更新日期:2020-03-01 00:00:00
abstract::Cluster randomized and multicentre trials evaluate the effect of a treatment on persons nested within clusters, for instance patients within clinics or pupils within schools. Although equal sample sizes per cluster are generally optimal for parameter estimation, they are rarely feasible. This paper addresses the relat...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280206079018
更新日期:2008-08-01 00:00:00
abstract::Many statistical studies report p-values for inferential purposes. In several scenarios, the stochastic aspect of p-values is neglected, which may contribute to drawing wrong conclusions in real data experiments. The stochastic nature of p-values makes their use to examine the performance of given testing procedures o...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280217704451
更新日期:2018-12-01 00:00:00
abstract::We propose a flexible and computationally efficient penalized estimation method for a semi-parametric linear transformation model with current status data. To facilitate model fitting, the unknown monotone function is approximated by monotone B-splines, and a computationally efficient hybrid algorithm involving the Fi...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280218820406
更新日期:2020-01-01 00:00:00
abstract::Medical research commonly relies on the combination of 2 x 2 tables of counted data for making inferences about treatment effects or about the causes of disease. This article reviews point estimation and interval estimation for a common odds ratio. Traditional methods for providing these estimates face special challen...
journal_title:Statistical methods in medical research
pub_type: 杂志文章,评审
doi:10.1177/096228029400300204
更新日期:1994-01-01 00:00:00
abstract::Binary logistic regression is one of the most frequently applied statistical approaches for developing clinical prediction models. Developers of such models often rely on an Events Per Variable criterion (EPV), notably EPV ≥10, to determine the minimal sample size required and the maximum number of candidate predictor...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280218784726
更新日期:2019-08-01 00:00:00
abstract::Statistical models of breast cancer tumour progression have been used to further our knowledge of the natural history of breast cancer, to evaluate mammography screening in terms of mortality, to estimate overdiagnosis, and to estimate the impact of lead-time bias when comparing survival times between screen detected ...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280217734583
更新日期:2019-03-01 00:00:00
abstract::Mixed models are widely used for the analysis of one repeatedly measured outcome. If more than one outcome is present, a mixed model can be used for each one. These separate models can be tied together into a multivariate mixed model by specifying a joint distribution for their random effects. This strategy has been u...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280206075305
更新日期:2007-10-01 00:00:00
abstract::The accuracy of a diagnostic test, which is often quantified by a pair of measures such as sensitivity and specificity, is critical for medical decision making. Separate studies of an investigational diagnostic test can be combined through meta-analysis; however, such an analysis can be threatened by publication bias....
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280218791602
更新日期:2019-10-01 00:00:00
abstract::Cancer mortality risk estimates are essential for planning resource allocation and designing and evaluating cancer prevention and management strategies. However, mortality figures generally become available after a few years, making necessary to develop reliable procedures to provide current and near future mortality ...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280212446366
更新日期:2012-10-01 00:00:00
abstract::The analysis of fecundity data is challenging and requires consideration of both highly timed and interrelated biologic processes in the context of essential behaviors such as sexual intercourse during the fertile window. Understanding human fecundity is further complicated by presence of a sterile population, i.e. co...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280212438646
更新日期:2016-02-01 00:00:00
abstract::The analysis of health care costs is complicated by the skewed and heteroscedastic nature of their distribution with possibly additional zero values. Statistical methods that do not adjust for these features can lead to incorrect conclusions. This paper reviews recent developments in statistical methods for the analys...
journal_title:Statistical methods in medical research
pub_type: 杂志文章,评审
doi:10.1191/0962280202sm290ra
更新日期:2002-08-01 00:00:00
abstract::Measurement error is a serious problem in the analysis of epidemiological data. In the past 20 years, a large number of methods for the correction of measurement error have been developed. While at the beginning mostly methods for cohort studies were considered, recently more attention has been paid to case-control st...
journal_title:Statistical methods in medical research
pub_type: 杂志文章,评审
doi:10.1177/096228020000900504
更新日期:2000-10-01 00:00:00
abstract::The analysis of walking behavior in a physical activity intervention is considered. A Bayesian latent structure modeling approach is proposed whereby the ability and willingness of participants is modeled via latent effects. The dropout process is jointly modeled via a linked survival model. Computational issues are a...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280214529932
更新日期:2016-12-01 00:00:00
abstract::In this paper, we describe a Bayesian hierarchical Poisson model for the prospective analysis of data for infectious diseases. The proposed model consists of two components. The first component describes the behavior of disease during nonepidemic periods and the second component represents the increase in disease coun...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280214527385
更新日期:2014-12-01 00:00:00
abstract::This paper illustrates the use of multidimensional scaling methods (MDS) to examine space-time patterns in epidemic data. The paper begins by outlining the principles of MDS. The model is then formally specified and illustrated by application to two data sets. The first is partly a tutorial example. It uses monthly re...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/096228029500400202
更新日期:1995-06-01 00:00:00
abstract::The estimation of population parameters using complex survey data requires careful statistical modelling to account for the design features. This is further complicated by unit and item nonresponse for which a number of methods have been developed in order to reduce estimation bias. In this paper, we address some issu...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280213484401
更新日期:2016-08-01 00:00:00
abstract::Statistical methods for carrying out asymptotic inferences (tests or confidence intervals) relative to one or two independent binomial proportions are very frequent. However, inferences about a linear combination of K independent proportions L = Σβ(i)p(i) (in which the first two are special cases) have had very little...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280209347953
更新日期:2011-08-01 00:00:00
abstract::Clinical trials investigating the efficacy of two or more doses of an experimental treatment compared to a single reference arm are not uncommon. In such situations, if each dose is compared to the reference arm using an un-adjusted significance level, consideration of the Type I familywise error is likely to be requi...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280210378943
更新日期:2011-12-01 00:00:00
abstract::Early phase trials of complex interventions currently focus on assessing the feasibility of a large randomised control trial and on conducting pilot work. Assessing the efficacy of the proposed intervention is generally discouraged, due to concerns of underpowered hypothesis testing. In contrast, early assessment of e...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280215589507
更新日期:2016-06-01 00:00:00
abstract::Bayes or empirical Bayes methods to improve inferential accuracy for a population mean has been widely adopted in medical research. As the joint prior distribution of both the mean and variance parameters can be difficult to specify or estimate, most of these methods have relied on certain level of simplifications of ...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280218773537
更新日期:2019-06-01 00:00:00
abstract::In this paper, we develop a simple diagnostic test for the random-effects distribution in mixed models. The test is based on the gradient function, a graphical tool proposed by Verbeke and Molenberghs to check the impact of assumptions about the random-effects distribution in mixed models on inferences. Inference is c...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280214564721
更新日期:2017-04-01 00:00:00
abstract::This paper presents a new model-based generalized functional clustering method for discrete longitudinal data, such as multivariate binomial and Poisson distributed data. For this purpose, we propose a multivariate functional principal component analysis (MFPCA)-based clustering procedure for a latent multivariate Gau...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280220921912
更新日期:2020-11-01 00:00:00
abstract::This work presents a brief overview of Markov models in cancer screening evaluation and focuses on two specific models. A three-state model was first proposed to estimate jointly the sensitivity of the screening procedure and the average duration in the preclinical phase, i.e. the period when the cancer is asymptomati...
journal_title:Statistical methods in medical research
pub_type: 杂志文章,评审
doi:10.1177/0962280209359848
更新日期:2010-10-01 00:00:00
abstract::This paper focuses on inferential tools in the logistic regression model fitted by the Firth penalized likelihood. In this context, the Likelihood Ratio statistic is often reported to be the preferred choice as compared to the 'traditional' Wald statistic. In this work, we consider and discuss a wider range of test st...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280216661213
更新日期:2018-05-01 00:00:00
abstract::Couples with diseases associated with the sexual chromosomes, as well as families in countries where the desire for a male is extreme, are interested in influencing the sex of the baby. We propose an original composite likelihood approach to analyse the relation between sex of the newborn and timing of the intercourse...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280217702415
更新日期:2018-11-01 00:00:00
abstract::Classification with a large number of predictors and biomarker discovery become increasingly important in biological and medical research. This paper focuses on performing classification of cardiovascular diseases based on electrocardiogram analysis which deals with many variables and a lot of measurements within vari...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280217699996
更新日期:2018-11-01 00:00:00
abstract::The three-class Youden index serves both as a measure of medical test accuracy and a criterion to choose the optimal pair of cutoff values for classifying subjects into three ordinal disease categories (e.g. no disease, mild disease, advanced disease). We present a Bayesian nonparametric approach for estimating the th...
journal_title:Statistical methods in medical research
pub_type: 杂志文章
doi:10.1177/0962280217742538
更新日期:2018-03-01 00:00:00