Mediation effect selection in high-dimensional and compositional microbiome data.

Abstract:

The microbiome plays an important role in human health by mediating the path from environmental exposures to health outcomes. The relative abundances of the high-dimensional microbiome data have a unit-sum restriction, rendering standard statistical methods in the Euclidean space invalid. To address this problem, we use the isometric log-ratio transformations of the relative abundances as the mediator variables. To select significant mediators, we consider a closed testing-based selection procedure with desirable confidence. Simulations are provided to verify the effectiveness of our method. As an illustrative example, we apply the proposed method to study the mediation effects of murine gut microbiome between subtherapeutic antibiotic treatment and body weight gain, and identify Coprobacillus and Adlercreutzia as two significant mediators.
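To make the isometric log-ratio (ILR) step mentioned in the abstract concrete, here is a minimal NumPy sketch. It is an illustration under stated assumptions, not the authors' implementation: the function names (`closure`, `ilr_basis`, `ilr`), the Helmert-type orthonormal basis, and the optional `pseudo` offset for zero counts are all choices made for this example, and the paper may use a different basis or zero-handling rule.

```python
import numpy as np


def closure(x):
    """Rescale each row to sum to 1, giving relative abundances."""
    x = np.asarray(x, dtype=float)
    return x / x.sum(axis=1, keepdims=True)


def ilr_basis(d):
    """Return a d x (d-1) matrix with orthonormal, zero-sum columns.

    Assumption: a Helmert-type basis; any orthonormal basis of the
    zero-sum subspace yields a valid ILR transform.
    """
    v = np.zeros((d, d - 1))
    for j in range(d - 1):
        k = j + 1                    # size of the first group of parts
        v[:k, j] = 1.0 / k           # first k parts, equal weight
        v[k, j] = -1.0               # contrasted with part k+1
        v[:, j] *= np.sqrt(k / (k + 1.0))  # scale column to unit norm
    return v


def ilr(x, pseudo=0.0):
    """Map n x d compositions to n x (d-1) unconstrained coordinates.

    `pseudo` is a hypothetical offset added before closure so that
    zero counts do not produce log(0); it must leave all parts positive.
    """
    comp = closure(np.asarray(x, dtype=float) + pseudo)
    # Basis columns sum to zero, so log(comp) @ V equals clr(comp) @ V.
    return np.log(comp) @ ilr_basis(comp.shape[1])


if __name__ == "__main__":
    counts = np.array([[20, 30, 50], [10, 60, 30]])
    print(ilr(counts))
```

The resulting d-1 coordinates are free of the unit-sum constraint, so they can enter ordinary regression models as candidate mediators. Rescaling the input (for example, counts versus proportions) leaves the coordinates unchanged when `pseudo` is 0.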

journal_name

Stat Med

journal_title

Statistics in medicine

authors

Zhang H,Chen J,Feng Y,Wang C,Li H,Liu L

doi

10.1002/sim.8808

subject

Has Abstract

pub_date

2021-02-20 00:00:00

pages

885-896

issue

4

eissn

1097-0258

issn

0277-6715

journal_volume

40

pub_type

Journal Article
  • Emergence of childhood psychiatric disorders: a multivariate probit analysis.

    abstract::We applied a computationally practical form of probit analysis for multiple response variables to data on early childhood development of four psychiatric disorders: disruptive disorders (DD-attention deficit disorders, oppositional defiant disorder, conduct disorder); adjustment disorders (ADJ); emotional disorders (E...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/(sici)1097-0258(19981115)17:21<2487::aid-s

    authors: Gibbons RD,Lavigne JV

    update_date:1998-11-15 00:00:00

  • Smooth bootstrap methods for analysis of longitudinal data.

    abstract::In analysis of longitudinal data, the variance matrix of the parameter estimates is usually estimated by the 'sandwich' method, in which the variance for each subject is estimated by its residual products. We propose smooth bootstrap methods by perturbing the estimating functions to obtain 'bootstrapped' realizations ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3027

    authors: Li Y,Wang YG

    update_date:2008-03-30 00:00:00

  • Hypothesis testing in the polychotomous logistic model with an application to detecting gastrointestinal cancer.

    abstract::We discuss the use of the trichotomous logistic model to discriminate between patients with gastrointestinal (GI) cancer, patients with benign GI disease and 'normal' subjects, using symptoms and the concentrations of some serum proteins that are potentially indicative of malignancy as covariates. A parsimonious model...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780040313

    authors: Marshall RJ,Chisholm EM

    update_date:1985-07-01 00:00:00

  • Hierarchical multiple informants models: examining food environment contributions to the childhood obesity epidemic.

    abstract::Methods for multiple informants help to estimate the marginal effect of each multiple source predictor and formally compare the strength of their association with an outcome. We extend multiple informant methods to the case of hierarchical data structures to account for within cluster correlation. We apply the propose...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.5967

    authors: Baek J,Sánchez BN,Sanchez-Vaznaugh EV

    update_date:2014-02-20 00:00:00

  • Exact equivalence test for risk ratio and its sample size determination under inverse sampling.

    abstract::When data are dichotomous, this paper notes the utility of inverse sampling in establishing equivalence with respect to the risk ratio. This paper develops an exact equivalence test that accounts for the risk ratio under inverse sampling and further discusses the relationship between the exact equivalence test and the...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/(sici)1097-0258(19970815)16:15<1777::aid-s

    authors: Lui KJ

    update_date:1997-08-15 00:00:00

  • Methods for proper handling of overrunning and underrunning in phase II designs for oncology trials.

    abstract::Phase II studies in oncology are frequently conducted as two-stage single-arm trials with a binary endpoint indicating tumor response. As a common feature of these designs, the sample sizes of the two stages and the decision rules for the interim and the final analysis have to be pre-specified and adhered to strictly ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6479

    authors: Englert S,Kieser M

    update_date:2015-06-15 00:00:00

  • Easy and accurate variance estimation of the nonparametric estimator of the partial area under the ROC curve and its application.

    abstract::The receiver operating characteristic (ROC) curve is a popular technique with applications, for example, investigating an accuracy of a biomarker to delineate between disease and non-disease groups. A common measure of accuracy of a given diagnostic marker is the area under the ROC curve (AUC). In contrast with the AU...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6863

    authors: Yu J,Yang L,Vexler A,Hutson AD

    update_date:2016-06-15 00:00:00

  • The effects of measurement error in response variables and tests of association of explanatory variables in change models.

    abstract::Biomedical studies often measure variables with error. Examples in the literature include investigation of the association between the change in some outcome variable (blood pressure, cholesterol level etc.) and a set of explanatory variables (age, smoking status etc.). Typically, one fits linear regression models to ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/(sici)1097-0258(19981130)17:22<2597::aid-s

    authors: Yanez ND 3rd,Kronmal RA,Shemanski LR

    update_date:1998-11-30 00:00:00

  • Four-fold table cell frequencies imputation in meta analysis.

    abstract::Meta analysis is a collection of quantitative methods devoted to combine summary information from related but independent studies. Because research reports usually present only data reductions and summary statistics rather than detailed data, the reviewer must often resort to rather crude methods for constructing summ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.2287

    authors: Di Pietrantonj C

    update_date:2006-07-15 00:00:00

  • Semiparametric Bayesian variable selection for gene-environment interactions.

    abstract::Many complex diseases are known to be affected by the interactions between genetic variants and environmental exposures beyond the main genetic and environmental effects. Study of gene-environment (G×E) interactions is important for elucidating the disease etiology. Existing Bayesian methods for G×E interaction studie...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.8434

    authors: Ren J,Zhou F,Li X,Chen Q,Zhang H,Ma S,Jiang Y,Wu C

    update_date:2020-02-28 00:00:00

  • Graphical model checking with correlated response data.

    abstract::Correlated response data arise often in biomedical studies. The generalized estimation equation (GEE) approach is widely used in regression analysis for such data. However, there are few methods available to check the adequacy of regression models in GEE. In this paper, a graphical method is proposed based on Cook and...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.889

    authors: Pan W,Connett JE,Porzio GC,Weisberg S

    update_date:2001-10-15 00:00:00

  • Covariate adjusted mixture models and disease mapping with the program DismapWin.

    abstract::The analysis and recognition of disease clustering in space and its representation on a map is an important problem in epidemiology. An approach using mixture models to identify spatial heterogeneity in disease risk and map construction within an empirical Bayes framework is described. Once heterogeneity is detected, ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/(sici)1097-0258(19960415)15:7/9<919::aid-s

    authors: Schlattmann P,Dietz E,Böhning D

    update_date:1996-04-15 00:00:00

  • An examination of the efficiency of some quality assurance methods commonly employed in clinical trials.

    abstract::The cost and efficiency of training clinical centre staff and of duplicate data entry in clinical trials is reviewed. Training is an essential component of quality assurance programmes and it is usually carried out at regular intervals in long-term clinical trials. Initial training of staff and regular retraining is i...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780090118

    authors: Neaton JD,Duchene AG,Svendsen KH,Wentworth D

    update_date:1990-01-01 00:00:00

  • Outcome-adaptive randomization for a delayed outcome with a short-term predictor: imputation-based designs.

    abstract::Delay in the outcome variable is challenging for outcome-adaptive randomization, as it creates a lag between the number of subjects accrued and the information known at the time of the analysis. Motivated by a real-life pediatric ulcerative colitis trial, we consider a case where a short-term predictor is available fo...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6222

    authors: Kim MO,Liu C,Hu F,Lee JJ

    update_date:2014-10-15 00:00:00

  • A Bayesian semiparametric Markov regression model for juvenile dermatomyositis.

    abstract::Juvenile dermatomyositis (JDM) is a rare autoimmune disease that may lead to serious complications, even to death. We develop a 2-state Markov regression model in a Bayesian framework to characterise disease progression in JDM over time and gain a better understanding of the factors influencing disease risk. The trans...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7613

    authors: De Iorio M,Gallot N,Valcarcel B,Wedderburn L

    update_date:2018-05-10 00:00:00

  • Nonparametric comparison of two survival functions with dependent censoring via nonparametric multiple imputation.

    abstract::When the event time of interest depends on the censoring time, conventional two-sample test methods, such as the log-rank and Wilcoxon tests, can produce an invalid test result. We extend our previous work on estimation using auxiliary variables to adjust for dependent censoring via multiple imputation, to the compari...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3480

    authors: Hsu CH,Taylor JM

    update_date:2009-02-01 00:00:00

  • Modelling the geographical distribution of co-infection risk from single-disease surveys.

    abstract:BACKGROUND:The need to deliver interventions targeting multiple diseases in a cost-effective manner calls for integrated disease control efforts. Consequently, maps are required that show where the risk of co-infection is particularly high. Co-infection risk is preferably estimated via Bayesian geostatistical multinomi...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4243

    authors: Schur N,Gosoniu L,Raso G,Utzinger J,Vounatsou P

    update_date:2011-06-30 00:00:00

  • Application of the parallel line assay to assessment of biosimilar products based on binary endpoints.

    abstract::Biological drug products are therapeutic moieties manufactured by a living system or organisms. These are important life-saving drug products for patients with unmet medical needs. Because of expensive cost, only a few patients have access to life-saving biological products. Most of the early biological products will ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.5565

    authors: Lin JR,Chow SC,Chang CH,Lin YC,Liu JP

    update_date:2013-02-10 00:00:00

  • Accounting for informatively missing data in logistic regression by means of reassessment sampling.

    abstract::We explore the 'reassessment' design in a logistic regression setting, where a second wave of sampling is applied to recover a portion of the missing data on a binary exposure and/or outcome variable. We construct a joint likelihood function based on the original model of interest and a model for the missing data mech...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.6456

    authors: Lin J,Lyles RH

    update_date:2015-05-20 00:00:00

  • Spatially regularized estimation for the analysis of dynamic contrast-enhanced magnetic resonance imaging data.

    abstract::Competing compartment models of different complexities have been used for the quantitative analysis of dynamic contrast-enhanced magnetic resonance imaging data. We present a spatial elastic net approach that allows to estimate the number of compartments for each voxel such that the model complexity is not fixed a pri...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.5997

    authors: Sommer JC,Gertheiss J,Schmid VJ

    update_date:2014-03-15 00:00:00

  • Methods for assessing reliability and validity for a measurement tool: a case study and critique using the WHO haemoglobin colour scale.

    abstract::Before introducing a new measurement tool it is necessary to evaluate its performance. Several statistical methods have been developed, or used, to evaluate the reliability and validity of a new assessment method in such circumstances. In this paper we review some commonly used methods. Data from a study that was cond...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1804

    authors: White SA,van den Broek NR

    update_date:2004-05-30 00:00:00

  • Human disease cost network analysis.

    abstract::Diseases can be interconnected. In the recent years, there has been a surge of multidisease studies. Among them, HDN (human disease network) analysis takes a system perspective, examines the interconnections among diseases along with their individual properties, and has demonstrated great potential. Most of the existi...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.8472

    authors: Ma C,Li Y,Shia B,Ma S

    update_date:2020-04-30 00:00:00

  • Describing time and age variations in the risk of radiation-induced solid tumour incidence in the Japanese atomic bomb survivors using generalized relative and absolute risk models.

    abstract::Generalized relative and absolute risk models, in which various functions of time and age modify the excess relative or absolute risk of radiation-induced cancer, are fitted to the Japanese atomic bomb survivor cancer incidence data set. Among generalized relative risk models, those in which a product of powers of tim...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/(sici)1097-0258(19990115)18:1<17::aid-sim9

    authors: Little MP,Muirhead CR,Charles MW

    update_date:1999-01-15 00:00:00

  • An analysis of eight 95 per cent confidence intervals for a ratio of Poisson parameters when events are rare.

    abstract::We compared eight nominal 95 per cent confidence intervals for the ratio of two Poisson parameters, both assumed small, on their true coverage (the probability that the interval includes the ratio of Poisson parameters) and median width. The commonly used log-linear interval, justified by asymptotic considerations, pr...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3234

    authors: Barker L,Cadwell BL

    update_date:2008-09-10 00:00:00

  • Statistical methods for active extension trials.

    abstract::This paper develops methods of analysis for active extension clinical trials. Under this design, patients are randomized to treatment or placebo for a period of time (period 1), and then all patients receive treatment for an additional period of time (period 2). We assume a continuous outcome is measured at baseline a...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.2720

    authors: Hu Z,Follmann D

    update_date:2007-05-30 00:00:00

  • Testing goodness-of-fit of the logistic regression model in case-control studies using sample reweighting.

    abstract::A new goodness-of-fit test for the logistic regression model is proposed. It exploits the property of this model that when it is correct, i.e. not misspecified, the parameter estimates are (asymptotically) invariant under reweighting the observations by weights wi that are a function of the binary (0/1) outcomes yi. M...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.1997

    authors: Nagelkerke N,Smits J,le Cessie S,van Houwelingen H

    update_date:2005-01-15 00:00:00

  • A semi-parametric Bayesian approach to average bioequivalence.

    abstract::Bioequivalence assessment is an issue of great interest. Development of statistical methods for assessing bioequivalence is an important area of research for statisticians. Bioequivalence is usually determined based on the normal distribution. We relax this assumption and develop a semi-parametric mixed model for bioe...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.2620

    authors: Ghosh P,Rosner GL

    update_date:2007-03-15 00:00:00

  • Confidence intervals for a ratio of two independent binomial proportions.

    abstract::Several large-sample confidence intervals for the ratio of independent binomial proportions are compared in terms of exact coverage probability and width. A non-iterative approximate Bayesian interval is derived and its frequency properties are superior to all of the non-iterative confidence intervals considered. The ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.3376

    authors: Price RM,Bonett DG

    update_date:2008-11-20 00:00:00

  • Conditional power and predictive power based on right censored data with supplementary auxiliary information.

    abstract::Conditional power and predictive power provide estimates of the probability of success at the end of the trial based on the information from the interim analysis. The observed value of the time to event endpoint at the interim analysis could be biased for the true treatment effect due to early censoring, leading to a ...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.7673

    authors: Sun L,Wan Y

    update_date:2018-08-15 00:00:00

  • Estimating prediction equations in repeated measures designs.

    abstract::Experimental designs with repeated measures allow response patterns over time (or dose) to be modelled and compared between different homogeneous groups. Issues in data analysis often focus on the pattern of variation of the repeated measures, the appropriateness of a univariate or multivariate analysis, and the shape...

    journal_title:Statistics in medicine

    pub_type: Journal Article

    doi:10.1002/sim.4780100116

    authors: Stanek EJ 3rd,Kline G

    update_date:1991-01-01 00:00:00