A small-sample multivariate kernel machine test for microbiome association studies.

Abstract:

:High-throughput sequencing technologies have enabled large-scale studies of the role of the human microbiome in health conditions and diseases. Microbial community level association test, as a critical step to establish the connection between overall microbiome composition and an outcome of interest, has now been routinely performed in many studies. However, current microbiome association tests all focus on a single outcome. It has become increasingly common for a microbiome study to collect multiple, possibly related, outcomes to maximize the power of discovery. As these outcomes may share common mechanisms, jointly analyzing these outcomes can amplify the association signal and improve statistical power to detect potential associations. We propose the multivariate microbiome regression-based kernel association test (MMiRKAT) for testing association between multiple continuous outcomes and overall microbiome composition, where the kernel used in MMiRKAT is based on Bray-Curtis or UniFrac distance. MMiRKAT directly regresses all outcomes on the microbiome profiles via a semiparametric kernel machine regression framework, which allows for covariate adjustment and evaluates the association via a variance-component score test. Because most of the current microbiome studies have small sample sizes, a novel small-sample correction procedure is implemented in MMiRKAT to correct for the conservativeness of the association test when the sample size is small or moderate. The proposed method is assessed via simulation studies and an application to a real data set examining the association between host gene expression and mucosal microbiome composition. We demonstrate that MMiRKAT is more powerful than large sample based multivariate kernel association test, while controlling the type I error. A free implementation of MMiRKAT in R language is available at http://research.fhcrc.org/wu/en.html.

journal_name

Genet Epidemiol

journal_title

Genetic epidemiology

authors

Zhan X,Tong X,Zhao N,Maity A,Wu MC,Chen J

doi

10.1002/gepi.22030

subject

Has Abstract

pub_date

2017-04-01 00:00:00

pages

210-220

issue

3

eissn

0741-0395

issn

1098-2272

journal_volume

41

pub_type

杂志文章
  • To type or not to type: the use of unaffected siblings in nonparametric linkage analysis.

    abstract::Unaffected individuals are often disregarded in nonparametric linkage analysis. Because of the presumed high complexity of genetic interactions and the resulting low penetrance of any single genetic effect, the statistical contribution of unaffected sib pairs is thought to be considerably lower than that of the affect...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.2001.21.s1.s522

    authors: Majewski J

    更新日期:2001-01-01 00:00:00

  • Allelic association in large pedigrees.

    abstract::We subjected the first replication of the simulated isolated population data set to a novel analysis for association between marker alleles and either disease phenotypes or quantitative variable. The analysis depends on being able to reliably reconstruct all haplotypes in the pedigree. This was achieved using the MCLI...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.2001.21.s1.s571

    authors: Gutin A,Abkevich V,Camp NJ,Farnham JM,Cannon-Albright L,Thomas A

    更新日期:2001-01-01 00:00:00

  • Familial aggregation of breast cancer with early onset lung cancer.

    abstract::Site-specific familial aggregation and evidence supporting Mendelian codominant inheritance have been shown in lung cancer. In characterizing lung cancer families, a number of other cancers have been observed. The current study evaluates whether first-degree relatives of early onset lung cancer cases are at increased ...

    journal_title:Genetic epidemiology

    pub_type: 临床试验,杂志文章

    doi:10.1002/(SICI)1098-2272(199911)17:4<274::AID-GEPI3

    authors: Schwartz AG,Siegfried JM,Weiss L

    更新日期:1999-11-01 00:00:00

  • Genetic analysis of IDDM: summary of GAW5 IDDM results.

    abstract::This paper summarizes the analyses by participants in the insulin-dependent diabetes mellitus (IDDM) component of Genetic Analysis Workshop 5 (GAW5). The data were obtained from 94 families with two or more IDDM sibs. Topics treated in the Workshop analysis included the following: methods for detecting associations an...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章,评审

    doi:10.1002/gepi.1370060111

    authors: Spielman RS,Baur MP,Clerget-Darpoux F

    更新日期:1989-01-01 00:00:00

  • Testing the utility of mod scores and sib-pair analysis to detect presence of disease susceptibility loci.

    abstract::Linkage analyses and association studies were employed to detect disease susceptibility loci leading to elevated Q1 levels in Problem 2B. Phenotypes were defined to be the dichotomous affection status, the quantitative value for Q1, and Q1 adjusted for covariates. The method of mod-scores (for the dichotomous phenotyp...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/(SICI)1098-2272(1997)14:6<1035::AID-GEPI79

    authors: Neuman RJ,Xian H

    更新日期:1997-01-01 00:00:00

  • Statistical considerations for the analysis of massively parallel reporter assays data.

    abstract::Noncoding DNA contains gene regulatory elements that alter gene expression, and the function of these elements can be modified by genetic variation. Massively parallel reporter assays (MPRA) enable high-throughput identification and characterization of functional genetic variants, but the statistical methods to identi...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.22337

    authors: Qiao D,Zigler CM,Cho MH,Silverman EK,Zhou X,Castaldi PJ,Laird NH

    更新日期:2020-10-01 00:00:00

  • APO B 3' HVR polymorphism in healthy population: relationships to serum lipid levels.

    abstract::We have analyzed allele frequency distribution at the hypervariable locus 3' to the apolipoprotein B gene in a healthy population sample (241 women and 246 men) from the Belgrade area. The bimodal distribution of sixteen different hypervariable region (HVR) alleles and the heterozygosity index (average 0.76) in both s...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/(SICI)1098-2272(1998)15:2<113::AID-GEPI1>3

    authors: Alavantić D,Glisić S,Kandić I

    更新日期:1998-01-01 00:00:00

  • Genome-wide association studies for discrete traits.

    abstract::Genome-wide association studies of discrete traits generally use simple methods of analysis based on chi(2) tests for contingency tables or logistic regression, at least for an initial scan of the entire genome. Nevertheless, more power might be obtained by using various methods that analyze multiple markers in combin...

    journal_title:Genetic epidemiology

    pub_type:

    doi:10.1002/gepi.20465

    authors: Thomas DC

    更新日期:2009-01-01 00:00:00

  • Commentary: the affected sib-pair method in the context of an epidemiologic study design.

    abstract::The purpose of this commentary is to provide a framework for using the well-known sib-pair methodology in the context of epidemiologic study designs. Using examples from the Pittsburgh family studies of insulin-dependent diabetes mellitus, we illustrate that the sib-pair method can be used in family-based epidemiologi...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370080408

    authors: Khoury MJ,Flanders WD,Lipton RB,Dorman JS

    更新日期:1991-01-01 00:00:00

  • A flexible and parallelizable approach to genome-wide polygenic risk scores.

    abstract::The heritability of most complex traits is driven by variants throughout the genome. Consequently, polygenic risk scores, which combine information on multiple variants genome-wide, have demonstrated improved accuracy in genetic risk prediction. We present a new two-step approach to constructing genome-wide polygenic ...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.22245

    authors: Newcombe PJ,Nelson CP,Samani NJ,Dudbridge F

    更新日期:2019-10-01 00:00:00

  • Genetic epidemiology of cleft lip with or without cleft palate in the population of Hawaii.

    abstract::Orientals consisting of Japanese, Chinese, Koreans, and Filipinos are clearly at higher risk for cleft lip with or without cleft palate [CL(P)] than whites, Puerto Ricans, and Hawaiians/part-Hawaiians in Hawaii. Using the model of diallele cross, CL(P) incidences in incrosses and outcrosses involving 564,002 live birt...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370040603

    authors: Chung CS,Mi MP,Beechert AM

    更新日期:1987-01-01 00:00:00

  • Commingling analysis of memory performance in elderly men.

    abstract::Smalley et al. [(1992) Genet Epidemiol 9:333-345] found evidence of a mixture of two distributions in memory performance among offspring of patients with dementia of the Alzheimer type (DAT), suggesting that these groups reflect genotypic subgroups of carriers and non-carriers of a putative DAT gene. One prediction of...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370110506

    authors: Palmer CG,Wolkenstein BH,La Rue A,Swan GE,Smalley SL

    更新日期:1994-01-01 00:00:00

  • Lessons learned from Genetic Analysis Workshop 17: transitioning from genome-wide association studies to whole-genome statistical genetic analysis.

    abstract::Genetic Analysis Workshop 17 (GAW17) focused on the transition from genome-wide association study designs and methods to the study designs and statistical genetic methods that will be required for the analysis of next-generation sequence data including both common and rare sequence variants. In the 166 contributions t...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20659

    authors: Wilson AF,Ziegler A

    更新日期:2011-01-01 00:00:00

  • Genetic association tests based on ranks (GATOR) for quantitative traits with and without censoring.

    abstract::Linkage disequilibrium mapping of quantitative traits is a powerful method for dissecting the genetic etiology of complex phenotypes. Quantitative traits, however, often exhibit characteristics that make their use problematic. For example, the distribution of the trait may be censored, highly skewed, or contaminated w...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20141

    authors: Allen AS,Martin ER,Qin X,Li YJ

    更新日期:2006-04-01 00:00:00

  • Evaluation of methods accounting for population structure with pedigree data and continuous outcomes.

    abstract::Methods to account for population structure (PS) in genome-wide association studies have been well developed in samples of unrelated individuals, but when a sample is composed of families, the task of finding and accounting for PS is not as straight forward. Family-based tests that condition on parental genotypes or t...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20590

    authors: Peloso GM,Dupuis J,Lunetta KL

    更新日期:2011-09-01 00:00:00

  • eQuIPS: eQTL Analysis Using Informed Partitioning of SNPs - A Fully Bayesian Approach.

    abstract::We develop a Bayesian multi-SNP Markov chain Monte Carlo approach that allows published functional significance scores to objectively inform single nucleotide polymorphism (SNP) prior effect sizes in expression quantitative trait locus (eQTL) studies. We developed the Normal Gamma prior to allow the inclusion of funct...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21961

    authors: Boggis EM,Milo M,Walters K

    更新日期:2016-05-01 00:00:00

  • Two common polymorphisms in the APO A-IV coding gene: their evolution and linkage disequilibrium.

    abstract::Human apolipoprotein A-IV (APO A-IV) exhibits a common protein polymorphism detectable by isoelectric focusing (IEF) due to a single base substitution at codon 360 which replaces the frequently occurring glutamine residue (allele 1) with histidine (allele 2). Recently, sequence analysis of the APO A-IV coding region h...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370090503

    authors: Kamboh MI,Hamman RF,Ferrell RE

    更新日期:1992-01-01 00:00:00

  • Estimating gene penetrance from family data.

    abstract::Family data are useful for estimating disease risk in carriers of specific genotypes of a given gene (penetrance). Penetrance is frequently estimated assuming that relatives' phenotypes are independent, given their genotypes for the gene of interest. This assumption is unrealistic when multiple shared risk factors con...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20493

    authors: Gong G,Hannon N,Whittemore AS

    更新日期:2010-05-01 00:00:00

  • Variance component models for X-linked QTLs.

    abstract::This paper discusses the theory and implementation of a model for mapping X-linked quantitative trait loci (QTL). As a result of X inactivation, a female's body is subdivided into a number of patches. In each patch one of her two X chromosomes is randomly switched off. This smooths the allelic contributions in a heter...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20158

    authors: Lange K,Sobel E

    更新日期:2006-07-01 00:00:00

  • Major genetic effects on airway-parenchymal dysanapsis of the lung: the Humboldt family study.

    abstract::We examined familial resemblance and performed segregation analysis for the maximal expiratory flow rate at 50% of vital capacity (Vmax50) and the ratio of Vmax50 to forced vital capacity (FVC), based on data from 309 nuclear families with 1,045 individuals in the town of Humboldt, Saskatchewan, in 1993. Vmax50 is con...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/(SICI)1098-2272(1999)16:1<95::AID-GEPI8>3.

    authors: Chen Y,Dosman JA,Rennie DC,Lockinger LA

    更新日期:1999-01-01 00:00:00

  • Estimation of genetic and environmental components in colorectal and lung cancer and melanoma.

    abstract::Cancer has predominant environmental and somatic causes but the assessment of hereditary (genetic) causes is difficult, except for highly penetrant single-gene causes. Family studies are only partially informative in this regard because family members share diet and life-styles. Twin studies have been classically used...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/1098-2272(200101)20:1<107::AID-GEPI9>3.0.C

    authors: Hemminki K,Lönnstedt I,Vaittinen P,Lichtenstein P

    更新日期:2001-01-01 00:00:00

  • SNP selection in genome-wide and candidate gene studies via penalized logistic regression.

    abstract::Penalized regression methods offer an attractive alternative to single marker testing in genetic association analysis. Penalized regression methods shrink down to zero the coefficient of markers that have little apparent effect on the trait of interest, resulting in a parsimonious subset of what we hope are true perti...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20543

    authors: Ayers KL,Cordell HJ

    更新日期:2010-12-01 00:00:00

  • Sample size calculations for linkage analysis using extreme sib pairs based on segregation analysis with the quantitative phenotype body weight as an example.

    abstract::One approach to establish linkage is based on allele-sharing methods for sib pairs. Recently, the use of extreme sib pairs (ESP) has been proposed to increase power for mapping quantitative traits in humans. Several approaches have been discussed. In this study, we calculate sample sizes for the various ESP approaches...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/(SICI)1098-2272(1998)15:6<577::AID-GEPI3>3

    authors: Ziegler A,Hebebrand J

    更新日期:1998-01-01 00:00:00

  • Multipoint linkage mapping using sibpairs: non-parametric estimation of trait effects with quantitative covariates.

    abstract::Multipoint linkage analysis using sibpair designs remains a common approach to help investigators to narrow chromosomal regions for traits (either qualitative or quantitative) of interest. Despite its popularity, the success of this approach depends heavily on how issues such as genetic heterogeneity, gene-gene, and g...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.20036

    authors: Chiou JM,Liang KY,Chiu YF

    更新日期:2005-01-01 00:00:00

  • Haplotype kernel association test as a powerful method to identify chromosomal regions harboring uncommon causal variants.

    abstract::For most complex diseases, the fraction of heritability that can be explained by the variants discovered from genome-wide association studies is minor. Although the so-called "rare variants" (minor allele frequency [MAF] < 1%) have attracted increasing attention, they are unlikely to account for much of the "missing h...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.21740

    authors: Lin WY,Yi N,Lou XY,Zhi D,Zhang K,Gao G,Tiwari HK,Liu N

    更新日期:2013-09-01 00:00:00

  • A combination test for detection of gene-environment interaction in cohort studies.

    abstract::Identifying gene-environment (G-E) interactions can contribute to a better understanding of disease etiology, which may help researchers develop disease prevention strategies and interventions. One big criticism of studying G-E interaction is the lack of power due to sample size. Studies often restrict the interaction...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.22043

    authors: Coombes B,Basu S,McGue M

    更新日期:2017-07-01 00:00:00

  • Bayesian meta-analysis across genome-wide association studies of diverse phenotypes.

    abstract::Genome-wide association studies (GWAS) are a powerful tool for understanding the genetic basis of diseases and traits, but most studies have been conducted in isolation, with a focus on either a single or a set of closely related phenotypes. We describe MetABF, a simple Bayesian framework for performing integrative me...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章,meta分析

    doi:10.1002/gepi.22202

    authors: Trochet H,Pirinen M,Band G,Jostins L,McVean G,Spencer CCA

    更新日期:2019-07-01 00:00:00

  • Linkage analysis of asthma and atopy including models with genomic imprinting.

    abstract::Asthma and atopy are two closely related, common complex traits in which a number of genetic and environmental factors are suspected to play a role. We have performed parametric and nonparametric multi-marker linkage analysis for the Busselton data set, which is part of problem 1 of Genetic Analysis Workshop 12. In pa...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.2001.21.s1.s204

    authors: Strauch K,Bogdanow M,Fimmers R,Baur MP,Wienker TF

    更新日期:2001-01-01 00:00:00

  • Familial resemblance of bone mass in adult women.

    abstract::Bone mass may be so reduced in some individuals as to be characterized as osteoporotic, with resulting fracture, particularly of the proximal femur, vertebrae, or wrist. We identified 34 mother-daughter sets (n = 70) and 29 sibling sets (n = 59) from a community study of bone mass correlates to assess the degree of re...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370030204

    authors: Sowers MR,Burns TL,Wallace RB

    更新日期:1986-01-01 00:00:00

  • Demonstration of a common major gene with pleiotropic effects on immunoglobulin E levels and allergy.

    abstract::Atopic disease is generally recognized to be familial, although specific genetic components have yet to be identified. High levels of a unique class of immunoglobulins, immunoglobulin E (IgE), have been shown to be associated with allergies. Several investigators have reported evidence indicating a recessive regulator...

    journal_title:Genetic epidemiology

    pub_type: 杂志文章

    doi:10.1002/gepi.1370020402

    authors: Borecki IB,Rao DC,Lalouel JM,McGue M,Gerrard JW

    更新日期:1985-01-01 00:00:00