Abstract:
:High-throughput sequencing technologies have enabled large-scale studies of the role of the human microbiome in health conditions and diseases. Microbial community level association test, as a critical step to establish the connection between overall microbiome composition and an outcome of interest, has now been routinely performed in many studies. However, current microbiome association tests all focus on a single outcome. It has become increasingly common for a microbiome study to collect multiple, possibly related, outcomes to maximize the power of discovery. As these outcomes may share common mechanisms, jointly analyzing these outcomes can amplify the association signal and improve statistical power to detect potential associations. We propose the multivariate microbiome regression-based kernel association test (MMiRKAT) for testing association between multiple continuous outcomes and overall microbiome composition, where the kernel used in MMiRKAT is based on Bray-Curtis or UniFrac distance. MMiRKAT directly regresses all outcomes on the microbiome profiles via a semiparametric kernel machine regression framework, which allows for covariate adjustment and evaluates the association via a variance-component score test. Because most of the current microbiome studies have small sample sizes, a novel small-sample correction procedure is implemented in MMiRKAT to correct for the conservativeness of the association test when the sample size is small or moderate. The proposed method is assessed via simulation studies and an application to a real data set examining the association between host gene expression and mucosal microbiome composition. We demonstrate that MMiRKAT is more powerful than large sample based multivariate kernel association test, while controlling the type I error. A free implementation of MMiRKAT in R language is available at http://research.fhcrc.org/wu/en.html.
journal_name
Genet Epidemioljournal_title
Genetic epidemiologyauthors
Zhan X,Tong X,Zhao N,Maity A,Wu MC,Chen Jdoi
10.1002/gepi.22030subject
Has Abstractpub_date
2017-04-01 00:00:00pages
210-220issue
3eissn
0741-0395issn
1098-2272journal_volume
41pub_type
杂志文章abstract::Unaffected individuals are often disregarded in nonparametric linkage analysis. Because of the presumed high complexity of genetic interactions and the resulting low penetrance of any single genetic effect, the statistical contribution of unaffected sib pairs is thought to be considerably lower than that of the affect...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.2001.21.s1.s522
更新日期:2001-01-01 00:00:00
abstract::We subjected the first replication of the simulated isolated population data set to a novel analysis for association between marker alleles and either disease phenotypes or quantitative variable. The analysis depends on being able to reliably reconstruct all haplotypes in the pedigree. This was achieved using the MCLI...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.2001.21.s1.s571
更新日期:2001-01-01 00:00:00
abstract::Site-specific familial aggregation and evidence supporting Mendelian codominant inheritance have been shown in lung cancer. In characterizing lung cancer families, a number of other cancers have been observed. The current study evaluates whether first-degree relatives of early onset lung cancer cases are at increased ...
journal_title:Genetic epidemiology
pub_type: 临床试验,杂志文章
doi:10.1002/(SICI)1098-2272(199911)17:4<274::AID-GEPI3
更新日期:1999-11-01 00:00:00
abstract::This paper summarizes the analyses by participants in the insulin-dependent diabetes mellitus (IDDM) component of Genetic Analysis Workshop 5 (GAW5). The data were obtained from 94 families with two or more IDDM sibs. Topics treated in the Workshop analysis included the following: methods for detecting associations an...
journal_title:Genetic epidemiology
pub_type: 杂志文章,评审
doi:10.1002/gepi.1370060111
更新日期:1989-01-01 00:00:00
abstract::Linkage analyses and association studies were employed to detect disease susceptibility loci leading to elevated Q1 levels in Problem 2B. Phenotypes were defined to be the dichotomous affection status, the quantitative value for Q1, and Q1 adjusted for covariates. The method of mod-scores (for the dichotomous phenotyp...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/(SICI)1098-2272(1997)14:6<1035::AID-GEPI79
更新日期:1997-01-01 00:00:00
abstract::Noncoding DNA contains gene regulatory elements that alter gene expression, and the function of these elements can be modified by genetic variation. Massively parallel reporter assays (MPRA) enable high-throughput identification and characterization of functional genetic variants, but the statistical methods to identi...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.22337
更新日期:2020-10-01 00:00:00
abstract::We have analyzed allele frequency distribution at the hypervariable locus 3' to the apolipoprotein B gene in a healthy population sample (241 women and 246 men) from the Belgrade area. The bimodal distribution of sixteen different hypervariable region (HVR) alleles and the heterozygosity index (average 0.76) in both s...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/(SICI)1098-2272(1998)15:2<113::AID-GEPI1>3
更新日期:1998-01-01 00:00:00
abstract::Genome-wide association studies of discrete traits generally use simple methods of analysis based on chi(2) tests for contingency tables or logistic regression, at least for an initial scan of the entire genome. Nevertheless, more power might be obtained by using various methods that analyze multiple markers in combin...
journal_title:Genetic epidemiology
pub_type:
doi:10.1002/gepi.20465
更新日期:2009-01-01 00:00:00
abstract::The purpose of this commentary is to provide a framework for using the well-known sib-pair methodology in the context of epidemiologic study designs. Using examples from the Pittsburgh family studies of insulin-dependent diabetes mellitus, we illustrate that the sib-pair method can be used in family-based epidemiologi...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.1370080408
更新日期:1991-01-01 00:00:00
abstract::The heritability of most complex traits is driven by variants throughout the genome. Consequently, polygenic risk scores, which combine information on multiple variants genome-wide, have demonstrated improved accuracy in genetic risk prediction. We present a new two-step approach to constructing genome-wide polygenic ...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.22245
更新日期:2019-10-01 00:00:00
abstract::Orientals consisting of Japanese, Chinese, Koreans, and Filipinos are clearly at higher risk for cleft lip with or without cleft palate [CL(P)] than whites, Puerto Ricans, and Hawaiians/part-Hawaiians in Hawaii. Using the model of diallele cross, CL(P) incidences in incrosses and outcrosses involving 564,002 live birt...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.1370040603
更新日期:1987-01-01 00:00:00
abstract::Smalley et al. [(1992) Genet Epidemiol 9:333-345] found evidence of a mixture of two distributions in memory performance among offspring of patients with dementia of the Alzheimer type (DAT), suggesting that these groups reflect genotypic subgroups of carriers and non-carriers of a putative DAT gene. One prediction of...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.1370110506
更新日期:1994-01-01 00:00:00
abstract::Genetic Analysis Workshop 17 (GAW17) focused on the transition from genome-wide association study designs and methods to the study designs and statistical genetic methods that will be required for the analysis of next-generation sequence data including both common and rare sequence variants. In the 166 contributions t...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20659
更新日期:2011-01-01 00:00:00
abstract::Linkage disequilibrium mapping of quantitative traits is a powerful method for dissecting the genetic etiology of complex phenotypes. Quantitative traits, however, often exhibit characteristics that make their use problematic. For example, the distribution of the trait may be censored, highly skewed, or contaminated w...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20141
更新日期:2006-04-01 00:00:00
abstract::Methods to account for population structure (PS) in genome-wide association studies have been well developed in samples of unrelated individuals, but when a sample is composed of families, the task of finding and accounting for PS is not as straight forward. Family-based tests that condition on parental genotypes or t...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20590
更新日期:2011-09-01 00:00:00
abstract::We develop a Bayesian multi-SNP Markov chain Monte Carlo approach that allows published functional significance scores to objectively inform single nucleotide polymorphism (SNP) prior effect sizes in expression quantitative trait locus (eQTL) studies. We developed the Normal Gamma prior to allow the inclusion of funct...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.21961
更新日期:2016-05-01 00:00:00
abstract::Human apolipoprotein A-IV (APO A-IV) exhibits a common protein polymorphism detectable by isoelectric focusing (IEF) due to a single base substitution at codon 360 which replaces the frequently occurring glutamine residue (allele 1) with histidine (allele 2). Recently, sequence analysis of the APO A-IV coding region h...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.1370090503
更新日期:1992-01-01 00:00:00
abstract::Family data are useful for estimating disease risk in carriers of specific genotypes of a given gene (penetrance). Penetrance is frequently estimated assuming that relatives' phenotypes are independent, given their genotypes for the gene of interest. This assumption is unrealistic when multiple shared risk factors con...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20493
更新日期:2010-05-01 00:00:00
abstract::This paper discusses the theory and implementation of a model for mapping X-linked quantitative trait loci (QTL). As a result of X inactivation, a female's body is subdivided into a number of patches. In each patch one of her two X chromosomes is randomly switched off. This smooths the allelic contributions in a heter...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20158
更新日期:2006-07-01 00:00:00
abstract::We examined familial resemblance and performed segregation analysis for the maximal expiratory flow rate at 50% of vital capacity (Vmax50) and the ratio of Vmax50 to forced vital capacity (FVC), based on data from 309 nuclear families with 1,045 individuals in the town of Humboldt, Saskatchewan, in 1993. Vmax50 is con...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/(SICI)1098-2272(1999)16:1<95::AID-GEPI8>3.
更新日期:1999-01-01 00:00:00
abstract::Cancer has predominant environmental and somatic causes but the assessment of hereditary (genetic) causes is difficult, except for highly penetrant single-gene causes. Family studies are only partially informative in this regard because family members share diet and life-styles. Twin studies have been classically used...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/1098-2272(200101)20:1<107::AID-GEPI9>3.0.C
更新日期:2001-01-01 00:00:00
abstract::Penalized regression methods offer an attractive alternative to single marker testing in genetic association analysis. Penalized regression methods shrink down to zero the coefficient of markers that have little apparent effect on the trait of interest, resulting in a parsimonious subset of what we hope are true perti...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20543
更新日期:2010-12-01 00:00:00
abstract::One approach to establish linkage is based on allele-sharing methods for sib pairs. Recently, the use of extreme sib pairs (ESP) has been proposed to increase power for mapping quantitative traits in humans. Several approaches have been discussed. In this study, we calculate sample sizes for the various ESP approaches...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/(SICI)1098-2272(1998)15:6<577::AID-GEPI3>3
更新日期:1998-01-01 00:00:00
abstract::Multipoint linkage analysis using sibpair designs remains a common approach to help investigators to narrow chromosomal regions for traits (either qualitative or quantitative) of interest. Despite its popularity, the success of this approach depends heavily on how issues such as genetic heterogeneity, gene-gene, and g...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.20036
更新日期:2005-01-01 00:00:00
abstract::For most complex diseases, the fraction of heritability that can be explained by the variants discovered from genome-wide association studies is minor. Although the so-called "rare variants" (minor allele frequency [MAF] < 1%) have attracted increasing attention, they are unlikely to account for much of the "missing h...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.21740
更新日期:2013-09-01 00:00:00
abstract::Identifying gene-environment (G-E) interactions can contribute to a better understanding of disease etiology, which may help researchers develop disease prevention strategies and interventions. One big criticism of studying G-E interaction is the lack of power due to sample size. Studies often restrict the interaction...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.22043
更新日期:2017-07-01 00:00:00
abstract::Genome-wide association studies (GWAS) are a powerful tool for understanding the genetic basis of diseases and traits, but most studies have been conducted in isolation, with a focus on either a single or a set of closely related phenotypes. We describe MetABF, a simple Bayesian framework for performing integrative me...
journal_title:Genetic epidemiology
pub_type: 杂志文章,meta分析
doi:10.1002/gepi.22202
更新日期:2019-07-01 00:00:00
abstract::Asthma and atopy are two closely related, common complex traits in which a number of genetic and environmental factors are suspected to play a role. We have performed parametric and nonparametric multi-marker linkage analysis for the Busselton data set, which is part of problem 1 of Genetic Analysis Workshop 12. In pa...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.2001.21.s1.s204
更新日期:2001-01-01 00:00:00
abstract::Bone mass may be so reduced in some individuals as to be characterized as osteoporotic, with resulting fracture, particularly of the proximal femur, vertebrae, or wrist. We identified 34 mother-daughter sets (n = 70) and 29 sibling sets (n = 59) from a community study of bone mass correlates to assess the degree of re...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.1370030204
更新日期:1986-01-01 00:00:00
abstract::Atopic disease is generally recognized to be familial, although specific genetic components have yet to be identified. High levels of a unique class of immunoglobulins, immunoglobulin E (IgE), have been shown to be associated with allergies. Several investigators have reported evidence indicating a recessive regulator...
journal_title:Genetic epidemiology
pub_type: 杂志文章
doi:10.1002/gepi.1370020402
更新日期:1985-01-01 00:00:00