Estimation of a significance threshold for epigenome-wide association studies.

Abstract:

Epigenome-wide association studies (EWAS) are designed to characterise population-level epigenetic differences across the genome and link them to disease. Most commonly, they assess DNA-methylation status at cytosine-guanine dinucleotide (CpG) sites, using platforms such as the Illumina 450k array that profile a subset of CpGs genome-wide. An important challenge in the context of EWAS is determining a significance threshold for declaring a CpG site as differentially methylated, taking multiple testing into account. We used a permutation method to estimate a significance threshold specifically for the 450k array and a simulation extrapolation approach to estimate a genome-wide threshold. These methods were applied to five different EWAS datasets derived from a variety of populations and tissue types. We obtained an estimate of α = 2.4 × 10⁻⁷ for the 450k array, and a genome-wide estimate of α = 3.6 × 10⁻⁸. We further demonstrate the importance of these results by showing that previously recommended sample sizes for EWAS should be adjusted upwards, requiring samples between ∼10% and ∼20% larger in order to maintain type-1 errors at the desired level.
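
The abstract does not spell out the permutation procedure, so the following is only a minimal sketch of one common way to obtain a permutation-based threshold, not the authors' implementation. It assumes a "minimum p-value" scheme: the phenotype is shuffled to break any true association, the smallest per-site p-value is recorded for each shuffle, and the lower 5% quantile of those minima is taken as the per-site threshold that keeps the family-wise error rate near 5%. The function names (`approx_p_value`, `permutation_threshold`), the Fisher z approximation for the p-value, and the toy data are all assumptions made for illustration.

```python
import math
import random
import statistics


def approx_p_value(x, y):
    """Two-sided p-value for the Pearson correlation of x and y.

    Uses the Fisher z-transform normal approximation (an assumption made
    here to stay within the standard library). Inputs must not be constant.
    """
    n = len(x)
    mx, my = statistics.fmean(x), statistics.fmean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    r = sxy / math.sqrt(sxx * syy)
    r = max(min(r, 0.999999), -0.999999)  # keep atanh finite
    z = math.atanh(r) * math.sqrt(n - 3)
    return math.erfc(abs(z) / math.sqrt(2))


def permutation_threshold(methylation, phenotype, n_perm=200, fwer=0.05, seed=0):
    """Estimate a per-site significance threshold by permutation.

    methylation: list of sites, each a list of per-sample values.
    phenotype:   list of per-sample trait values.
    Returns the empirical `fwer` quantile of the minimum p-value
    observed across sites under shuffled phenotypes.
    """
    rng = random.Random(seed)
    shuffled = list(phenotype)  # copy so the caller's list is untouched
    min_ps = []
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # breaks any true phenotype association
        min_ps.append(min(approx_p_value(site, shuffled) for site in methylation))
    min_ps.sort()
    k = max(math.floor(fwer * n_perm) - 1, 0)  # index of the fwer quantile
    return min_ps[k]


# Hypothetical toy data: 50 independent "CpG sites" measured in 40 samples.
rng = random.Random(1)
n_samples, n_sites = 40, 50
methylation = [[rng.random() for _ in range(n_samples)] for _ in range(n_sites)]
phenotype = [i % 2 for i in range(n_samples)]  # balanced case/control labels

threshold = permutation_threshold(methylation, phenotype)
print(f"estimated per-site threshold: {threshold:.2e}")
print(f"implied effective number of tests: {0.05 / threshold:.1f}")
```

Because sites on a real array are correlated, the implied effective number of tests would be expected to fall below the number of probes, which is why a permutation estimate can be less stringent than a plain Bonferroni correction.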

journal_name

Genet Epidemiol

journal_title

Genetic epidemiology

authors

Saffari A,Silver MJ,Zavattari P,Moi L,Columbano A,Meaburn EL,Dudbridge F

doi

10.1002/gepi.22086

subject

Has Abstract

pub_date

2018-02-01 00:00:00

pages

20-33

issue

1

eissn

1098-2272

issn

0741-0395

journal_volume

42

pub_type

Journal Article
  • Genome-wide association studies for discrete traits.

    abstract::Genome-wide association studies of discrete traits generally use simple methods of analysis based on chi-squared tests for contingency tables or logistic regression, at least for an initial scan of the entire genome. Nevertheless, more power might be obtained by using various methods that analyze multiple markers in combin...

    journal_title:Genetic epidemiology

    pub_type:

    doi:10.1002/gepi.20465

    authors: Thomas DC

    updated: 2009-01-01 00:00:00

  • Segregation analysis of cardiovascular reactivity to laboratory stressors.

    abstract::To better understand the contribution of major gene influences to individual differences in cardiovascular reactivity, we performed a segregation analysis on blood pressure responses to two laboratory tasks, mental arithmetic and bicycle exercise. The study population consisted of 1,451 adults (age ≥ 18 years) wh...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/(SICI)1098-2272(1997)14:1<35::AID-GEPI3>3.

    authors: Cheng LS,Carmelli D,Hunt SC,Williams RR

    updated: 1997-01-01 00:00:00

  • Linkage analysis of candidate obesity genes among the Mexican-American population of Starr County, Texas.

    abstract::Recent advances in the molecular basis of body fat regulation have identified several genes in which genetic variation may influence obesity and related measures in human populations. Genes that have been shown to have a regulatory function in the control of body fat utilization, eating behavior, and/or metabolic rate...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/(SICI)1098-2272(1999)16:4<397::AID-GEPI6>3

    authors: Bray MS,Boerwinkle E,Hanis CL

    updated: 1999-01-01 00:00:00

  • Comparison of the QTDT analysis for IgE in the CSGA data set.

    abstract::Over the past few years at least 13 transmission/disequilibrium test (TDT)-based tests have been developed for quantitative (Q) traits for the assessment of association or linkage in the presence of the other. A total of six of these QTDT methods were used to analyze log10IgE in the Collaborative Study on the Genetics...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.2001.21.s1.s312

    authors: Page GP,Wilcox MA,Occhiuto J,Adak S,Neuberg D,Bajorunaite R,George V

    updated: 2001-01-01 00:00:00

  • Regressive logistic modeling of familial aggregation for asthma in 7,394 population-based nuclear families.

    abstract::The aim of this population-based study was to determine whether asthma aggregates in families, and if so, whether aggregation was consistent with environmental and/or genetic etiologies. Data were from 7,394 nuclear families (41,506 individuals) from the 1968 Tasmanian Asthma Survey, in which all Tasmanian schoolchild...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/(SICI)1098-2272(1997)14:3<317::AID-GEPI9>3

    authors: Jenkins MA,Hopper JL,Giles GG

    updated: 1997-01-01 00:00:00

  • A Bayesian integrative genomic model for pathway analysis of complex traits.

    abstract::With new technologies, multiple types of genomic data are commonly collected on a single set of samples. However, standard analysis methods concentrate on a single data type at a time and ignore the relationships between genes, proteins, and biochemical reactions that give rise to complex phenotypes. In this paper, we...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.21628

    authors: Fridley BL,Lund S,Jenkins GD,Wang L

    updated: 2012-05-01 00:00:00

  • Efficient computation of patterned covariance matrix mixed models in quantitative segregation analysis.

    abstract::The use of patterned covariance matrices in forming pedigree-based mixed models for quantitative traits is discussed. It is suggested that patterned covariance matrix models provide intuitive, theoretically appealing, and flexible genetic modeling devices for pedigree data. It is suggested further that the very great ...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370080104

    authors: Schork N

    updated: 1991-01-01 00:00:00

  • Quantitative allelic test--a fast test for very large association studies.

    abstract::Advances in high throughput technology have enabled the generation of unprecedented amounts of genomic data (e.g., next-generation sequence data, transcriptomics, metabolomics, and proteomics), which promises to unravel the genetic architecture of complex traits. These discoveries may lead to novel therapeutic targets...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.21768

    authors: Lee SM,Karrison TG,Cox NJ,Im HK

    updated: 2013-12-01 00:00:00

  • Mortality differences by APOE genotype estimated from demographic synthesis.

    abstract::The ε4 allele of apolipoprotein E (APOE) is associated with increased risk of two major causes of death in low-mortality populations: ischemic heart disease and Alzheimer's disease. It is less common among centenarians than at younger ages. Therefore, it is likely that it is associated with excess risk of death. This a...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.0164

    authors: Ewbank DC

    updated: 2002-02-01 00:00:00

  • Bayesian linkage and segregation analysis: factoring the problem.

    abstract::Complex segregation analysis and linkage methods are mathematical techniques for the genetic dissection of complex diseases. They are used to delineate complex modes of familial transmission and to localize putative disease susceptibility loci to specific chromosomal locations. The computational problem of Bayesian li...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/1098-2272(2000)19:1+<::AID-GEPI8>3.0.CO;2-

    authors: Matthysse S

    updated: 2000-01-01 00:00:00

  • Major locus inheritance of apolipoprotein B in Utah pedigrees.

    abstract::A major locus that determines levels of apolipoprotein B (apoB) was revealed by likelihood analysis on 331 members of 36 pedigrees. The major locus explained 43.2% of the observed variance, with the remainder attributed to random environmental factors. Estimated mean apoB levels (mg/dl) were 110.5 ± 2.5, 141.9 ± 4...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370040202

    authors: Hasstedt SJ,Wu L,Williams RR

    updated: 1987-01-01 00:00:00

  • Genome-wide detection and characterization of mating asymmetry in human populations.

    abstract::The study of the genetic component of early-onset diseases requires investigation into parental genetic effects, particularly those mediated by the mother who can influence the offspring's risk of disease through the effects of her genes acting directly on the intrauterine milieu or indirectly through maternal-gene ch...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20602

    authors: Bourgey M,Healy J,Saint-Onge P,Massé H,Sinnett D,Roy-Gagnon MH

    updated: 2011-09-01 00:00:00

  • Variance component models for X-linked QTLs.

    abstract::This paper discusses the theory and implementation of a model for mapping X-linked quantitative trait loci (QTL). As a result of X inactivation, a female's body is subdivided into a number of patches. In each patch one of her two X chromosomes is randomly switched off. This smooths the allelic contributions in a heter...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20158

    authors: Lange K,Sobel E

    updated: 2006-07-01 00:00:00

  • GAW10: simulated family data for a common oligogenic disease with quantitative risk factors.

    abstract::GAW10 Problem 2 involves a simulated common disease defined by imposing a threshold, T, on a quantitative trait, Q1. Every individual with a value of Q1 ≥ T (where T = 40) is defined as affected. Also thought to be associated with the disease as intervening variables are four other quantitative traits (Q2, Q3, Q4...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/(SICI)1098-2272(1997)14:6<737::AID-GEPI29>

    authors: MacCluer JW,Blangero J,Dyer TD,Speer MC

    updated: 1997-01-01 00:00:00

  • Stratified false discovery control for large-scale hypothesis testing with application to genome-wide association studies.

    abstract::The multiplicity problem has become increasingly important in genetic studies as the capacity for high-throughput genotyping has increased. The control of False Discovery Rate (FDR) (Benjamini and Hochberg. [1995] J. R. Stat. Soc. Ser. B 57:289-300) has been adopted to address the problems of false positive control an...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20164

    authors: Sun L,Craiu RV,Paterson AD,Bull SB

    updated: 2006-09-01 00:00:00

  • Biochemical intermediates in alpha 1-antitrypsin deficiency: residual family resemblance for total alpha 1-antitrypsin, oxidized alpha 1-antitrypsin, and immunoglobulin E after adjustment for the effect of the Pi locus.

    abstract::alpha 1-antitrypsin (alpha 1 AT) deficiency is variably associated with the development of pulmonary emphysema. To gain insight into the process which begins with the Z point mutation at the Protease Inhibitor (Pi) locus and results in the variable development of emphysema, three quantitative phenotypes, including total al...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370070204

    authors: Silverman EK,Province MA,Campbell EJ,Pierce JA,Rao DC

    updated: 1990-01-01 00:00:00

  • Modelling the major histocompatibility complex susceptibility to RA using the MASC method.

    abstract::To explain the association between HLA-DRB1 gene and rheumatoid arthritis (RA), two main hypotheses have been proposed. The first, the shared epitope hypothesis, assumes a direct role of DRB1 in RA susceptibility. The second hypothesis assumes a recessive disease susceptibility gene in linkage disequilibrium with DRB1...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/(SICI)1098-2272(1998)15:4<419::AID-GEPI7>3

    authors: Génin E,Babron MC,McDermott MF,Mulcahy B,Waldron-Lynch F,Adams C,Clegg DO,Ward RH,Shanahan F,Molloy MG,O'Gara F,Clerget-Darpoux F

    updated: 1998-01-01 00:00:00

  • How can maximum likelihood methods reveal candidate gene effects on a quantitative trait?

    abstract::Different maximum likelihood approaches were used to explore the role of candidate genes in the variability of quantitative trait Q1 while accounting for the effects of age, Q2, and Q3. Segregation analysis, under the class D regressive model, provides evidence for a Mendelian gene effect on the adjusted trait Q1. Res...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1370120643

    authors: Martinez M,Abel L,Demenais F

    updated: 1995-01-01 00:00:00

  • A flexible and parallelizable approach to genome-wide polygenic risk scores.

    abstract::The heritability of most complex traits is driven by variants throughout the genome. Consequently, polygenic risk scores, which combine information on multiple variants genome-wide, have demonstrated improved accuracy in genetic risk prediction. We present a new two-step approach to constructing genome-wide polygenic ...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.22245

    authors: Newcombe PJ,Nelson CP,Samani NJ,Dudbridge F

    updated: 2019-10-01 00:00:00

  • Effect of physical activity on lipid levels in a population-based sample of men with and without the Arg192 variant of the human paraoxonase gene.

    abstract::The prevalence of cardiovascular risk factors in Gerona, Spain, is high for the low myocardial infarction incidence and mortality rates in the province. Physical activity is a protective factor against coronary heart disease. We investigated whether the genetic variants Q and R of the paraoxonase Gln-Arg 192 polymorph...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/(SICI)1098-2272(200003)18:3<276::AID-GEPI6

    authors: Sentí M,Aubó C,Elosua R,Sala J,Tomás M,Marrugat J

    updated: 2000-03-01 00:00:00

  • Parental transmission and D18S37 allele sharing in bipolar affective disorder.

    abstract::We combined the five chromosome 18 bipolar affective disorder data sets provided by GAW10, totaling 185 families with 3,394 individuals, and performed analysis of differential parental transmission and chromosome 18 marker allele sharing in families with transmission through fathers vs those through mothers. Results i...

    journal_title:Genetic epidemiology

    pub_type: Clinical Trial, Journal Article

    doi:10.1002/(SICI)1098-2272(1997)14:6<665::AID-GEPI19>

    authors: Lin JP,Bale SJ

    updated: 1997-01-01 00:00:00

  • Modeling the HLA component in rheumatoid arthritis: sensitivity to DRB1 allele frequencies.

    abstract::Rheumatoid arthritis is an inflammatory disease for which positive associations have been described with some HLA-DRB1 alleles. The associated alleles share a similar amino acid sequence in the third hypervariable region, the shared epitope, but differ at positions 71 and 86. It has been suggested that HLA susceptibili...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/1098-2272(200012)19:4<422::AID-GEPI12>3.0.

    authors: Tézenas du Montcel S,Reviron D,Genin E,Roudier J,Mercier P,Clerget-Darpoux F

    updated: 2000-12-01 00:00:00

  • Pleiotropy and principal components of heritability combine to increase power for association analysis.

    abstract::When many correlated traits are measured the potential exists to discover the coordinated control of these traits via genotyped polymorphisms. A common statistical approach to this problem involves assessing the relationship between each phenotype and each single nucleotide polymorphism (SNP) individually (PHN); and t...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20257

    authors: Klei L,Luca D,Devlin B,Roeder K

    updated: 2008-01-01 00:00:00

  • Conditional multipoint linkage analysis using affected sib pairs: an alternative approach.

    abstract::Recently, Liang et al. ([2001b] Genet. Epidemiol. 21:105-122) proposed a conditional approach to assess linkage evidence on the target region by incorporating linkage information from an unlinked (reference) region using allele shared IBD (identity-by-descent) from affected sib pairs. This is carried out by conditionin...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.10305

    authors: Chiu YF,Liang KY

    updated: 2004-02-01 00:00:00

  • Effect of linkage disequilibrium between markers in linkage and association analyses.

    abstract::Contributions to Group 17 of the Genetic Analysis Workshop 15 considered dense markers in linkage disequilibrium (LD) in the context of either linkage or association analysis. Three contributions reported on methods for modeling LD or selecting a subset of markers in linkage equilibrium to perform linkage analysis. Wh...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20291

    authors: Dupuis J,Albers K,Allen-Brady K,Cho K,Elston RC,Kappen HJ,Tang H,Thomas A,Thomson G,Tsung E,Yang Q,Zhang W,Zhao K,Zheng G,Ziegler JT

    updated: 2007-01-01 00:00:00

  • Lessons learned from Genetic Analysis Workshop 17: transitioning from genome-wide association studies to whole-genome statistical genetic analysis.

    abstract::Genetic Analysis Workshop 17 (GAW17) focused on the transition from genome-wide association study designs and methods to the study designs and statistical genetic methods that will be required for the analysis of next-generation sequence data including both common and rare sequence variants. In the 166 contributions t...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20659

    authors: Wilson AF,Ziegler A

    updated: 2011-01-01 00:00:00

  • Tests for gene-environment interaction from case-control data: a novel study of type I error, power and designs.

    abstract::To evaluate the risk of a disease associated with the joint effects of genetic susceptibility and environmental exposures, epidemiologic researchers often test for non-multiplicative gene-environment effects from case-control studies. In this article, we present a comparative study of four alternative tests for intera...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.20337

    authors: Mukherjee B,Ahn J,Gruber SB,Rennert G,Moreno V,Chatterjee N

    updated: 2008-11-01 00:00:00

  • Mantel statistics to correlate gene expression levels from microarrays with clinical covariates.

    abstract::Mantel statistics provide an additional step to standard approaches in the analysis of gene expression and covariate data, allow the calculation of standard statistics such as correlation, partial correlation, and regression coefficients, and, with permutation tests, provide P values for these statistics to relate the...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.1115

    authors: Shannon WD,Watson MA,Perry A,Rich K

    updated: 2002-06-01 00:00:00

  • Data mining and computationally intensive methods: summary of Group 7 contributions to Genetic Analysis Workshop 13.

    abstract::The Framingham Heart Study data, as well as a related simulated data set, were generously provided to the participants of the Genetic Analysis Workshop 13 in order that newly developed and emerging statistical methodologies could be tested on that well-characterized data set. The impetus driving the development of nov...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/gepi.10285

    authors: Costello TJ,Falk CT,Ye KQ

    updated: 2003-01-01 00:00:00

  • National database of familial cancer in Sweden.

    abstract::A family cancer database was constructed from the nationwide Swedish registries and includes approximately 6 million persons and >30,000 cancers in offspring diagnosed at ages 15-51 years and their parents. A particular advantage of the database is that the contribution of both parental lineages on cancer risk can be ...

    journal_title:Genetic epidemiology

    pub_type: Journal Article

    doi:10.1002/(SICI)1098-2272(1998)15:3<225::AID-GEPI2>3

    authors: Hemminki K,Vaittinen P

    updated: 1998-01-01 00:00:00