An evaluation of methods correcting for cell-type heterogeneity in DNA methylation studies.

Abstract:

BACKGROUND: Many different methods exist to adjust for variability in cell-type mixture proportions when analyzing DNA methylation studies. Here we present the results of an extensive simulation study, built on cell-separated DNA methylation profiles from Illumina Infinium 450K methylation data, to compare the performance of eight methods, including the most commonly used approaches. RESULTS: We designed a rich multi-layered simulation containing a set of probes with true associations with either binary or continuous phenotypes, confounding by cell type, variability in means and standard deviations for population parameters, additional variability at the level of an individual cell-type-specific sample, and variability in the mixture proportions across samples. Performance varied substantially across methods and simulations. In particular, the number of false positives was sometimes unrealistically high, indicating limited ability to discriminate the true signals from those appearing significant through confounding. Methods that filtered probes consequently had poor power. QQ plots of p values across all tested probes showed that adjustments did not always improve the distribution. The same methods were used to examine associations between smoking and methylation data from a case-control study of colorectal cancer, and we also explored the effect of cell-type adjustments on associations between rheumatoid arthritis cases and controls. CONCLUSIONS: We recommend surrogate variable analysis for cell-type mixture adjustment since performance was stable under all our simulated scenarios.

journal_name

Genome Biol

journal_title

Genome biology

authors

McGregor K,Bernatsky S,Colmegna I,Hudson M,Pastinen T,Labbe A,Greenwood CM

doi

10.1186/s13059-016-0935-y

subject

Has Abstract

pub_date

2016-05-03 00:00:00

pages

84

eissn

1474-7596

issn

1474-760X

pii

10.1186/s13059-016-0935-y

journal_volume

17

pub_type

Journal Article
  • DarkHorse: a method for genome-wide prediction of horizontal gene transfer.

    abstract::A new approach to rapid, genome-wide identification and ranking of horizontal transfer candidate proteins is presented. The method is quantitative, reproducible, and computationally undemanding. It can be combined with genomic signature and/or phylogenetic tree-building procedures to improve accuracy and efficiency. T...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2007-8-2-r16

    authors: Podell S,Gaasterland T

    update_date:2007-01-01 00:00:00

  • Assessing taxonomic metagenome profilers with OPAL.

    abstract::The explosive growth in taxonomic metagenome profiling methods over the past years has created a need for systematic comparisons using relevant performance criteria. The Open-community Profiling Assessment tooL (OPAL) implements commonly used performance metrics, including those of the first challenge of the initiativ...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/s13059-019-1646-y

    authors: Meyer F,Bremges A,Belmann P,Janssen S,McHardy AC,Koslicki D

    update_date:2019-03-04 00:00:00

  • A comparison of programmed cell death between species.

    abstract::Key components of the programmed cell death pathway are conserved between Caenorhabditis elegans, Drosophila melanogaster and humans. The search for additional homologs has been facilitated by the availability of the entire genomic sequence for each of these organisms. ...

    journal_title:Genome biology

    pub_type: Journal Article, Review

    doi:10.1186/gb-2000-1-3-reviews0003

    authors: Tittel JN,Steller H

    update_date:2000-01-01 00:00:00

  • A novel protein encoded by circular SMO RNA is essential for Hedgehog signaling activation and glioblastoma tumorigenicity.

    abstract:BACKGROUND:Aberrant activation of the Hedgehog pathway drives tumorigenesis of many cancers, including glioblastoma. However, the sensitization mechanism of the G protein-coupled-like receptor smoothened (SMO), a key component of Hedgehog signaling, remains largely unknown. RESULTS:In this study, we describe a novel p...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/s13059-020-02250-6

    authors: Wu X,Xiao S,Zhang M,Yang L,Zhong J,Li B,Li F,Xia X,Li X,Zhou H,Liu D,Huang N,Yang X,Xiao F,Zhang N

    update_date:2021-01-14 00:00:00

  • The rhomboid protease family: a decade of progress on function and mechanism.

    abstract::Rhomboid proteases are the largest family of enzymes that hydrolyze peptide bonds within the cell membrane. Although discovered to be serine proteases only a decade ago, rhomboid proteases are already considered to be the best understood intramembrane proteases. The presence of rhomboid proteins in all domains of life...

    journal_title:Genome biology

    pub_type: Journal Article, Review

    doi:10.1186/gb-2011-12-10-231

    authors: Urban S,Dickey SW

    update_date:2011-10-27 00:00:00

  • Clustering of phosphorylation site recognition motifs can be exploited to predict the targets of cyclin-dependent kinase.

    abstract::Protein kinases are critical to cellular signalling and post-translational gene regulation, but their biological substrates are difficult to identify. We show that cyclin-dependent kinase (CDK) consensus motifs are frequently clustered in CDK substrate proteins. Based on this, we introduce a new computational strategy...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2007-8-2-r23

    authors: Moses AM,Hériché JK,Durbin R

    update_date:2007-01-01 00:00:00

  • Quantitative protein expression profiling reveals extensive post-transcriptional regulation and post-translational modifications in schizont-stage malaria parasites.

    abstract:BACKGROUND:Malaria is one of the most important infectious diseases and is caused by parasitic protozoa of the genus Plasmodium. Previously, quantitative characterization of the P. falciparum transcriptome demonstrated that the strictly controlled progression of these parasites through their intra-erythrocytic develo...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2008-9-12-r177

    authors: Foth BJ,Zhang N,Mok S,Preiser PR,Bozdech Z

    update_date:2008-01-01 00:00:00

  • A compendium of Caenorhabditis elegans regulatory transcription factors: a resource for mapping transcription regulatory networks.

    abstract:BACKGROUND:Transcription regulatory networks are composed of interactions between transcription factors and their target genes. Whereas unicellular networks have been studied extensively, metazoan transcription regulatory networks remain largely unexplored. Caenorhabditis elegans provides a powerful model to study such...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2005-6-13-r110

    authors: Reece-Hoyes JS,Deplancke B,Shingles J,Grove CA,Hope IA,Walhout AJ

    update_date:2005-01-01 00:00:00

  • Consensus clustering and functional interpretation of gene-expression data.

    abstract::Microarray analysis using clustering algorithms can suffer from lack of inter-method consistency in assigning related gene-expression profiles to clusters. Obtaining a consensus set of clusters from a number of clustering methods should improve confidence in gene-expression analysis. Here we introduce consensus cluste...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2004-5-11-r94

    authors: Swift S,Tucker A,Vinciotti V,Martin N,Orengo C,Liu X,Kellam P

    update_date:2004-01-01 00:00:00

  • Global orchestration of gene expression by the biological clock of cyanobacteria.

    abstract::Prokaryotic cyanobacteria express robust circadian (daily) rhythms under the control of a central clock. Recent studies shed light on the mechanisms governing circadian rhythms in cyanobacteria and highlight key differences between prokaryotic and eukaryotic clocks. ...

    journal_title:Genome biology

    pub_type: Journal Article, Review

    doi:10.1186/gb-2004-5-4-217

    authors: Johnson CH

    update_date:2004-01-01 00:00:00

  • Chromatin accessibility reveals insights into androgen receptor activation and transcriptional specificity.

    abstract:BACKGROUND:Epigenetic mechanisms such as chromatin accessibility impact transcription factor binding to DNA and transcriptional specificity. The androgen receptor (AR), a master regulator of the male phenotype and prostate cancer pathogenesis, acts primarily through ligand-activated transcription of target genes. Altho...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2012-13-10-r88

    authors: Tewari AK,Yardimci GG,Shibata Y,Sheffield NC,Song L,Taylor BS,Georgiev SG,Coetzee GA,Ohler U,Furey TS,Crawford GE,Febbo PG

    update_date:2012-10-03 00:00:00

  • Normalization of boutique two-color microarrays with a high proportion of differentially expressed probes.

    abstract::Normalization is critical for removing systematic variation from microarray data. For two-color microarray platforms, intensity-dependent lowess normalization is commonly used to correct relative gene expression values for biases. Here we outline a normalization method for use when the assumptions of lowess normalizat...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2007-8-1-r2

    authors: Oshlack A,Emslie D,Corcoran LM,Smyth GK

    update_date:2007-01-01 00:00:00

  • Decode-seq: a practical approach to improve differential gene expression analysis.

    abstract::Many differential gene expression analyses are conducted with an inadequate number of biological replicates. We describe an easy and effective RNA-seq approach using molecular barcoding to enable profiling of a large number of replicates simultaneously. This approach significantly improves the performance of different...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/s13059-020-01966-9

    authors: Li Y,Yang H,Zhang H,Liu Y,Shang H,Zhao H,Zhang T,Tu Q

    update_date:2020-03-23 00:00:00

  • Reproducible inference of transcription factor footprints in ATAC-seq and DNase-seq datasets using protocol-specific bias modeling.

    abstract:BACKGROUND:DNase-seq and ATAC-seq are broadly used methods to assay open chromatin regions genome-wide. The single nucleotide resolution of DNase-seq has been further exploited to infer transcription factor binding sites (TFBSs) in regulatory regions through footprinting. Recent studies have demonstrated the sequence b...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/s13059-019-1654-y

    authors: Karabacak Calviello A,Hirsekorn A,Wurmus R,Yusuf D,Ohler U

    update_date:2019-02-21 00:00:00

  • Histone variants: are they functionally heterogeneous?

    abstract::In most eukaryotes, histones, which are the major structural components of chromatin, are expressed as a family of sequence variants encoded by multiple genes. Because different histone variants can contribute to a distinct or unique nucleosomal architecture, this heterogeneity can be exploited to regulate a wide rang...

    journal_title:Genome biology

    pub_type: Journal Article, Review

    doi:10.1186/gb-2001-2-7-reviews0006

    authors: Brown DT

    update_date:2001-01-01 00:00:00

  • EpiTEome: Simultaneous detection of transposable element insertion sites and their DNA methylation levels.

    abstract::The genome-wide investigation of DNA methylation levels has been limited to reference transposable element positions. The methylation analysis of non-reference and mobile transposable elements has only recently been performed, but required both genome resequencing and MethylC-seq datasets. We have created epiTEome, a ...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/s13059-017-1232-0

    authors: Daron J,Slotkin RK

    update_date:2017-05-12 00:00:00

  • A novel mode of chromosomal evolution peculiar to filamentous Ascomycete fungi.

    abstract:BACKGROUND:Gene loss, inversions, translocations, and other chromosomal rearrangements vary among species, resulting in different rates of structural genome evolution. Major chromosomal rearrangements are rare in most eukaryotes, giving large regions with the same genes in the same order and orientation across species....

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2011-12-5-r45

    authors: Hane JK,Rouxel T,Howlett BJ,Kema GH,Goodwin SB,Oliver RP

    update_date:2011-01-01 00:00:00

  • A top-down view on DNA replication and recombination from 9,000 feet above sea level.

    abstract::A report of the Keystone Symposium 'DNA Replication and Recombination' held in Keystone, USA, 27 February to 4 March 2011. ...

    journal_title:Genome biology

    pub_type:

    doi:10.1186/gb-2011-12-4-304

    authors: Johansson E,Speck C,Chabes A

    update_date:2011-01-01 00:00:00

  • Using the canary genome to decipher the evolution of hormone-sensitive gene regulation in seasonal singing birds.

    abstract:BACKGROUND:While the song of all songbirds is controlled by the same neural circuit, the hormone dependence of singing behavior varies greatly between species. For this reason, songbirds are ideal organisms to study ultimate and proximate mechanisms of hormone-dependent behavior and neuronal plasticity. RESULTS:We pre...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/s13059-014-0578-9

    authors: Frankl-Vilches C,Kuhl H,Werber M,Klages S,Kerick M,Bakker A,de Oliveira EH,Reusch C,Capuano F,Vowinckel J,Leitner S,Ralser M,Timmermann B,Gahr M

    update_date:2015-01-29 00:00:00

  • Statistical tools for synthesizing lists of differentially expressed features in related experiments.

    abstract::We propose a novel approach for finding a list of features that are commonly perturbed in two or more experiments, quantifying the evidence of dependence between the experiments by a ratio. We present a Bayesian analysis of this ratio, which leads us to suggest two rules for choosing a cut-off on the ranked list of p ...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2007-8-4-r54

    authors: Blangiardo M,Richardson S

    update_date:2007-01-01 00:00:00

  • Retrozymes are a unique family of non-autonomous retrotransposons with hammerhead ribozymes that propagate in plants through circular RNAs.

    abstract:BACKGROUND:Catalytic RNAs, or ribozymes, are regarded as fossils of a prebiotic RNA world that have remained in the genomes of modern organisms. The simplest ribozymes are the small self-cleaving RNAs, like the hammerhead ribozyme, which have been historically considered biological oddities restricted to some RNA patho...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/s13059-016-1002-4

    authors: Cervera A,Urbina D,de la Peña M

    update_date:2016-06-23 00:00:00

  • A genomic and evolutionary approach reveals non-genetic drug resistance in malaria.

    abstract:BACKGROUND:Drug resistance remains a major public health challenge for malaria treatment and eradication. Individual loci associated with drug resistance to many antimalarials have been identified, but their epistasis with other resistance mechanisms has not yet been elucidated. RESULTS:We previously described two mut...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/PREACCEPT-1067113631444973

    authors: Herman JD,Rice DP,Ribacke U,Silterra J,Deik AA,Moss EL,Broadbent KM,Neafsey DE,Desai MM,Clish CB,Mazitschek R,Wirth DF

    update_date:2014-01-01 00:00:00

  • Gene expression analysis of nuclear factor I-A deficient mice indicates delayed brain maturation.

    abstract:BACKGROUND:Nuclear factor I-A (NFI-A), a phylogenetically conserved transcription/replication protein, plays a crucial role in mouse brain development. Previous studies have shown that disruption of the Nfia gene in mice leads to perinatal lethality, corpus callosum agenesis, and hydrocephalus. RESULTS:To identify pot...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2007-8-5-r72

    authors: Wong YW,Schulze C,Streichert T,Gronostajski RM,Schachner M,Tilling T

    update_date:2007-01-01 00:00:00

  • ELXR: a resource for rapid exon-directed sequence analysis.

    abstract::ELXR (Exon Locator and Extractor for Resequencing) streamlines the process of determining exon/intron boundaries and designing PCR and sequencing primers for high-throughput resequencing of exons. We have pre-computed ELXR primer sets for all exons identified from the human, mouse, and rat mRNA reference sequence (Ref...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2004-5-5-r36

    authors: Schageman JJ,Horton CJ,Niu S,Garner HR,Pertsemlidis A

    update_date:2004-01-01 00:00:00

  • Reduced selection leads to accelerated gene loss in Shigella.

    abstract:BACKGROUND:Obligate pathogenic bacteria lose more genes relative to facultative pathogens, which, in turn, lose more genes than free-living bacteria. It was suggested that the increased gene loss in obligate pathogens may be due to a reduction in the effectiveness of purifying selection. Less attention has been given t...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2007-8-8-r164

    authors: Hershberg R,Tang H,Petrov DA

    update_date:2007-01-01 00:00:00

  • Genomics through the lens of next-generation sequencing.

    abstract::A report on the 23rd annual meeting on 'The Biology of Genomes', 11-15 May 2010, Cold Spring Harbor, USA. ...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2010-11-6-306

    authors: Capra JA,Carbone L,Riesenfeld SJ,Wall JD

    update_date:2010-01-01 00:00:00

  • Selection in the evolution of gene duplications.

    abstract:BACKGROUND:Gene duplications have a major role in the evolution of new biological functions. Theoretical studies often assume that a duplication per se is selectively neutral and that, following a duplication, one of the gene copies is freed from purifying (stabilizing) selection, which creates the potential for evolut...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2002-3-2-research0008

    authors: Kondrashov FA,Rogozin IB,Wolf YI,Koonin EV

    update_date:2002-01-01 00:00:00

  • The real cost of sequencing: scaling computation to keep pace with data generation.

    abstract::As the cost of sequencing continues to decrease and the amount of sequence data generated grows, new paradigms for data storage and analysis are increasingly important. The relative scaling behavior of these evolving technologies will impact genomics research moving forward. ...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/s13059-016-0917-0

    authors: Muir P,Li S,Lou S,Wang D,Spakowicz DJ,Salichos L,Zhang J,Weinstock GM,Isaacs F,Rozowsky J,Gerstein M

    update_date:2016-03-23 00:00:00

  • Homologous recombination: from model organisms to human disease.

    abstract::Recent experiments show that properly controlled recombination between homologous DNA molecules is essential for the maintenance of genome stability and for the prevention of tumorigenesis. ...

    journal_title:Genome biology

    pub_type: Journal Article, Review

    doi:10.1186/gb-2001-2-5-reviews1014

    authors: Modesti M,Kanaar R

    update_date:2001-01-01 00:00:00

  • Comprehensive miRNA sequence analysis reveals survival differences in diffuse large B-cell lymphoma patients.

    abstract:BACKGROUND:Diffuse large B-cell lymphoma (DLBCL) is an aggressive disease, with 30% to 40% of patients failing to be cured with available primary therapy. microRNAs (miRNAs) are RNA molecules that attenuate expression of their mRNA targets. To characterize the DLBCL miRNome, we sequenced miRNAs from 92 DLBCL and 15 ben...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/s13059-014-0568-y

    authors: Lim EL,Trinh DL,Scott DW,Chu A,Krzywinski M,Zhao Y,Robertson AG,Mungall AJ,Schein J,Boyle M,Mottok A,Ennishi D,Johnson NA,Steidl C,Connors JM,Morin RD,Gascoyne RD,Marra MA

    update_date:2015-01-29 00:00:00