A curated benchmark of enhancer-gene interactions for evaluating enhancer-target gene prediction methods.

Abstract:

BACKGROUND:Many genome-wide collections of candidate cis-regulatory elements (cCREs) have been defined using genomic and epigenomic data, but it remains a major challenge to connect these elements to their target genes. RESULTS:To facilitate the development of computational methods for predicting target genes, we develop a Benchmark of candidate Enhancer-Gene Interactions (BENGI) by integrating the recently developed Registry of cCREs with experimentally derived genomic interactions. We use BENGI to test several published computational methods for linking enhancers with genes, including signal correlation and the TargetFinder and PEP supervised learning methods. We find that while TargetFinder is the best-performing method, it is only modestly better than a baseline distance method for most benchmark datasets when trained and tested with the same cell type and that TargetFinder often does not outperform the distance method when applied across cell types. CONCLUSIONS:Our results suggest that current computational methods need to be improved and that BENGI presents a useful framework for method development and testing.

journal_name

Genome Biol

journal_title

Genome biology

authors

Moore JE,Pratt HE,Purcaro MJ,Weng Z

doi

10.1186/s13059-019-1924-8

subject

Has Abstract

pub_date

2020-01-22 00:00:00

pages

17

issue

1

eissn

1474-7596

issn

1474-760X

pii

10.1186/s13059-019-1924-8

journal_volume

21

pub_type

杂志文章
  • MicroRNAs and their isomiRs function cooperatively to target common biological pathways.

    abstract:BACKGROUND:Variants of microRNAs (miRNAs), called isomiRs, are commonly reported in deep-sequencing studies; however, the functional significance of these variants remains controversial. Observational studies show that isomiR patterns are non-random, hinting that these molecules could be regulated and therefore functio...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2011-12-12-r126

    authors: Cloonan N,Wani S,Xu Q,Gu J,Lea K,Heater S,Barbacioru C,Steptoe AL,Martin HC,Nourbakhsh E,Krishnan K,Gardiner B,Wang X,Nones K,Steen JA,Matigian NA,Wood DL,Kassahn KS,Waddell N,Shepherd J,Lee C,Ichikawa J,McKer

    更新日期:2011-12-30 00:00:00

  • Survey of human mitochondrial diseases using new genomic/proteomic tools.

    abstract:BACKGROUND:We have constructed Bayesian prior-based, amino-acid sequence profiles for the complete yeast mitochondrial proteome and used them to develop methods for identifying and characterizing the context of protein mutations that give rise to human mitochondrial diseases. (Bayesian priors are conditional probabilit...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2001-2-6-research0021

    authors: Plasterer TN,Smith TF,Mohr SC

    更新日期:2001-01-01 00:00:00

  • Decode-seq: a practical approach to improve differential gene expression analysis.

    abstract::Many differential gene expression analyses are conducted with an inadequate number of biological replicates. We describe an easy and effective RNA-seq approach using molecular barcoding to enable profiling of a large number of replicates simultaneously. This approach significantly improves the performance of different...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-020-01966-9

    authors: Li Y,Yang H,Zhang H,Liu Y,Shang H,Zhao H,Zhang T,Tu Q

    更新日期:2020-03-23 00:00:00

  • Mining physical protein-protein interactions from the literature.

    abstract:BACKGROUND:Deciphering physical protein-protein interactions is fundamental to elucidating both the functions of proteins and biological processes. The development of high-throughput experimental technologies such as the yeast two-hybrid screening has produced an explosion in data relating to interactions. Since manual...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2008-9-s2-s12

    authors: Huang M,Ding S,Wang H,Zhu X

    更新日期:2008-01-01 00:00:00

  • Muscular expressions: profiling genes in complex tissues.

    abstract::Gene-expression profiling has yielded important information about simple systems, but complex tissues have not yet been widely profiled. Four recent studies of mammalian skeletal muscles have added to the catalogs of their gene expression differences, but have yet to lead to better understanding of the molecular proce...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2001-2-12-reviews1033

    authors: Hampson R,Hughes SM

    更新日期:2001-01-01 00:00:00

  • Discovery and functional prioritization of Parkinson's disease candidate genes from large-scale whole exome sequencing.

    abstract:BACKGROUND:Whole-exome sequencing (WES) has been successful in identifying genes that cause familial Parkinson's disease (PD). However, until now this approach has not been deployed to study large cohorts of unrelated participants. To discover rare PD susceptibility variants, we performed WES in 1148 unrelated cases an...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-017-1147-9

    authors: Jansen IE,Ye H,Heetveld S,Lechler MC,Michels H,Seinstra RI,Lubbe SJ,Drouet V,Lesage S,Majounie E,Gibbs JR,Nalls MA,Ryten M,Botia JA,Vandrovcova J,Simon-Sanchez J,Castillo-Lizardo M,Rizzu P,Blauwendraat C,Chouhan AK

    更新日期:2017-01-30 00:00:00

  • A cell surface interaction network of neural leucine-rich repeat receptors.

    abstract:BACKGROUND:The vast number of precise intercellular connections within vertebrate nervous systems is only partly explained by the comparatively few known extracellular guidance cues. Large families of neural orphan receptor proteins have been identified and are likely to contribute to these recognition processes but du...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2009-10-9-r99

    authors: Söllner C,Wright GJ

    更新日期:2009-01-01 00:00:00

  • The circadian clock goes genomic.

    abstract::Large-scale biology among plant species, as well as comparative genomics of circadian clock architecture and clock-regulated output processes, have greatly advanced our understanding of the endogenous timing system in plants. ...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2013-14-6-208

    authors: Staiger D,Shin J,Johansson M,Davis SJ

    更新日期:2013-06-24 00:00:00

  • Clustered CTCF binding is an evolutionary mechanism to maintain topologically associating domains.

    abstract:BACKGROUND:CTCF binding contributes to the establishment of a higher-order genome structure by demarcating the boundaries of large-scale topologically associating domains (TADs). However, despite the importance and conservation of TADs, the role of CTCF binding in their evolution and stability remains elusive. RESULTS...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-019-1894-x

    authors: Kentepozidou E,Aitken SJ,Feig C,Stefflova K,Ibarra-Soria X,Odom DT,Roller M,Flicek P

    更新日期:2020-01-07 00:00:00

  • Annotation and analysis of 10,000 expressed sequence tags from developing mouse eye and adult retina.

    abstract:BACKGROUND:As a biomarker of cellular activities, the transcriptome of a specific tissue or cell type during development and disease is of great biomedical interest. We have generated and analyzed 10,000 expressed sequence tags (ESTs) from three mouse eye tissue cDNA libraries: embryonic day 15.5 (M15E) eye, postnatal ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2003-4-10-r65

    authors: Yu J,Farjo R,MacNee SP,Baehr W,Stambolian DE,Swaroop A

    更新日期:2003-01-01 00:00:00

  • Cytoscape Automation: empowering workflow-based network analysis.

    abstract::Cytoscape is one of the most successful network biology analysis and visualization tools, but because of its interactive nature, its role in creating reproducible, scalable, and novel workflows has been limited. We describe Cytoscape Automation (CA), which marries Cytoscape to highly productive workflow systems, for e...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-019-1758-4

    authors: Otasek D,Morris JH,Bouças J,Pico AR,Demchak B

    更新日期:2019-09-02 00:00:00

  • The R-spondin protein family.

    abstract::The four vertebrate R-spondin proteins are secreted agonists of the canonical Wnt/β-catenin signaling pathway. These proteins are approximately 35 kDa, and are characterized by two amino-terminal furin-like repeats, which are necessary and sufficient for Wnt signal potentiation, and a thrombospondin domain situated mo...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2012-13-3-242

    authors: de Lau WB,Snel B,Clevers HC

    更新日期:2012-01-01 00:00:00

  • A small RNA makes a Bic difference.

    abstract::The first highly specific knockouts of a microRNA, miR155, in mice result in multiple defects in adaptive immunity, and also show the feasibility of investigating at least some microRNAs by gene knockout. ...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2007-8-7-221

    authors: Moffett HF,Novina CD

    更新日期:2007-01-01 00:00:00

  • NicE-seq: high resolution open chromatin profiling.

    abstract::Open chromatin profiling integrates information across diverse regulatory elements to reveal the transcriptionally active genome. Tn5 transposase and DNase I sequencing-based methods prefer native or high cell numbers. Here, we describe NicE-seq (nicking enzyme assisted sequencing) for high-resolution open chromatin p...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-017-1247-6

    authors: Ponnaluri VKC,Zhang G,Estève PO,Spracklin G,Sian S,Xu SY,Benoukraf T,Pradhan S

    更新日期:2017-06-28 00:00:00

  • Substantial contribution of genetic variation in the expression of transcription factors to phenotypic variation revealed by eRD-GWAS.

    abstract:BACKGROUND:There are significant limitations in existing methods for the genome-wide identification of genes whose expression patterns affect traits. RESULTS:The transcriptomes of five tissues from 27 genetically diverse maize inbred lines were deeply sequenced to identify genes exhibiting high and low levels of expre...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-017-1328-6

    authors: Lin HY,Liu Q,Li X,Yang J,Liu S,Huang Y,Scanlon MJ,Nettleton D,Schnable PS

    更新日期:2017-10-17 00:00:00

  • An ontology for cell types.

    abstract::We describe an ontology for cell types that covers the prokaryotic, fungal, animal and plant worlds. It includes over 680 cell types. These cell types are classified under several generic categories and are organized as a directed acyclic graph. The ontology is available in the formats adopted by the Open Biological O...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2005-6-2-r21

    authors: Bard J,Rhee SY,Ashburner M

    更新日期:2005-01-01 00:00:00

  • The real cost of sequencing: scaling computation to keep pace with data generation.

    abstract::As the cost of sequencing continues to decrease and the amount of sequence data generated grows, new paradigms for data storage and analysis are increasingly important. The relative scaling behavior of these evolving technologies will impact genomics research moving forward. ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-016-0917-0

    authors: Muir P,Li S,Lou S,Wang D,Spakowicz DJ,Salichos L,Zhang J,Weinstock GM,Isaacs F,Rozowsky J,Gerstein M

    更新日期:2016-03-23 00:00:00

  • CRISPhieRmix: a hierarchical mixture model for CRISPR pooled screens.

    abstract::Pooled CRISPR screens allow researchers to interrogate genetic causes of complex phenotypes at the genome-wide scale and promise higher specificity and sensitivity compared to competing technologies. Unfortunately, two problems exist, particularly for CRISPRi/a screens: variability in guide efficiency and large rare o...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-018-1538-6

    authors: Daley TP,Lin Z,Lin X,Liu Y,Wong WH,Qi LS

    更新日期:2018-10-08 00:00:00

  • Transcriptome analysis of human tissues and cell lines reveals one dominant transcript per gene.

    abstract:BACKGROUND:RNA sequencing has opened new avenues for the study of transcriptome composition. Significant evidence has accumulated showing that the human transcriptome contains in excess of a hundred thousand different transcripts. However, it is still not clear to what extent this diversity prevails when considering th...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2013-14-7-r70

    authors: Gonzàlez-Porta M,Frankish A,Rung J,Harrow J,Brazma A

    更新日期:2013-07-01 00:00:00

  • Wrangling for microRNAs provokes much crosstalk.

    abstract::Levels of transcripts sharing microRNA response elements are co-regulated. These RNA-RNA interactions imply that combinations of microRNAs modulate cell-specific transcript networks. ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2011-12-11-132

    authors: Marques AC,Tan J,Ponting CP

    更新日期:2011-11-21 00:00:00

  • Dating branches on the tree of life using DNA.

    abstract::The use of DNA sequences to estimate the timing of evolutionary events is increasingly popular, although it is fraught with practical difficulties. But the exponential growth of relevant information and improved methods of analysis are providing increasingly reliable sequence-derived dates, and it may become possible ...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2001-3-1-reviews0001

    authors: Wray GA

    更新日期:2002-01-01 00:00:00

  • Interrogation of global mutagenesis data with a genome scale model of Neisseria meningitidis to assess gene fitness in vitro and in sera.

    abstract:BACKGROUND:Neisseria meningitidis is an important human commensal and pathogen that causes several thousand deaths each year, mostly in young children. How the pathogen replicates and causes disease in the host is largely unknown, particularly the role of metabolism in colonization and disease. Completed genome sequenc...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2011-12-12-r127

    authors: Mendum TA,Newcombe J,Mannan AA,Kierzek AM,McFadden J

    更新日期:2011-12-30 00:00:00

  • Constitutive patterns of gene expression regulated by RNA-binding proteins.

    abstract:BACKGROUND:RNA-binding proteins regulate a number of cellular processes, including synthesis, folding, translocation, assembly and clearance of RNAs. Recent studies have reported that an unexpectedly large number of proteins are able to interact with RNA, but the partners of many RNA-binding proteins are still uncharac...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2014-15-1-r13

    authors: Cirillo D,Marchese D,Agostini F,Livi CM,Botta-Orfila T,Tartaglia GG

    更新日期:2014-01-02 00:00:00

  • The nature of evidence for and against epigenetic inheritance.

    abstract::Not so fast. The Iqbal et. al. study and the associated Whitelaw commentary highlight the appropriately high standards of study design and interpretation needed to obtain good evidence for or against epigenetic inheritance. Please see related article: www.dx.doi.org/10.1186/s13059-015-0714-1. ...

    journal_title:Genome biology

    pub_type: 评论,信件

    doi:10.1186/s13059-015-0709-y

    authors: Nadeau JH

    更新日期:2015-07-11 00:00:00

  • Vex-seq: high-throughput identification of the impact of genetic variation on pre-mRNA splicing efficiency.

    abstract::Understanding the functional impact of genomic variants is a major goal of modern genetics and personalized medicine. Although many synonymous and non-coding variants act through altering the efficiency of pre-mRNA splicing, it is difficult to predict how these variants impact pre-mRNA splicing. Here, we describe a ma...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-018-1437-x

    authors: Adamson SI,Zhan L,Graveley BR

    更新日期:2018-06-01 00:00:00

  • External signals shape the epigenome.

    abstract::A new study shows how a single cytokine, interleukin-4, regulates hematopoietic lineage choice by activating the JAK3-STAT6 pathway, which causes dendritic-cell-specific DNA demethylation. ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-016-0884-5

    authors: Lennartsson A

    更新日期:2016-02-01 00:00:00

  • The WUS homeobox-containing (WOX) protein family.

    abstract::The WOX genes form a plant-specific subclade of the eukaryotic homeobox transcription factor superfamily, which is characterized by the presence of a conserved DNA-binding homeodomain. The analysis of WOX gene expression and function shows that WOX family members fulfill specialized functions in key developmental proc...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2009-10-12-248

    authors: van der Graaff E,Laux T,Rensing SA

    更新日期:2009-01-01 00:00:00

  • ELXR: a resource for rapid exon-directed sequence analysis.

    abstract::ELXR (Exon Locator and Extractor for Resequencing) streamlines the process of determining exon/intron boundaries and designing PCR and sequencing primers for high-throughput resequencing of exons. We have pre-computed ELXR primer sets for all exons identified from the human, mouse, and rat mRNA reference sequence (Ref...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2004-5-5-r36

    authors: Schageman JJ,Horton CJ,Niu S,Garner HR,Pertsemlidis A

    更新日期:2004-01-01 00:00:00

  • Discovery of estrogen receptor alpha target genes and response elements in breast tumor cells.

    abstract:BACKGROUND:Estrogens and their receptors are important in human development, physiology and disease. In this study, we utilized an integrated genome-wide molecular and computational approach to characterize the interaction between the activated estrogen receptor (ER) and the regulatory elements of candidate target gene...

    journal_title:Genome biology

    pub_type: 杂志文章,meta分析

    doi:10.1186/gb-2004-5-9-r66

    authors: Lin CY,Ström A,Vega VB,Kong SL,Yeo AL,Thomsen JS,Chan WC,Doray B,Bangarusamy DK,Ramasamy A,Vergara LA,Tang S,Chong A,Bajic VB,Miller LD,Gustafsson JA,Liu ET

    更新日期:2004-01-01 00:00:00

  • Recurrent evolution of heat-responsiveness in Brassicaceae COPIA elements.

    abstract:BACKGROUND:The mobilization of transposable elements (TEs) is suppressed by host genome defense mechanisms. Recent studies showed that the cis-regulatory region of Arabidopsis thaliana COPIA78/ONSEN retrotransposons contains heat-responsive elements (HREs), which cause their activation during heat stress. However, it r...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-016-1072-3

    authors: Pietzenuk B,Markus C,Gaubert H,Bagwan N,Merotto A,Bucher E,Pecinka A

    更新日期:2016-10-11 00:00:00