Promoter features related to tissue specificity as measured by Shannon entropy.

Abstract:

BACKGROUND:The regulatory mechanisms underlying tissue specificity are a crucial part of the development and maintenance of multicellular organisms. A genome-wide analysis of promoters in the context of gene-expression patterns in tissue surveys provides a means of identifying the general principles for these mechanisms. RESULTS:We introduce a definition of tissue specificity based on Shannon entropy to rank human genes according to their overall tissue specificity and by their specificity to particular tissues. We apply our definition to microarray-based and expressed sequence tag (EST)-based expression data for human genes and use similar data for mouse genes to validate our results. We show that most genes show statistically significant tissue-dependent variations in expression level. We find that the most tissue-specific genes typically have a TATA box, no CpG island, and often code for extracellular proteins. As expected, CpG islands are found in most of the least tissue-specific genes, which often code for proteins located in the nucleus or mitochondrion. The class of genes with no CpG island or TATA box are the most common mid-specificity genes and commonly code for proteins located in a membrane. Sp1 was found to be a weak indicator of less-specific expression. YY1 binding sites, either as initiators or as downstream sites, were strongly associated with the least-specific genes. CONCLUSIONS:We have begun to understand the components of promoters that distinguish tissue-specific from ubiquitous genes, to identify associations that can predict the broad class of gene expression from sequence data alone.

journal_name

Genome Biol

journal_title

Genome biology

authors

Schug J,Schuller WP,Kappen C,Salbaum JM,Bucan M,Stoeckert CJ Jr

doi

10.1186/gb-2005-6-4-r33

keywords:

subject

Has Abstract

pub_date

2005-01-01 00:00:00

pages

R33

issue

4

eissn

1474-7596

issn

1474-760X

pii

gb-2005-6-4-r33

journal_volume

6

pub_type

杂志文章
  • Characterization of the expression ratio noise structure in high-density oligonucleotide arrays.

    abstract:BACKGROUND:High-density oligonucleotide microarrays provide a powerful tool for assessing differential mRNA expression levels. Characterizing the noise resulting from the enzymatic and hybridization steps, called type I noise, is essential for attributing significance measures to the differential expression scores. We ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:

    authors: Naef F,Hacker CR,Patil N,Magnasco M

    更新日期:2002-01-01 00:00:00

  • Butterfly gene flow goes berserk.

    abstract::A new study shows that genomic introgression between two Heliconius butterfly species is not solely confined to color pattern loci. ...

    journal_title:Genome biology

    pub_type: 评论,杂志文章

    doi:10.1186/s13059-016-0898-z

    authors: Ffrench-Constant RH

    更新日期:2016-02-27 00:00:00

  • Comparative genomics reveals a constant rate of origination and convergent acquisition of functional retrogenes in Drosophila.

    abstract:BACKGROUND:Processed copies of genes (retrogenes) are duplicate genes that originated through the reverse-transcription of a host transcript and insertion in the genome. This type of gene duplication, as any other, could be a source of new genes and functions. Using whole genome sequence data for 12 Drosophila species,...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2007-8-1-r11

    authors: Bai Y,Casola C,Feschotte C,Betrán E

    更新日期:2007-01-01 00:00:00

  • A physical map for the Amborella trichopoda genome sheds light on the evolution of angiosperm genome structure.

    abstract:BACKGROUND:Recent phylogenetic analyses have identified Amborella trichopoda, an understory tree species endemic to the forests of New Caledonia, as sister to a clade including all other known flowering plant species. The Amborella genome is a unique reference for understanding the evolution of angiosperm genomes becau...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2011-12-5-r48

    authors: Zuccolo A,Bowers JE,Estill JC,Xiong Z,Luo M,Sebastian A,Goicoechea JL,Collura K,Yu Y,Jiao Y,Duarte J,Tang H,Ayyampalayam S,Rounsley S,Kudrna D,Paterson AH,Pires JC,Chanderbali A,Soltis DE,Chamala S,Barbazuk B,So

    更新日期:2011-01-01 00:00:00

  • Emerging roles of chromatin in the maintenance of genome organization and function in plants.

    abstract::Chromatin is not a uniform macromolecular entity; it contains different domains characterized by complex signatures of DNA and histone modifications. Such domains are organized both at a linear scale along the genome and spatially within the nucleus. We discuss recent discoveries regarding mechanisms that establish bo...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/s13059-017-1236-9

    authors: Vergara Z,Gutierrez C

    更新日期:2017-05-23 00:00:00

  • Comparative genomics of gene-family size in closely related bacteria.

    abstract:BACKGROUND:The wealth of genomic data in bacteria is helping microbiologists understand the factors involved in gene innovation. Among these, the expansion and reduction of gene families appears to have a fundamental role in this, but the factors influencing gene family size are unclear. RESULTS:The relative content o...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2004-5-4-r27

    authors: Pushker R,Mira A,Rodríguez-Valera F

    更新日期:2004-01-01 00:00:00

  • A human functional protein interaction network and its application to cancer data analysis.

    abstract:BACKGROUND:One challenge facing biologists is to tease out useful information from massive data sets for further analysis. A pathway-based analysis may shed light by projecting candidate genes onto protein functional relationship networks. We are building such a pathway-based analysis system. RESULTS:We have construct...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2010-11-5-r53

    authors: Wu G,Feng X,Stein L

    更新日期:2010-01-01 00:00:00

  • quantro: a data-driven approach to guide the choice of an appropriate normalization method.

    abstract::Normalization is an essential step in the analysis of high-throughput data. Multi-sample global normalization methods, such as quantile normalization, have been successfully used to remove technical variation. However, these methods rely on the assumption that observed global changes across samples are due to unwanted...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-015-0679-0

    authors: Hicks SC,Irizarry RA

    更新日期:2015-06-04 00:00:00

  • FAN-C: a feature-rich framework for the analysis and visualisation of chromosome conformation capture data.

    abstract::Chromosome conformation capture data, particularly from high-throughput approaches such as Hi-C, are typically very complex to analyse. Existing analysis tools are often single-purpose, or limited in compatibility to a small number of data formats, frequently making Hi-C analyses tedious and time-consuming. Here, we p...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-020-02215-9

    authors: Kruse K,Hug CB,Vaquerizas JM

    更新日期:2020-12-17 00:00:00

  • Wheat chromatin architecture is organized in genome territories and transcription factories.

    abstract:BACKGROUND:Polyploidy is ubiquitous in eukaryotic plant and fungal lineages, and it leads to the co-existence of several copies of similar or related genomes in one nucleus. In plants, polyploidy is considered a major factor in successful domestication. However, polyploidy challenges chromosome folding architecture in ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-020-01998-1

    authors: Concia L,Veluchamy A,Ramirez-Prado JS,Martin-Ramirez A,Huang Y,Perez M,Domenichini S,Rodriguez Granados NY,Kim S,Blein T,Duncan S,Pichot C,Manza-Mianza D,Juery C,Paux E,Moore G,Hirt H,Bergounioux C,Crespi M,Mahfouz

    更新日期:2020-04-29 00:00:00

  • Landscape and evolutionary dynamics of terminal repeat retrotransposons in miniature in plant genomes.

    abstract:BACKGROUND:Terminal repeat retrotransposons in miniature (TRIMs) are a unique group of small long terminal repeat retrotransposons that are difficult to identify. Thus far, only a few TRIMs have been characterized in the euphyllophytes, and their evolutionary and biological significance as well as their transposition m...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-015-0867-y

    authors: Gao D,Li Y,Kim KD,Abernathy B,Jackson SA

    更新日期:2016-01-18 00:00:00

  • Curated collection of yeast transcription factor DNA binding specificity data reveals novel structural and gene regulatory insights.

    abstract:BACKGROUND:Transcription factors (TFs) play a central role in regulating gene expression by interacting with cis-regulatory DNA elements associated with their target genes. Recent surveys have examined the DNA binding specificities of most Saccharomyces cerevisiae TFs, but a comprehensive evaluation of their data has b...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2011-12-12-r125

    authors: Gordân R,Murphy KF,McCord RP,Zhu C,Vedenko A,Bulyk ML

    更新日期:2011-12-21 00:00:00

  • Chipping away at major depressive disorder.

    abstract::An intriguing recent study examines the role of miR-1202, a glutamate receptor regulating microRNA, in regulating major depressive disorder. ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-014-0421-3

    authors: Rucker JJ,McGuffin P

    更新日期:2014-07-26 00:00:00

  • Statistical tests for differential expression in cDNA microarray experiments.

    abstract::Extracting biological information from microarray data requires appropriate statistical methods. The simplest statistical method for detecting differential expression is the t test, which can be used to compare two conditions when there is replication of samples. With more than two conditions, analysis of variance (AN...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2003-4-4-210

    authors: Cui X,Churchill GA

    更新日期:2003-01-01 00:00:00

  • Influence of metabolic network structure and function on enzyme evolution.

    abstract:BACKGROUND:Most studies of molecular evolution are focused on individual genes and proteins. However, understanding the design principles and evolutionary properties of molecular networks requires a system-wide perspective. In the present work we connect molecular evolution on the gene level with system properties of a...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2006-7-5-r39

    authors: Vitkup D,Kharchenko P,Wagner A

    更新日期:2006-01-01 00:00:00

  • Concept recognition for extracting protein interaction relations from biomedical text.

    abstract:BACKGROUND:Reliable information extraction applications have been a long sought goal of the biomedical text mining community, a goal that if reached would provide valuable tools to benchside biologists in their increasingly difficult task of assimilating the knowledge contained in the biomedical literature. We present ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2008-9-s2-s9

    authors: Baumgartner WA Jr,Lu Z,Johnson HL,Caporaso JG,Paquette J,Lindemann A,White EK,Medvedeva O,Cohen KB,Hunter L

    更新日期:2008-01-01 00:00:00

  • The greatest catch: big game fishing for mRNA-bound proteins.

    abstract::Purification of proteins cross-linked to mRNAs has identified 800 mRNA-binding proteins and their characteristics. ...

    journal_title:Genome biology

    pub_type:

    doi:10.1186/gb4030

    authors: Sibley CR,Attig J,Ule J

    更新日期:2012-07-17 00:00:00

  • Toxicity in mice expressing short hairpin RNAs gives new insight into RNAi.

    abstract::Short hairpin RNAs can provide stable gene silencing via RNA interference. Recent studies have shown toxicity in vivo that appears to be related to saturation of the endogenous microRNA pathway. Will these findings limit the therapeutic use of such hairpins? ...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2006-7-8-231

    authors: Snøve O Jr,Rossi JJ

    更新日期:2006-01-01 00:00:00

  • A Keystone for ncRNA.

    abstract::A report on the Keystone symposium 'Non-coding RNAs' held at Snowbird, Utah, USA, 31 March to 5 April 2012. ...

    journal_title:Genome biology

    pub_type:

    doi:10.1186/gb-2012-13-5-315

    authors: Hacisuleyman E,Cabili MN,Rinn JL

    更新日期:2012-05-25 00:00:00

  • Is mouse embryonic stem cell technology obsolete?

    abstract::Injection of recombinant Cas9 protein and synthetic guide RNAs into mouse zygotes has been shown to facilitate gene disruption and knock-ins using the CRISPR system. These technologies may soon displace genetic modification using embryonic stem cells. ...

    journal_title:Genome biology

    pub_type: 评论,杂志文章

    doi:10.1186/s13059-015-0673-6

    authors: Skarnes WC

    更新日期:2015-05-27 00:00:00

  • Non-base-contacting residues enable kaleidoscopic evolution of metazoan C2H2 zinc finger DNA binding.

    abstract:BACKGROUND:The C2H2 zinc finger (C2H2-ZF) is the most numerous protein domain in many metazoans, but is not as frequent or diverse in other eukaryotes. The biochemical and evolutionary mechanisms that underlie the diversity of this DNA-binding domain exclusively in metazoans are, however, mostly unknown. RESULTS:Here,...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-017-1287-y

    authors: Najafabadi HS,Garton M,Weirauch MT,Mnaimneh S,Yang A,Kim PM,Hughes TR

    更新日期:2017-09-06 00:00:00

  • Whole genome DNA sequencing provides an atlas of somatic mutagenesis in healthy human cells and identifies a tumor-prone cell type.

    abstract:BACKGROUND:The lifelong accumulation of somatic mutations underlies age-related phenotypes and cancer. Mutagenic forces are thought to shape the genome of aging cells in a tissue-specific way. Whole genome analyses of somatic mutation patterns, based on both types and genomic distribution of variants, can shed light on...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-019-1892-z

    authors: Franco I,Helgadottir HT,Moggio A,Larsson M,Vrtačnik P,Johansson A,Norgren N,Lundin P,Mas-Ponte D,Nordström J,Lundgren T,Stenvinkel P,Wennberg L,Supek F,Eriksson M

    更新日期:2019-12-18 00:00:00

  • Proteomic view of mitochondrial function.

    abstract::Genomic and proteomic studies have identified hundreds of proteins from mitochondria. A recent study has added a functional twist to these systematic approaches and identified novel mitochondrial modifiers and regulators. ...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2008-9-2-209

    authors: Dimmer KS,Rapaport D

    更新日期:2008-01-01 00:00:00

  • Permutation-validated principal components analysis of microarray data.

    abstract:BACKGROUND:In microarray data analysis, the comparison of gene-expression profiles with respect to different conditions and the selection of biologically interesting genes are crucial tasks. Multivariate statistical methods have been applied to analyze these large datasets. Less work has been published concerning the a...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2002-3-4-research0019

    authors: Landgrebe J,Wurst W,Welzl G

    更新日期:2002-01-01 00:00:00

  • Chromatin Central: towards the comparative proteome by accurate mapping of the yeast proteomic environment.

    abstract:BACKGROUND:Understanding the design logic of living systems requires the understanding and comparison of proteomes. Proteomes define the commonalities between organisms more precisely than genomic sequences. Because uncertainties remain regarding the accuracy of proteomic data, several issues need to be resolved before...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2008-9-11-r167

    authors: Shevchenko A,Roguev A,Schaft D,Buchanan L,Habermann B,Sakalar C,Thomas H,Krogan NJ,Shevchenko A,Stewart AF

    更新日期:2008-01-01 00:00:00

  • FitSNPs: highly differentially expressed genes are more likely to have variants associated with disease.

    abstract:BACKGROUND:Candidate single nucleotide polymorphisms (SNPs) from genome-wide association studies (GWASs) were often selected for validation based on their functional annotation, which was inadequate and biased. We propose to use the more than 200,000 microarray studies in the Gene Expression Omnibus to systematically p...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2008-9-12-r170

    authors: Chen R,Morgan AA,Dudley J,Deshpande T,Li L,Kodama K,Chiang AP,Butte AJ

    更新日期:2008-01-01 00:00:00

  • Discovery and functional prioritization of Parkinson's disease candidate genes from large-scale whole exome sequencing.

    abstract:BACKGROUND:Whole-exome sequencing (WES) has been successful in identifying genes that cause familial Parkinson's disease (PD). However, until now this approach has not been deployed to study large cohorts of unrelated participants. To discover rare PD susceptibility variants, we performed WES in 1148 unrelated cases an...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-017-1147-9

    authors: Jansen IE,Ye H,Heetveld S,Lechler MC,Michels H,Seinstra RI,Lubbe SJ,Drouet V,Lesage S,Majounie E,Gibbs JR,Nalls MA,Ryten M,Botia JA,Vandrovcova J,Simon-Sanchez J,Castillo-Lizardo M,Rizzu P,Blauwendraat C,Chouhan AK

    更新日期:2017-01-30 00:00:00

  • Improved reference genome of the arboviral vector Aedes albopictus.

    abstract:BACKGROUND:The Asian tiger mosquito Aedes albopictus is globally expanding and has become the main vector for human arboviruses in Europe. With limited antiviral drugs and vaccines available, vector control is the primary approach to prevent mosquito-borne diseases. A reliable and accurate DNA sequence of the Ae. albop...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-020-02141-w

    authors: Palatini U,Masri RA,Cosme LV,Koren S,Thibaud-Nissen F,Biedler JK,Krsticevic F,Johnston JS,Halbach R,Crawford JE,Antoshechkin I,Failloux AB,Pischedda E,Marconcini M,Ghurye J,Rhie A,Sharma A,Karagodin DA,Jenrette J,Ga

    更新日期:2020-08-26 00:00:00

  • Mapping human pluripotent stem cell differentiation pathways using high throughput single-cell RNA-sequencing.

    abstract:BACKGROUND:Human pluripotent stem cells (hPSCs) provide powerful models for studying cellular differentiations and unlimited sources of cells for regenerative medicine. However, a comprehensive single-cell level differentiation roadmap for hPSCs has not been achieved. RESULTS:We use high throughput single-cell RNA-seq...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-018-1426-0

    authors: Han X,Chen H,Huang D,Chen H,Fei L,Cheng C,Huang H,Yuan GC,Guo G

    更新日期:2018-04-05 00:00:00

  • Bovine breed-specific augmented reference graphs facilitate accurate sequence read mapping and unbiased variant discovery.

    abstract:BACKGROUND:The current bovine genomic reference sequence was assembled from a Hereford cow. The resulting linear assembly lacks diversity because it does not contain allelic variation, a drawback of linear references that causes reference allele bias. High nucleotide diversity and the separation of individuals by hundr...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-020-02105-0

    authors: Crysnanto D,Pausch H

    更新日期:2020-07-27 00:00:00