Functional constraint and small insertions and deletions in the ENCODE regions of the human genome.

Abstract:

BACKGROUND: We describe the distribution of indels in the 44 Encyclopedia of DNA Elements (ENCODE) regions (about 1% of the human genome) and evaluate the potential contributions of small insertion and deletion polymorphisms (indels) to human genetic variation. We relate indels to known genomic annotation features and measures of evolutionary constraint. RESULTS: Indel rates are observed to be reduced approximately 20-fold to 60-fold in exonic regions, 5-fold to 10-fold in sequence that exhibits high evolutionary constraint in mammals, and up to 2-fold in some classes of regulatory elements (for instance, formaldehyde-assisted isolation of regulatory elements [FAIRE] and hypersensitive sites). In addition, some noncoding transcription and other chromatin-mediated regulatory sites also have reduced indel rates. Overall indel rates for these data are estimated to be smaller than single nucleotide polymorphism (SNP) rates by a factor of approximately 2, with both rates measured as base pairs per 100 kilobases to facilitate comparison. CONCLUSION: Indel rates exhibit a broadly similar distribution across genomic features compared with SNP density rates, with a reduction in rates in coding transcription and evolutionarily constrained sequence. However, unlike indel rates, SNP rates do not appear to be reduced in some noncoding functional sequences, such as pseudo-exons, and FAIRE and hypersensitive sites. We conclude that indel rates are greatly reduced in transcribed and evolutionarily constrained DNA, and discuss why indel (but not SNP) rates appear to be constrained at some regulatory sites.

journal_name

Genome Biol

journal_title

Genome biology

authors

Clark TG,Andrew T,Cooper GM,Margulies EH,Mullikin JC,Balding DJ

doi

10.1186/gb-2007-8-9-r180

subject

Has Abstract

pub_date

2007-01-01 00:00:00

pages

R180

issue

9

eissn

1474-760X

issn

1474-7596

pii

gb-2007-8-9-r180

journal_volume

8

pub_type

Journal Article
  • Design and computational analysis of single-cell RNA-sequencing experiments.

    abstract::Single-cell RNA-sequencing (scRNA-seq) has emerged as a revolutionary tool that allows us to address scientific questions that eluded examination just a few years ago. With the advantages of scRNA-seq come computational challenges that are just beginning to be addressed. In this article, we highlight the computational...

    journal_title:Genome biology

    pub_type: Journal Article, Review

    doi:10.1186/s13059-016-0927-y

    authors: Bacher R,Kendziorski C

    update_date:2016-04-07 00:00:00

  • Wheat rusts never sleep but neither do sequencers: will pathogenomics transform the way plant diseases are managed?

    abstract::Field pathogenomics adds highly informative data to surveillance surveys by enabling rapid evaluation of pathogen variability, population structure and host genotype. ...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/s13059-015-0615-3

    authors: Derevnina L,Michelmore RW

    update_date:2015-03-02 00:00:00

  • Epigenetic modifications of histones in cancer.

    abstract::The epigenetic modifications of histones are versatile marks that are intimately connected to development and disease pathogenesis including human cancers. In this review, we will discuss the many different types of histone modifications and the biological processes with which they are involved. Specifically, we revie...

    journal_title:Genome biology

    pub_type: Journal Article, Review

    doi:10.1186/s13059-019-1870-5

    authors: Zhao Z,Shilatifard A

    update_date:2019-11-20 00:00:00

  • Mutational signature distribution varies with DNA replication timing and strand asymmetry.

    abstract:BACKGROUND:DNA replication plays an important role in mutagenesis, yet little is known about how it interacts with other mutagenic processes. Here, we use somatic mutation signatures-each representing a mutagenic process-derived from 3056 patients spanning 19 cancer types to quantify the strand asymmetry of mutational ...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/s13059-018-1509-y

    authors: Tomkova M,Tomek J,Kriaucionis S,Schuster-Böckler B

    update_date:2018-09-10 00:00:00

  • A human lung tumor microenvironment interactome identifies clinically relevant cell-type cross-talk.

    abstract:BACKGROUND:Tumors comprise a complex microenvironment of interacting malignant and stromal cell types. Much of our understanding of the tumor microenvironment comes from in vitro studies isolating the interactions between malignant cells and a single stromal cell type, often along a single pathway. RESULT:To develop a...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/s13059-020-02019-x

    authors: Gentles AJ,Hui AB,Feng W,Azizi A,Nair RV,Bouchard G,Knowles DA,Yu A,Jeong Y,Bejnood A,Forgó E,Varma S,Xu Y,Kuong A,Nair VS,West R,van de Rijn M,Hoang CD,Diehn M,Plevritis SK

    update_date:2020-05-07 00:00:00

  • The rhomboid protease family: a decade of progress on function and mechanism.

    abstract::Rhomboid proteases are the largest family of enzymes that hydrolyze peptide bonds within the cell membrane. Although discovered to be serine proteases only a decade ago, rhomboid proteases are already considered to be the best understood intramembrane proteases. The presence of rhomboid proteins in all domains of life...

    journal_title:Genome biology

    pub_type: Journal Article, Review

    doi:10.1186/gb-2011-12-10-231

    authors: Urban S,Dickey SW

    update_date:2011-10-27 00:00:00

  • A reappraisal of the phylogenetic placement of the Aquilegia whole-genome duplication.

    abstract::The accurate placement of an ancient whole-genome duplication (WGD) in relation to the lineage divergence is important. Here, we re-investigated the Aquilegia coerulea WGD and found it is more likely lineage-specific rather than shared by all eudicots. ...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/s13059-020-02212-y

    authors: Shi T,Chen J

    update_date:2020-12-08 00:00:00

  • Archaeal phylogeny based on proteins of the transcription and translation machineries: tackling the Methanopyrus kandleri paradox.

    abstract:BACKGROUND:Phylogenetic analysis of the Archaea has been mainly established by 16S rRNA sequence comparison. With the accumulation of completely sequenced genomes, it is now possible to test alternative approaches by using large sequence datasets. We analyzed archaeal phylogeny using two concatenated datasets consistin...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2004-5-3-r17

    authors: Brochier C,Forterre P,Gribaldo S

    update_date:2004-01-01 00:00:00

  • 'Horizontal' plant biology on the rise.

    abstract::A report on the Plant Genomics European Meeting (Plant-GEMS2004), Lyon, France, 22-25 September 2004. ...

    journal_title:Genome biology

    pub_type:

    doi:10.1186/gb-2004-6-1-302

    authors: Van de Peer Y

    update_date:2005-01-01 00:00:00

  • Molecular mechanisms of spindle function.

    abstract::The key molecules involved in regulating the assembly and function of the mitotic spindle are shared by evolutionarily divergent species. Studies in different model systems are leading to convergent conclusions about the central role of microtubule nucleation and dynamics and of kinesin-related motor proteins in spind...

    journal_title:Genome biology

    pub_type: Journal Article, Review

    doi:10.1186/gb-2000-1-1-reviews101

    authors: Walczak CE

    update_date:2000-01-01 00:00:00

  • Accelerated exon evolution within primate segmental duplications.

    abstract:BACKGROUND:The identification of signatures of natural selection has long been used as an approach to understanding the unique features of any given species. Genes within segmental duplications are overlooked in most studies of selection due to the limitations of draft nonhuman genome assemblies and to the methodologic...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2013-14-1-r9

    authors: Lorente-Galdos B,Bleyhl J,Santpere G,Vives L,Ramírez O,Hernandez J,Anglada R,Cooper GM,Navarro A,Eichler EE,Marques-Bonet T

    update_date:2013-01-29 00:00:00

  • Genome-wide mutagenesis of Zea mays L. using RescueMu transposons.

    abstract::Derived from the maize Mu1 transposon, RescueMu provides strategies for maize gene discovery and mutant phenotypic analysis. 9.92 Mb of gene-enriched sequences next to RescueMu insertion sites were co-assembled with expressed sequence tags and analyzed. Multiple plasmid recoveries identified probable germinal insertio...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2004-5-10-r82

    authors: Fernandes J,Dong Q,Schneider B,Morrow DJ,Nan GL,Brendel V,Walbot V

    update_date:2004-01-01 00:00:00

  • Rapid draft sequencing and real-time nanopore sequencing in a hospital outbreak of Salmonella.

    abstract:BACKGROUND:Foodborne outbreaks of Salmonella remain a pressing public health concern. We recently detected a large outbreak of Salmonella enterica serovar Enteritidis phage type 14b affecting more than 30 patients in our hospital. This outbreak was linked to community, national and European-wide cases. Hospital patient...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/s13059-015-0677-2

    authors: Quick J,Ashton P,Calus S,Chatt C,Gossain S,Hawker J,Nair S,Neal K,Nye K,Peters T,De Pinna E,Robinson E,Struthers K,Webber M,Catto A,Dallman TJ,Hawkey P,Loman NJ

    update_date:2015-05-30 00:00:00

  • The first aurochs genome reveals the breeding history of British and European cattle.

    abstract::The first genome sequence of the extinct European wild aurochs reveals the genetic foundation of native British and Irish landraces of cattle. See related Research article: www.dx.doi.org/10.1186/s13059-015-0790-2. ...

    journal_title:Genome biology

    pub_type: Comment, Journal Article

    doi:10.1186/s13059-015-0793-z

    authors: Orlando L

    update_date:2015-10-26 00:00:00

  • Localizing the proteome.

    abstract::The subcellular localization of the entire proteome of an organism, the yeast Saccharomyces cerevisiae, has been revealed for the first time. Comparison with less comprehensive studies of mammalian cells provides insights into the localization of the mammalian proteome. ...

    journal_title:Genome biology

    pub_type: Journal Article, Review

    doi:10.1186/gb-2003-4-12-240

    authors: Simpson JC,Pepperkok R

    update_date:2003-01-01 00:00:00

  • The real cost of sequencing: higher than you think!

    abstract::Advances in sequencing technology have led to a sharp decrease in the cost of 'data generation'. But is this sufficient to ensure cost-effective and efficient 'knowledge generation'? ...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2011-12-8-125

    authors: Sboner A,Mu XJ,Greenbaum D,Auerbach RK,Gerstein MB

    update_date:2011-08-25 00:00:00

  • Conserved rules govern genetic interaction degree across species.

    abstract:BACKGROUND:Synthetic genetic interactions have recently been mapped on a genome scale in the budding yeast Saccharomyces cerevisiae, providing a functional view of the central processes of eukaryotic life. Currently, comprehensive genetic interaction networks have not been determined for other species, and we therefore...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2012-13-7-r57

    authors: Koch EN,Costanzo M,Bellay J,Deshpande R,Chatfield-Reed K,Chua G,D'Urso G,Andrews BJ,Boone C,Myers CL

    update_date:2012-07-02 00:00:00

  • Transcriptional profiling of long non-coding RNAs and novel transcribed regions across a diverse panel of archived human cancers.

    abstract:BACKGROUND:Molecular characterization of tumors has been critical for identifying important genes in cancer biology and for improving tumor classification and diagnosis. Long non-coding RNAs, as a new, relatively unstudied class of transcripts, provide a rich opportunity to identify both functional drivers and cancer-t...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2012-13-8-r75

    authors: Brunner AL,Beck AH,Edris B,Sweeney RT,Zhu SX,Li R,Montgomery K,Varma S,Gilks T,Guo X,Foley JW,Witten DM,Giacomini CP,Flynn RA,Pollack JR,Tibshirani R,Chang HY,van de Rijn M,West RB

    update_date:2012-08-28 00:00:00

  • The need for speed.

    abstract::DNA sequence data are being produced at an ever-increasing rate. The Bowtie sequence-alignment algorithm uses advanced data structures to help data analysis keep pace with data generation. ...

    journal_title:Genome biology

    pub_type: Comment, Journal Article

    doi:10.1186/gb-2009-10-3-212

    authors: Flicek P

    update_date:2009-01-01 00:00:00

  • Direct measurement of transcription rates reveals multiple mechanisms for configuration of the Arabidopsis ambient temperature response.

    abstract:BACKGROUND:Sensing and responding to ambient temperature is important for controlling growth and development of many organisms, in part by regulating mRNA levels. mRNA abundance can change with temperature, but it is unclear whether this results from changes in transcription or decay rates, and whether passive or activ...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2014-15-3-r45

    authors: Sidaway-Lee K,Costa MJ,Rand DA,Finkenstadt B,Penfield S

    update_date:2014-03-03 00:00:00

  • What does biologically meaningful mean? A perspective on gene regulatory network validation.

    abstract::Gene regulatory networks (GRNs) are rapidly being delineated, but their quality and biological meaning are often questioned. Here, I argue that biological meaning is challenging to define and discuss reasons why GRN validation should be interpreted cautiously. ...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2011-12-4-109

    authors: Walhout AJ

    update_date:2011-01-01 00:00:00

  • An ontology for cell types.

    abstract::We describe an ontology for cell types that covers the prokaryotic, fungal, animal and plant worlds. It includes over 680 cell types. These cell types are classified under several generic categories and are organized as a directed acyclic graph. The ontology is available in the formats adopted by the Open Biological O...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2005-6-2-r21

    authors: Bard J,Rhee SY,Ashburner M

    update_date:2005-01-01 00:00:00

  • A human functional protein interaction network and its application to cancer data analysis.

    abstract:BACKGROUND:One challenge facing biologists is to tease out useful information from massive data sets for further analysis. A pathway-based analysis may shed light by projecting candidate genes onto protein functional relationship networks. We are building such a pathway-based analysis system. RESULTS:We have construct...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2010-11-5-r53

    authors: Wu G,Feng X,Stein L

    update_date:2010-01-01 00:00:00

  • Decoding dosage compensation.

    abstract::How the mechanisms of dosage compensation distinguish the sex chromosomes from the autosomes has been something of a mystery. A recent study in Caenorhabditis elegans has identified clusters of two common DNA motifs as a cis-acting code for the recruitment of the DCC, the protein complex that mediates dosage compensat...

    journal_title:Genome biology

    pub_type: Journal Article, Review

    doi:10.1186/gb-2007-8-2-204

    authors: Deng X,Disteche CM

    update_date:2007-01-01 00:00:00

  • Genomic studies of mood disorders -- the brain as a muscle?

    abstract::Recent genomic studies showing abnormalities in the fibroblast growth factor system in the postmortem brains of people with major depressive disorder support previous indications of a role for growth factors in mood disorders. Similar molecular pathways, volumetric changes, and the effects of exercise on mood suggest ...

    journal_title:Genome biology

    pub_type: Journal Article, Review

    doi:10.1186/gb-2005-6-4-215

    authors: Niculescu AB

    update_date:2005-01-01 00:00:00

  • Comparative and functional genomics reveals genetic diversity and determinants of host specificity among reference strains and a large collection of Chinese isolates of the phytopathogen Xanthomonas campestris pv. campestris.

    abstract:BACKGROUND:Xanthomonas campestris pathovar campestris (Xcc) is the causal agent of black rot disease of crucifers worldwide. The molecular genetic diversity and host specificity of Xcc are poorly understood. RESULTS:We constructed a microarray based on the complete genome sequence of Xcc strain 8004 and investigated t...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2007-8-10-r218

    authors: He YQ,Zhang L,Jiang BL,Zhang ZC,Xu RQ,Tang DJ,Qin J,Jiang W,Zhang X,Liao J,Cao JR,Zhang SS,Wei ML,Liang XX,Lu GT,Feng JX,Chen B,Cheng J,Tang JL

    update_date:2007-01-01 00:00:00

  • Transcriptome analysis of human tissues and cell lines reveals one dominant transcript per gene.

    abstract:BACKGROUND:RNA sequencing has opened new avenues for the study of transcriptome composition. Significant evidence has accumulated showing that the human transcriptome contains in excess of a hundred thousand different transcripts. However, it is still not clear to what extent this diversity prevails when considering th...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2013-14-7-r70

    authors: Gonzàlez-Porta M,Frankish A,Rung J,Harrow J,Brazma A

    update_date:2013-07-01 00:00:00

  • The real cost of sequencing: scaling computation to keep pace with data generation.

    abstract::As the cost of sequencing continues to decrease and the amount of sequence data generated grows, new paradigms for data storage and analysis are increasingly important. The relative scaling behavior of these evolving technologies will impact genomics research moving forward. ...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/s13059-016-0917-0

    authors: Muir P,Li S,Lou S,Wang D,Spakowicz DJ,Salichos L,Zhang J,Weinstock GM,Isaacs F,Rozowsky J,Gerstein M

    update_date:2016-03-23 00:00:00

  • The ribosomal protein genes and Minute loci of Drosophila melanogaster.

    abstract:BACKGROUND:Mutations in genes encoding ribosomal proteins (RPs) have been shown to cause an array of cellular and developmental defects in a variety of organisms. In Drosophila melanogaster, disruption of RP genes can result in the 'Minute' syndrome of dominant, haploinsufficient phenotypes, which include prolonged dev...

    journal_title:Genome biology

    pub_type: Journal Article

    doi:10.1186/gb-2007-8-10-r216

    authors: Marygold SJ,Roote J,Reuter G,Lambertsson A,Ashburner M,Millburn GH,Harrison PM,Yu Z,Kenmochi N,Kaufman TC,Leevers SJ,Cook KR

    update_date:2007-01-01 00:00:00

  • Copy number variation goes clinical.

    abstract::A report of the First Golden Helix Symposium 'Copy Number Variation (CNV) and Genomic Alterations in Health and Disease', Athens, Greece, 28-29 November 2008. ...

    journal_title:Genome biology

    pub_type:

    doi:10.1186/gb-2009-10-1-301

    authors: Le Caignec C,Redon R

    update_date:2009-01-01 00:00:00