Abstract:
BACKGROUND:We describe the distribution of indels in the 44 Encyclopedia of DNA Elements (ENCODE) regions (about 1% of the human genome) and evaluate the potential contributions of small insertion and deletion polymorphisms (indels) to human genetic variation. We relate indels to known genomic annotation features and measures of evolutionary constraint. RESULTS:Indel rates are observed to be reduced approximately 20-fold to 60-fold in exonic regions, 5-fold to 10-fold in sequence that exhibits high evolutionary constraint in mammals, and up to 2-fold in some classes of regulatory elements (for instance, formaldehyde assisted isolation of regulatory elements [FAIRE] and hypersensitive sites). In addition, some noncoding transcription and other chromatin mediated regulatory sites also have reduced indel rates. Overall indel rates for these data are estimated to be smaller than single nucleotide polymorphism (SNP) rates by a factor of approximately 2, with both rates measured as base pairs per 100 kilobases to facilitate comparison. CONCLUSION:Indel rates exhibit a broadly similar distribution across genomic features compared with SNP density rates, with a reduction in rates in coding transcription and evolutionarily constrained sequence. However, unlike indels, SNP rates do not appear to be reduced in some noncoding functional sequences, such as pseudo-exons, and FAIRE and hypersensitive sites. We conclude that indel rates are greatly reduced in transcribed and evolutionarily constrained DNA, and discuss why indel (but not SNP) rates appear to be constrained at some regulatory sites.
journal_name
Genome Bioljournal_title
Genome biologyauthors
Clark TG,Andrew T,Cooper GM,Margulies EH,Mullikin JC,Balding DJdoi
10.1186/gb-2007-8-9-r180subject
Has Abstractpub_date
2007-01-01 00:00:00pages
R180issue
9eissn
1474-7596issn
1474-760Xpii
gb-2007-8-9-r180journal_volume
8pub_type
杂志文章相关文献
GENOME BIOLOGY文献大全abstract::Single-cell RNA-sequencing (scRNA-seq) has emerged as a revolutionary tool that allows us to address scientific questions that eluded examination just a few years ago. With the advantages of scRNA-seq come computational challenges that are just beginning to be addressed. In this article, we highlight the computational...
journal_title:Genome biology
pub_type: 杂志文章,评审
doi:10.1186/s13059-016-0927-y
更新日期:2016-04-07 00:00:00
abstract::Field pathogenomics adds highly informative data to surveillance surveys by enabling rapid evaluation of pathogen variability, population structure and host genotype. ...
journal_title:Genome biology
pub_type: 杂志文章
doi:10.1186/s13059-015-0615-3
更新日期:2015-03-02 00:00:00
abstract::The epigenetic modifications of histones are versatile marks that are intimately connected to development and disease pathogenesis including human cancers. In this review, we will discuss the many different types of histone modifications and the biological processes with which they are involved. Specifically, we revie...
journal_title:Genome biology
pub_type: 杂志文章,评审
doi:10.1186/s13059-019-1870-5
更新日期:2019-11-20 00:00:00
abstract:BACKGROUND:DNA replication plays an important role in mutagenesis, yet little is known about how it interacts with other mutagenic processes. Here, we use somatic mutation signatures-each representing a mutagenic process-derived from 3056 patients spanning 19 cancer types to quantify the strand asymmetry of mutational ...
journal_title:Genome biology
pub_type: 杂志文章
doi:10.1186/s13059-018-1509-y
更新日期:2018-09-10 00:00:00
abstract:BACKGROUND:Tumors comprise a complex microenvironment of interacting malignant and stromal cell types. Much of our understanding of the tumor microenvironment comes from in vitro studies isolating the interactions between malignant cells and a single stromal cell type, often along a single pathway. RESULT:To develop a...
journal_title:Genome biology
pub_type: 杂志文章
doi:10.1186/s13059-020-02019-x
更新日期:2020-05-07 00:00:00
abstract::Rhomboid proteases are the largest family of enzymes that hydrolyze peptide bonds within the cell membrane. Although discovered to be serine proteases only a decade ago, rhomboid proteases are already considered to be the best understood intramembrane proteases. The presence of rhomboid proteins in all domains of life...
journal_title:Genome biology
pub_type: 杂志文章,评审
doi:10.1186/gb-2011-12-10-231
更新日期:2011-10-27 00:00:00
abstract::The accurate placement of an ancient whole-genome duplication (WGD) in relation to the lineage divergence is important. Here, we re-investigated the Aquilegia coerulea WGD and found it is more likely lineage-specific rather than shared by all eudicots. ...
journal_title:Genome biology
pub_type: 杂志文章
doi:10.1186/s13059-020-02212-y
更新日期:2020-12-08 00:00:00
abstract:BACKGROUND:Phylogenetic analysis of the Archaea has been mainly established by 16S rRNA sequence comparison. With the accumulation of completely sequenced genomes, it is now possible to test alternative approaches by using large sequence datasets. We analyzed archaeal phylogeny using two concatenated datasets consistin...
journal_title:Genome biology
pub_type: 杂志文章
doi:10.1186/gb-2004-5-3-r17
更新日期:2004-01-01 00:00:00
abstract::A report on the Plant Genomics European Meeting (Plant-GEMS2004), Lyon, France, 22-25 September 2004. ...
journal_title:Genome biology
pub_type:
doi:10.1186/gb-2004-6-1-302
更新日期:2005-01-01 00:00:00
abstract::The key molecules involved in regulating the assembly and function of the mitotic spindle are shared by evolutionarily divergent species. Studies in different model systems are leading to convergent conclusions about the central role of microtubule nucleation and dynamics and of kinesin-related motor proteins in spind...
journal_title:Genome biology
pub_type: 杂志文章,评审
doi:10.1186/gb-2000-1-1-reviews101
更新日期:2000-01-01 00:00:00
abstract:BACKGROUND:The identification of signatures of natural selection has long been used as an approach to understanding the unique features of any given species. Genes within segmental duplications are overlooked in most studies of selection due to the limitations of draft nonhuman genome assemblies and to the methodologic...
journal_title:Genome biology
pub_type: 杂志文章
doi:10.1186/gb-2013-14-1-r9
更新日期:2013-01-29 00:00:00
abstract::Derived from the maize Mu1 transposon, RescueMu provides strategies for maize gene discovery and mutant phenotypic analysis. 9.92 Mb of gene-enriched sequences next to RescueMu insertion sites were co-assembled with expressed sequence tags and analyzed. Multiple plasmid recoveries identified probable germinal insertio...
journal_title:Genome biology
pub_type: 杂志文章
doi:10.1186/gb-2004-5-10-r82
更新日期:2004-01-01 00:00:00
abstract:BACKGROUND:Foodborne outbreaks of Salmonella remain a pressing public health concern. We recently detected a large outbreak of Salmonella enterica serovar Enteritidis phage type 14b affecting more than 30 patients in our hospital. This outbreak was linked to community, national and European-wide cases. Hospital patient...
journal_title:Genome biology
pub_type: 杂志文章
doi:10.1186/s13059-015-0677-2
更新日期:2015-05-30 00:00:00
abstract::The first genome sequence of the extinct European wild aurochs reveals the genetic foundation of native British and Irish landraces of cattle.See related Research article: www.dx.doi.org/10.1186/s13059-015-0790-2. ...
journal_title:Genome biology
pub_type: 评论,杂志文章
doi:10.1186/s13059-015-0793-z
更新日期:2015-10-26 00:00:00
abstract::The subcellular localization of the entire proteome of an organism, the yeast Saccharomyces cerevisiae, has been revealed for the first time. Comparison with less comprehensive studies of mammalian cells provides insights into the localization of the mammalian proteome. ...
journal_title:Genome biology
pub_type: 杂志文章,评审
doi:10.1186/gb-2003-4-12-240
更新日期:2003-01-01 00:00:00
abstract::Advances in sequencing technology have led to a sharp decrease in the cost of 'data generation'. But is this sufficient to ensure cost-effective and efficient 'knowledge generation'? ...
journal_title:Genome biology
pub_type: 杂志文章
doi:10.1186/gb-2011-12-8-125
更新日期:2011-08-25 00:00:00
abstract:BACKGROUND:Synthetic genetic interactions have recently been mapped on a genome scale in the budding yeast Saccharomyces cerevisiae, providing a functional view of the central processes of eukaryotic life. Currently, comprehensive genetic interaction networks have not been determined for other species, and we therefore...
journal_title:Genome biology
pub_type: 杂志文章
doi:10.1186/gb-2012-13-7-r57
更新日期:2012-07-02 00:00:00
abstract:BACKGROUND:Molecular characterization of tumors has been critical for identifying important genes in cancer biology and for improving tumor classification and diagnosis. Long non-coding RNAs, as a new, relatively unstudied class of transcripts, provide a rich opportunity to identify both functional drivers and cancer-t...
journal_title:Genome biology
pub_type: 杂志文章
doi:10.1186/gb-2012-13-8-r75
更新日期:2012-08-28 00:00:00
abstract::DNA sequence data are being produced at an ever-increasing rate. The Bowtie sequence-alignment algorithm uses advanced data structures to help data analysis keep pace with data generation. ...
journal_title:Genome biology
pub_type: 评论,杂志文章
doi:10.1186/gb-2009-10-3-212
更新日期:2009-01-01 00:00:00
abstract:BACKGROUND:Sensing and responding to ambient temperature is important for controlling growth and development of many organisms, in part by regulating mRNA levels. mRNA abundance can change with temperature, but it is unclear whether this results from changes in transcription or decay rates, and whether passive or activ...
journal_title:Genome biology
pub_type: 杂志文章
doi:10.1186/gb-2014-15-3-r45
更新日期:2014-03-03 00:00:00
abstract::Gene regulatory networks (GRNs) are rapidly being delineated, but their quality and biological meaning are often questioned. Here, I argue that biological meaning is challenging to define and discuss reasons why GRN validation should be interpreted cautiously. ...
journal_title:Genome biology
pub_type: 杂志文章
doi:10.1186/gb-2011-12-4-109
更新日期:2011-01-01 00:00:00
abstract::We describe an ontology for cell types that covers the prokaryotic, fungal, animal and plant worlds. It includes over 680 cell types. These cell types are classified under several generic categories and are organized as a directed acyclic graph. The ontology is available in the formats adopted by the Open Biological O...
journal_title:Genome biology
pub_type: 杂志文章
doi:10.1186/gb-2005-6-2-r21
更新日期:2005-01-01 00:00:00
abstract:BACKGROUND:One challenge facing biologists is to tease out useful information from massive data sets for further analysis. A pathway-based analysis may shed light by projecting candidate genes onto protein functional relationship networks. We are building such a pathway-based analysis system. RESULTS:We have construct...
journal_title:Genome biology
pub_type: 杂志文章
doi:10.1186/gb-2010-11-5-r53
更新日期:2010-01-01 00:00:00
abstract::How the mechanisms of dosage compensation distinguish the sex chromosomes from the autosomes has been something of a mystery. A recent study in Caenorhabditis elegans has identified clusters of two common DNA motifs as a cis-acting code for the recruitment of the DCC, the protein complex that mediates dosage compensat...
journal_title:Genome biology
pub_type: 杂志文章,评审
doi:10.1186/gb-2007-8-2-204
更新日期:2007-01-01 00:00:00
abstract::Recent genomic studies showing abnormalities in the fibroblast growth factor system in the postmortem brains of people with major depressive disorder support previous indications of a role for growth factors in mood disorders. Similar molecular pathways, volumetric changes, and the effects of exercise on mood suggest ...
journal_title:Genome biology
pub_type: 杂志文章,评审
doi:10.1186/gb-2005-6-4-215
更新日期:2005-01-01 00:00:00
abstract:BACKGROUND:Xanthomonas campestris pathovar campestris (Xcc) is the causal agent of black rot disease of crucifers worldwide. The molecular genetic diversity and host specificity of Xcc are poorly understood. RESULTS:We constructed a microarray based on the complete genome sequence of Xcc strain 8004 and investigated t...
journal_title:Genome biology
pub_type: 杂志文章
doi:10.1186/gb-2007-8-10-r218
更新日期:2007-01-01 00:00:00
abstract:BACKGROUND:RNA sequencing has opened new avenues for the study of transcriptome composition. Significant evidence has accumulated showing that the human transcriptome contains in excess of a hundred thousand different transcripts. However, it is still not clear to what extent this diversity prevails when considering th...
journal_title:Genome biology
pub_type: 杂志文章
doi:10.1186/gb-2013-14-7-r70
更新日期:2013-07-01 00:00:00
abstract::As the cost of sequencing continues to decrease and the amount of sequence data generated grows, new paradigms for data storage and analysis are increasingly important. The relative scaling behavior of these evolving technologies will impact genomics research moving forward. ...
journal_title:Genome biology
pub_type: 杂志文章
doi:10.1186/s13059-016-0917-0
更新日期:2016-03-23 00:00:00
abstract:BACKGROUND:Mutations in genes encoding ribosomal proteins (RPs) have been shown to cause an array of cellular and developmental defects in a variety of organisms. In Drosophila melanogaster, disruption of RP genes can result in the 'Minute' syndrome of dominant, haploinsufficient phenotypes, which include prolonged dev...
journal_title:Genome biology
pub_type: 杂志文章
doi:10.1186/gb-2007-8-10-r216
更新日期:2007-01-01 00:00:00
abstract::A report of the First Golden Helix Symposium 'Copy Number Variation (CNV) and Genomic Alterations in Health and Disease', Athens, Greece, 28-29 November 2008. ...
journal_title:Genome biology
pub_type:
doi:10.1186/gb-2009-10-1-301
更新日期:2009-01-01 00:00:00