Domain atrophy creates rare cases of functional partial protein domains.

Abstract:

BACKGROUND:Protein domains display a range of structural diversity, with numerous additions and deletions of secondary structural elements between related domains. We have observed a small number of cases of surprising large-scale deletions of core elements of structural domains. We propose a new concept called domain atrophy, where protein domains lose a significant number of core structural elements. RESULTS:Here, we implement a new pipeline to systematically identify new cases of domain atrophy across all known protein sequences. The output of this pipeline was carefully checked by hand, which filtered out partial domain instances that were unlikely to represent true domain atrophy due to misannotations or un-annotated sequence fragments. We identify 75 cases of domain atrophy, of which eight cases are found in a three-dimensional protein structure and 67 cases have been inferred based on mapping to a known homologous structure. Domains with structural variations include ancient folds such as the TIM-barrel and Rossmann folds. Most of these domains are observed to show structural loss that does not affect their functional sites. CONCLUSION:Our analysis has significantly increased the known cases of domain atrophy. We discuss specific instances of domain atrophy and see that there has often been a compensatory mechanism that helps to maintain the stability of the partial domain. Our study indicates that although domain atrophy is an extremely rare phenomenon, protein domains under certain circumstances can tolerate extreme mutations giving rise to partial, but functional, domains.

journal_name

Genome Biol

journal_title

Genome biology

authors

Prakash A,Bateman A

doi

10.1186/s13059-015-0655-8

subject

Has Abstract

pub_date

2015-04-30 00:00:00

pages

88

eissn

1474-7596

issn

1474-760X

pii

10.1186/s13059-015-0655-8

journal_volume

16

pub_type

杂志文章
  • High tandem repeat content in the genome of the short-lived annual fish Nothobranchius furzeri: a new vertebrate model for aging research.

    abstract:BACKGROUND:The annual fish Nothobranchius furzeri is the vertebrate with the shortest known life span in captivity. Fish of the GRZ strain live only three to four months under optimal laboratory conditions, show explosive growth, early sexual maturation and age-dependent physiological and behavioral decline, and expres...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2009-10-2-r16

    authors: Reichwald K,Lauber C,Nanda I,Kirschner J,Hartmann N,Schories S,Gausmann U,Taudien S,Schilhabel MB,Szafranski K,Glöckner G,Schmid M,Cellerino A,Schartl M,Englert C,Platzer M

    更新日期:2009-02-11 00:00:00

  • The first whole genome and transcriptome of the cinereous vulture reveals adaptation in the gastric and immune defense systems and possible convergent evolution between the Old and New World vultures.

    abstract:BACKGROUND:The cinereous vulture, Aegypius monachus, is the largest bird of prey and plays a key role in the ecosystem by removing carcasses, thus preventing the spread of diseases. Its feeding habits force it to cope with constant exposure to pathogens, making this species an interesting target for discovering functio...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-015-0780-4

    authors: Chung O,Jin S,Cho YS,Lim J,Kim H,Jho S,Kim HM,Jun J,Lee H,Chon A,Ko J,Edwards J,Weber JA,Han K,O'Brien SJ,Manica A,Bhak J,Paek WK

    更新日期:2015-10-21 00:00:00

  • Constitutive patterns of gene expression regulated by RNA-binding proteins.

    abstract:BACKGROUND:RNA-binding proteins regulate a number of cellular processes, including synthesis, folding, translocation, assembly and clearance of RNAs. Recent studies have reported that an unexpectedly large number of proteins are able to interact with RNA, but the partners of many RNA-binding proteins are still uncharac...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2014-15-1-r13

    authors: Cirillo D,Marchese D,Agostini F,Livi CM,Botta-Orfila T,Tartaglia GG

    更新日期:2014-01-02 00:00:00

  • Discovery of estrogen receptor alpha target genes and response elements in breast tumor cells.

    abstract:BACKGROUND:Estrogens and their receptors are important in human development, physiology and disease. In this study, we utilized an integrated genome-wide molecular and computational approach to characterize the interaction between the activated estrogen receptor (ER) and the regulatory elements of candidate target gene...

    journal_title:Genome biology

    pub_type: 杂志文章,meta分析

    doi:10.1186/gb-2004-5-9-r66

    authors: Lin CY,Ström A,Vega VB,Kong SL,Yeo AL,Thomsen JS,Chan WC,Doray B,Bangarusamy DK,Ramasamy A,Vergara LA,Tang S,Chong A,Bajic VB,Miller LD,Gustafsson JA,Liu ET

    更新日期:2004-01-01 00:00:00

  • Muscular expressions: profiling genes in complex tissues.

    abstract::Gene-expression profiling has yielded important information about simple systems, but complex tissues have not yet been widely profiled. Four recent studies of mammalian skeletal muscles have added to the catalogs of their gene expression differences, but have yet to lead to better understanding of the molecular proce...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2001-2-12-reviews1033

    authors: Hampson R,Hughes SM

    更新日期:2001-01-01 00:00:00

  • Responses of hyperthermophilic crenarchaea to UV irradiation.

    abstract:BACKGROUND:DNA damage leads to cellular responses that include the increased expression of DNA repair genes, repression of DNA replication and alterations in cellular metabolism. Archaeal information processing pathways resemble those in eukaryotes, but archaeal damage response pathways remain poorly understood. RESUL...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2007-8-10-r220

    authors: Götz D,Paytubi S,Munro S,Lundgren M,Bernander R,White MF

    更新日期:2007-01-01 00:00:00

  • The real cost of sequencing: scaling computation to keep pace with data generation.

    abstract::As the cost of sequencing continues to decrease and the amount of sequence data generated grows, new paradigms for data storage and analysis are increasingly important. The relative scaling behavior of these evolving technologies will impact genomics research moving forward. ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-016-0917-0

    authors: Muir P,Li S,Lou S,Wang D,Spakowicz DJ,Salichos L,Zhang J,Weinstock GM,Isaacs F,Rozowsky J,Gerstein M

    更新日期:2016-03-23 00:00:00

  • Serum-dependent transcriptional networks identify distinct functional roles for H-Ras and N-Ras during initial stages of the cell cycle.

    abstract:BACKGROUND:Using oligonucleotide microarrays, we compared transcriptional profiles corresponding to the initial cell cycle stages of mouse fibroblasts lacking the small GTPases H-Ras and/or N-Ras with those of matching, wild-type controls. RESULTS:Serum-starved wild-type and knockout ras fibroblasts had very similar t...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2009-10-11-r123

    authors: Castellano E,Guerrero C,Núñez A,De Las Rivas J,Santos E

    更新日期:2009-01-01 00:00:00

  • Large-scale and high-confidence proteomic analysis of human seminal plasma.

    abstract:BACKGROUND:The development of mass spectrometric (MS) techniques now allows the investigation of very complex protein mixtures ranging from subcellular structures to tissues. Body fluids are also popular targets of proteomic analysis because of their potential for biomarker discovery. Seminal plasma has not yet receive...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2006-7-5-r40

    authors: Pilch B,Mann M

    更新日期:2006-01-01 00:00:00

  • Single-cell sequencing reveals karyotype heterogeneity in murine and human malignancies.

    abstract:BACKGROUND:Chromosome instability leads to aneuploidy, a state in which cells have abnormal numbers of chromosomes, and is found in two out of three cancers. In a chromosomal instable p53 deficient mouse model with accelerated lymphomagenesis, we previously observed whole chromosome copy number changes affecting all ly...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-016-0971-7

    authors: Bakker B,Taudt A,Belderbos ME,Porubsky D,Spierings DC,de Jong TV,Halsema N,Kazemier HG,Hoekstra-Wakker K,Bradley A,de Bont ES,van den Berg A,Guryev V,Lansdorp PM,Colomé-Tatché M,Foijer F

    更新日期:2016-05-31 00:00:00

  • A Drosophila protein-interaction map centered on cell-cycle regulators.

    abstract:BACKGROUND:Maps depicting binary interactions between proteins can be powerful starting points for understanding biological systems. A proven technology for generating such maps is high-throughput yeast two-hybrid screening. In the most extensive screen to date, a Gal4-based two-hybrid system was used recently to detec...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2004-5-12-r96

    authors: Stanyon CA,Liu G,Mangiola BA,Patel N,Giot L,Kuang B,Zhang H,Zhong J,Finley RL Jr

    更新日期:2004-01-01 00:00:00

  • Networks for all.

    abstract::A report on the Cold Spring Harbor Laboratory/Wellcome Trust conference on Network Biology, Hinxton, UK, 27-31 August 2008. ...

    journal_title:Genome biology

    pub_type:

    doi:10.1186/gb-2008-9-10-324

    authors: Ahnert SE,Teichmann SA

    更新日期:2008-10-27 00:00:00

  • CircAtlas: an integrated resource of one million highly accurate circular RNAs from 1070 vertebrate transcriptomes.

    abstract::Existing circular RNA (circRNA) databases have become essential for transcriptomics. However, most are unsuitable for mining in-depth information for candidate circRNA prioritization. To address this, we integrate circular transcript collections to develop the circAtlas database based on 1070 RNA-seq samples collected...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-020-02018-y

    authors: Wu W,Ji P,Zhao F

    更新日期:2020-04-28 00:00:00

  • AlphaBeta: computational inference of epimutation rates and spectra from high-throughput DNA methylation data in plants.

    abstract::Stochastic changes in DNA methylation (i.e., spontaneous epimutations) contribute to methylome diversity in plants. Here, we describe AlphaBeta, a computational method for estimating the precise rate of such stochastic events using pedigree-based DNA methylation data as input. We demonstrate how AlphaBeta can be emplo...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-020-02161-6

    authors: Shahryary Y,Symeonidi A,Hazarika RR,Denkena J,Mubeen T,Hofmeister B,van Gurp T,Colomé-Tatché M,Verhoeven KJF,Tuskan G,Schmitz RJ,Johannes F

    更新日期:2020-10-06 00:00:00

  • The functional spectrum of low-frequency coding variation.

    abstract:BACKGROUND:Rare coding variants constitute an important class of human genetic variation, but are underrepresented in current databases that are based on small population samples. Recent studies show that variants altering amino acid sequence and protein function are enriched at low variant allele frequency, 2 to 5%, b...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2011-12-9-r84

    authors: Marth GT,Yu F,Indap AR,Garimella K,Gravel S,Leong WF,Tyler-Smith C,Bainbridge M,Blackwell T,Zheng-Bradley X,Chen Y,Challis D,Clarke L,Ball EV,Cibulskis K,Cooper DN,Fulton B,Hartl C,Koboldt D,Muzny D,Smith R,Soug

    更新日期:2011-09-14 00:00:00

  • Detection and analysis of alternative splicing in Yarrowia lipolytica reveal structural constraints facilitating nonsense-mediated decay of intron-retaining transcripts.

    abstract:BACKGROUND:Hemiascomycetous yeasts have intron-poor genomes with very few cases of alternative splicing. Most of the reported examples result from intron retention in Saccharomyces cerevisiae and some have been shown to be functionally significant. Here we used transcriptome-wide approaches to evaluate the mechanisms u...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2010-11-6-r65

    authors: Mekouar M,Blanc-Lenfle I,Ozanne C,Da Silva C,Cruaud C,Wincker P,Gaillardin C,Neuvéglise C

    更新日期:2010-01-01 00:00:00

  • The rhomboid protease family: a decade of progress on function and mechanism.

    abstract::Rhomboid proteases are the largest family of enzymes that hydrolyze peptide bonds within the cell membrane. Although discovered to be serine proteases only a decade ago, rhomboid proteases are already considered to be the best understood intramembrane proteases. The presence of rhomboid proteins in all domains of life...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2011-12-10-231

    authors: Urban S,Dickey SW

    更新日期:2011-10-27 00:00:00

  • Functional variation in allelic methylomes underscores a strong genetic contribution and reveals novel epigenetic alterations in the human epigenome.

    abstract:BACKGROUND:The functional impact of genetic variation has been extensively surveyed, revealing that genetic changes correlated to phenotypes lie mostly in non-coding genomic regions. Studies have linked allele-specific genetic changes to gene expression, DNA methylation, and histone marks but these investigations have ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-017-1173-7

    authors: Cheung WA,Shao X,Morin A,Siroux V,Kwan T,Ge B,Aïssi D,Chen L,Vasquez L,Allum F,Guénard F,Bouzigon E,Simon MM,Boulier E,Redensek A,Watt S,Datta A,Clarke L,Flicek P,Mead D,Paul DS,Beck S,Bourque G,Lathrop M,

    更新日期:2017-03-10 00:00:00

  • Studying alternative splicing regulatory networks through partial correlation analysis.

    abstract:BACKGROUND:Alternative pre-mRNA splicing is an important gene regulation mechanism for expanding proteomic diversity in higher eukaryotes. Each splicing regulator can potentially influence a large group of alternative exons. Meanwhile, each alternative exon is controlled by multiple splicing regulators. The rapid accum...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2009-10-1-r3

    authors: Chen L,Zheng S

    更新日期:2009-01-01 00:00:00

  • From biological clock to biological rhythms.

    abstract::The genetic and molecular analysis of circadian timekeeping mechanisms has accelerated as a result of the increasing volume of genomic markers and nucleotide sequence information. Completion of whole genome sequences and the use of differential gene expression technology will hasten the discovery of the clock output p...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2000-1-4-reviews1023

    authors: Hardin PE

    更新日期:2000-01-01 00:00:00

  • Characterization of the immunophenotypes and antigenomes of colorectal cancers reveals distinct tumor escape mechanisms and novel targets for immunotherapy.

    abstract:BACKGROUND:While large-scale cancer genomic projects are comprehensively characterizing the mutational spectrum of various cancers, so far little attention has been devoted to either define the antigenicity of these mutations or to characterize the immune responses they elicit. Here we present a strategy to characteriz...

    journal_title:Genome biology

    pub_type: 杂志文章,meta分析

    doi:10.1186/s13059-015-0620-6

    authors: Angelova M,Charoentong P,Hackl H,Fischer ML,Snajder R,Krogsdam AM,Waldner MJ,Bindea G,Mlecnik B,Galon J,Trajanoski Z

    更新日期:2015-03-31 00:00:00

  • Concept recognition for extracting protein interaction relations from biomedical text.

    abstract:BACKGROUND:Reliable information extraction applications have been a long sought goal of the biomedical text mining community, a goal that if reached would provide valuable tools to benchside biologists in their increasingly difficult task of assimilating the knowledge contained in the biomedical literature. We present ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2008-9-s2-s9

    authors: Baumgartner WA Jr,Lu Z,Johnson HL,Caporaso JG,Paquette J,Lindemann A,White EK,Medvedeva O,Cohen KB,Hunter L

    更新日期:2008-01-01 00:00:00

  • SuperTranscripts: a data driven reference for analysis and visualisation of transcriptomes.

    abstract::Numerous methods have been developed to analyse RNA sequencing (RNA-seq) data, but most rely on the availability of a reference genome, making them unsuitable for non-model organisms. Here we present superTranscripts, a substitute for a reference genome, where each gene with multiple transcripts is represented by a si...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-017-1284-1

    authors: Davidson NM,Hawkins ADK,Oshlack A

    更新日期:2017-08-04 00:00:00

  • A novel mode of chromosomal evolution peculiar to filamentous Ascomycete fungi.

    abstract:BACKGROUND:Gene loss, inversions, translocations, and other chromosomal rearrangements vary among species, resulting in different rates of structural genome evolution. Major chromosomal rearrangements are rare in most eukaryotes, giving large regions with the same genes in the same order and orientation across species....

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2011-12-5-r45

    authors: Hane JK,Rouxel T,Howlett BJ,Kema GH,Goodwin SB,Oliver RP

    更新日期:2011-01-01 00:00:00

  • Going beyond genetics to discover cancer targets.

    abstract::Two recent studies demonstrate the power of integrating tumor genotype information with epigenetic and proteomic studies to discover potential therapeutic targets in breast cancer. ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-017-1238-7

    authors: Sandoval GJ,Hahn WC

    更新日期:2017-05-22 00:00:00

  • Demystifying "drop-outs" in single-cell UMI data.

    abstract::Many existing pipelines for scRNA-seq data apply pre-processing steps such as normalization or imputation to account for excessive zeros or "drop-outs." Here, we extensively analyze diverse UMI data sets to show that clustering should be the foremost step of the workflow. We observe that most drop-outs disappear once ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-020-02096-y

    authors: Kim TH,Zhou X,Chen M

    更新日期:2020-08-06 00:00:00

  • Substantial contribution of genetic variation in the expression of transcription factors to phenotypic variation revealed by eRD-GWAS.

    abstract:BACKGROUND:There are significant limitations in existing methods for the genome-wide identification of genes whose expression patterns affect traits. RESULTS:The transcriptomes of five tissues from 27 genetically diverse maize inbred lines were deeply sequenced to identify genes exhibiting high and low levels of expre...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-017-1328-6

    authors: Lin HY,Liu Q,Li X,Yang J,Liu S,Huang Y,Scanlon MJ,Nettleton D,Schnable PS

    更新日期:2017-10-17 00:00:00

  • Crucial timing in schizophrenia: role of DNA methylation in early neurodevelopment.

    abstract::An exciting recent study examining the methylation profile of human brain tissue implicates early-life epigenetic disruption in the neurodevelopmental origin of schizophrenia. ...

    journal_title:Genome biology

    pub_type: 评论,杂志文章

    doi:10.1186/s13059-014-0495-y

    authors: Ryan J,Saffery R

    更新日期:2014-10-29 00:00:00

  • Identifying related L1 retrotransposons by analyzing 3' transduced sequences.

    abstract:BACKGROUND:A large fraction of the human genome is attributable to L1 retrotransposon sequences. Not only do L1s themselves make up a significant portion of the genome, but L1-encoded proteins are thought to be responsible for the transposition of other repetitive elements and processed pseudogenes. In addition, L1s ca...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2003-4-5-r30

    authors: Szak ST,Pickeral OK,Landsman D,Boeke JD

    更新日期:2003-01-01 00:00:00

  • Assessing taxonomic metagenome profilers with OPAL.

    abstract::The explosive growth in taxonomic metagenome profiling methods over the past years has created a need for systematic comparisons using relevant performance criteria. The Open-community Profiling Assessment tooL (OPAL) implements commonly used performance metrics, including those of the first challenge of the initiativ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-019-1646-y

    authors: Meyer F,Bremges A,Belmann P,Janssen S,McHardy AC,Koslicki D

    更新日期:2019-03-04 00:00:00