Use of designed sequences in protein structure recognition.

Abstract:

BACKGROUND:Knowledge of the protein structure is a pre-requisite for improved understanding of molecular function. The gap in the sequence-structure space has increased in the post-genomic era. Grouping related protein sequences into families can aid in narrowing the gap. In the Pfam database, structure description is provided for part or full-length proteins of 7726 families. For the remaining 52% of the families, information on 3-D structure is not yet available. We use the computationally designed sequences that are intermediately related to two protein domain families, which are already known to share the same fold. These strategically designed sequences enable detection of distant relationships and here, we have employed them for the purpose of structure recognition of protein families of yet unknown structure. RESULTS:We first measured the success rate of our approach using a dataset of protein families of known fold and achieved a success rate of 88%. Next, for 1392 families of yet unknown structure, we made structural assignments for part/full length of the proteins. Fold association for 423 domains of unknown function (DUFs) are provided as a step towards functional annotation. CONCLUSION:The results indicate that knowledge-based filling of gaps in protein sequence space is a lucrative approach for structure recognition. Such sequences assist in traversal through protein sequence space and effectively function as 'linkers', where natural linkers between distant proteins are unavailable. REVIEWERS:This article was reviewed by Oliviero Carugo, Christine Orengo and Srikrishna Subramanian.

journal_name

Biol Direct

journal_title

Biology direct

authors

Kumar G,Mudgal R,Srinivasan N,Sandhya S

doi

10.1186/s13062-018-0209-6

subject

Has Abstract

pub_date

2018-05-09 00:00:00

pages

8

issue

1

issn

1745-6150

pii

10.1186/s13062-018-0209-6

journal_volume

13

pub_type

杂志文章
  • Proteomic changes associated with deletion of the Magnaporthe oryzae conidial morphology-regulating gene COM1.

    abstract:BACKGROUND:The rice blast disease caused by Magnaporthe oryzae is a major constraint on world rice production. The conidia produced by this fungal pathogen are the main source of disease dissemination. The morphology of conidia may be a critical factor in the spore dispersal and virulence of M. oryzae in the field. Del...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/1745-6150-5-61

    authors: Bhadauria V,Wang LX,Peng YL

    更新日期:2010-11-02 00:00:00

  • Comprehensive comparative-genomic analysis of type 2 toxin-antitoxin systems and related mobile stress response systems in prokaryotes.

    abstract:BACKGROUND:The prokaryotic toxin-antitoxin systems (TAS, also referred to as TA loci) are widespread, mobile two-gene modules that can be viewed as selfish genetic elements because they evolved mechanisms to become addictive for replicons and cells in which they reside, but also possess "normal" cellular functions in v...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/1745-6150-4-19

    authors: Makarova KS,Wolf YI,Koonin EV

    更新日期:2009-06-03 00:00:00

  • MutL homologs in restriction-modification systems and the origin of eukaryotic MORC ATPases.

    abstract::The provenance and biochemical roles of eukaryotic MORC proteins have remained poorly understood since the discovery of their prototype MORC1, which is required for meiotic nuclear division in animals. The MORC family contains a combination of a gyrase, histidine kinase, and MutL (GHKL) and S5 domains that together co...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/1745-6150-3-8

    authors: Iyer LM,Abhiman S,Aravind L

    更新日期:2008-03-17 00:00:00

  • Optimal treatment and stochastic modeling of heterogeneous tumors.

    abstract:UNLABELLED:In this work we review past articles that have mathematically studied cancer heterogeneity and the impact of this heterogeneity on the structure of optimal therapy. We look at past works on modeling how heterogeneous tumors respond to radiotherapy, and take a particularly close look at how the optimal radiot...

    journal_title:Biology direct

    pub_type: 杂志文章,评审

    doi:10.1186/s13062-016-0142-5

    authors: Badri H,Leder K

    更新日期:2016-08-23 00:00:00

  • Description of plant tRNA-derived RNA fragments (tRFs) associated with argonaute and identification of their putative targets.

    abstract::tRNA-derived RNA fragments (tRFs) are 19mer small RNAs that associate with Argonaute (AGO) proteins in humans. However, in plants, it is unknown if tRFs bind with AGO proteins. Here, using public deep sequencing libraries of immunoprecipitated Argonaute proteins (AGO-IP) and bioinformatics approaches, we identified th...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/1745-6150-8-6

    authors: Loss-Morais G,Waterhouse PM,Margis R

    更新日期:2013-02-12 00:00:00

  • Episodic, transient systemic acidosis delays evolution of the malignant phenotype: Possible mechanism for cancer prevention by increased physical activity.

    abstract:BACKGROUND:The transition from premalignant to invasive tumour growth is a prolonged multistep process governed by phenotypic adaptation to changing microenvironmental selection pressures. Cancer prevention strategies are required to interrupt or delay somatic evolution of the malignant invasive phenotype. Empirical st...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/1745-6150-5-22

    authors: Smallbone K,Maini PK,Gatenby RA

    更新日期:2010-04-20 00:00:00

  • Diverse bacterial genomes encode an operon of two genes, one of which is an unusual class-I release factor that potentially recognizes atypical mRNA signals other than normal stop codons.

    abstract:BACKGROUND:While all codons that specify amino acids are universally recognized by tRNA molecules, codons signaling termination of translation are recognized by proteins known as class-I release factors (RF). In most eukaryotes and archaea a single RF accomplishes termination at all three stop codons. In most bacteria,...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/1745-6150-1-28

    authors: Baranov PV,Vestergaard B,Hamelryck T,Gesteland RF,Nyborg J,Atkins JF

    更新日期:2006-09-13 00:00:00

  • Why eukaryotic cells use introns to enhance gene expression: splicing reduces transcription-associated mutagenesis by inhibiting topoisomerase I cutting activity.

    abstract:BACKGROUND:The costs and benefits of spliceosomal introns in eukaryotes have not been established. One recognized effect of intron splicing is its known enhancement of gene expression. However, the mechanism regulating such splicing-mediated expression enhancement has not been defined. Previous studies have shown that ...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/1745-6150-6-24

    authors: Niu DK,Yang YF

    更新日期:2011-05-18 00:00:00

  • A network-based approach to classify the three domains of life.

    abstract:BACKGROUND:Identifying group-specific characteristics in metabolic networks can provide better insight into evolutionary developments. Here, we present an approach to classify the three domains of life using topological information about the underlying metabolic networks. These networks have been shown to share domain-...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/1745-6150-6-53

    authors: Mueller LA,Kugler KG,Netzer M,Graber A,Dehmer M

    更新日期:2011-10-13 00:00:00

  • From tumors to species: a SCANDAL hypothesis.

    abstract::ᅟ: Some tumor cells can evolve into transmissible parasites. Notable examples include the Tasmanian devil facial tumor disease, the canine transmissible venereal tumor and transmissible cancers of mollusks. We present a hypothesis that such transmissible tumors existed in the past and that some modern animal taxa are ...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/s13062-019-0233-1

    authors: Panchin AY,Aleoshin VV,Panchin YV

    更新日期:2019-01-23 00:00:00

  • Prediction and mechanistic analysis of drug-induced liver injury (DILI) based on chemical structure.

    abstract:BACKGROUND:Drug-induced liver injury (DILI) is a major safety concern characterized by a complex and diverse pathogenesis. In order to identify DILI early in drug development, a better understanding of the injury and models with better predictivity are urgently needed. One approach in this regard are in silico models w...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/s13062-020-00285-0

    authors: Liu A,Walter M,Wright P,Bartosik A,Dolciami D,Elbasir A,Yang H,Bender A

    更新日期:2021-01-18 00:00:00

  • Trees and networks before and after Darwin.

    abstract::It is well-known that Charles Darwin sketched abstract trees of relationship in his 1837 notebook, and depicted a tree in the Origin of Species (1859). Here I attempt to place Darwin's trees in historical context. By the mid-Eighteenth century the Great Chain of Being was increasingly seen to be an inadequate descript...

    journal_title:Biology direct

    pub_type: 历史文章,杂志文章,评审

    doi:10.1186/1745-6150-4-43

    authors: Ragan MA

    更新日期:2009-11-16 00:00:00

  • Is pre-Darwinian evolution plausible?

    abstract:BACKGROUND:This essay highlights critical aspects of the plausibility of pre-Darwinian evolution. It is based on a critical review of some better-known open, far-from-equilibrium system-based scenarios supposed to explain processes that took place before Darwinian evolution had emerged and that resulted in the origin o...

    journal_title:Biology direct

    pub_type: 杂志文章,评审

    doi:10.1186/s13062-018-0216-7

    authors: Tessera M

    更新日期:2018-09-21 00:00:00

  • The multiple personalities of Watson and Crick strands.

    abstract:BACKGROUND:In genetics it is customary to refer to double-stranded DNA as containing a "Watson strand" and a "Crick strand." However, there seems to be no consensus in the literature on the exact meaning of these two terms, and the many usages contradict one another as well as the original definition. Here, we review t...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/1745-6150-6-7

    authors: Cartwright RA,Graur D

    更新日期:2011-02-08 00:00:00

  • Origin of the nuclear proteome on the basis of pre-existing nuclear localization signals in prokaryotic proteins.

    abstract:BACKGROUND:The origin of the selective nuclear protein import machinery, which consists of nuclear pore complexes and adaptor molecules interacting with the nuclear localization signals (NLSs) of cargo molecules, is one of the most important events in the evolution of eukaryotic cells. How proteins were selected for im...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/s13062-020-00263-6

    authors: Lisitsyna OM,Kurnaeva MA,Arifulin EA,Shubina MY,Musinova YR,Mironov AA,Sheval EV

    更新日期:2020-04-28 00:00:00

  • LINEs of evidence: noncanonical DNA replication as an epigenetic determinant.

    abstract::LINE-1 (L1) retrotransposons are repetitive elements in mammalian genomes. They are capable of synthesizing DNA on their own RNA templates by harnessing reverse transcriptase (RT) that they encode. Abundantly expressed full-length L1s and their RT are found to globally influence gene expression profiles, differentiati...

    journal_title:Biology direct

    pub_type: 杂志文章,评审

    doi:10.1186/1745-6150-8-22

    authors: Belan E

    更新日期:2013-09-13 00:00:00

  • Domain enhanced lookup time accelerated BLAST.

    abstract:BACKGROUND:BLAST is a commonly-used software package for comparing a query sequence to a database of known sequences; in this study, we focus on protein sequences. Position-specific-iterated BLAST (PSI-BLAST) iteratively searches a protein sequence database, using the matches in round i to construct a position-specific...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/1745-6150-7-12

    authors: Boratyn GM,Schäffer AA,Agarwala R,Altschul SF,Lipman DJ,Madden TL

    更新日期:2012-04-17 00:00:00

  • Interplay of recombination and selection in the genomes of Chlamydia trachomatis.

    abstract:BACKGROUND:Chlamydia trachomatis is an obligate intracellular bacterial parasite, which causes several severe and debilitating diseases in humans. This study uses comparative genomic analyses of 12 complete published C. trachomatis genomes to assess the contribution of recombination and selection in this pathogen and t...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/1745-6150-6-28

    authors: Joseph SJ,Didelot X,Gandhi K,Dean D,Read TD

    更新日期:2011-05-26 00:00:00

  • Rotational restriction of nascent peptides as an essential element of co-translational protein folding: possible molecular players and structural consequences.

    abstract:BACKGROUND:A basic tenet of protein science is that all information about the spatial structure of proteins is present in their sequences. Nonetheless, many proteins fail to attain native structure upon experimental denaturation and refolding in vitro, raising the question of the specific role of cellular machinery in ...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/s13062-017-0186-1

    authors: Sorokina I,Mushegian A

    更新日期:2017-05-31 00:00:00

  • Biased gene transfer and its implications for the concept of lineage.

    abstract:BACKGROUND:In the presence of horizontal gene transfer (HGT), the concepts of lineage and genealogy in the microbial world become more ambiguous because chimeric genomes trace their ancestry from a myriad of sources, both living and extinct. RESULTS:We present the evolutionary histories of three aminoacyl-tRNA synthet...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/1745-6150-6-47

    authors: Andam CP,Gogarten JP

    更新日期:2011-09-23 00:00:00

  • Plant viruses of the Amalgaviridae family evolved via recombination between viruses with double-stranded and negative-strand RNA genomes.

    abstract::Plant viruses of the recently recognized family Amalgaviridae have monopartite double-stranded (ds) RNA genomes and encode two proteins: an RNA-dependent RNA polymerase (RdRp) and a putative capsid protein (CP). Whereas the RdRp of amalgaviruses has been found to be most closely related to the RdRps of dsRNA viruses o...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/s13062-015-0047-8

    authors: Krupovic M,Dolja VV,Koonin EV

    更新日期:2015-03-29 00:00:00

  • Human gammadelta T cell recognition of lipid A is predominately presented by CD1b or CD1c on dendritic cells.

    abstract:BACKGROUND:The gammadelta T cells serve as early immune defense against certain encountered microbes. Only a few gammadelta T cell-recognized ligands from microbial antigens have been identified so far and the mechanisms by which gammadelta T cells recognize these ligands remain unknown. Here we explored the mechanism ...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/1745-6150-4-47

    authors: Cui Y,Kang L,Cui L,He W

    更新日期:2009-12-01 00:00:00

  • Putative adaptive inter-slope divergence of transposon frequency in fruit flies (Drosophila melanogaster) at "Evolution Canyon", Mount Carmel, Israel.

    abstract:BACKGROUND:The current analysis of transposon elements (TE) in Drosophila melanogaster at Evolution Canyon, (EC), Israel, is based on data and analysis done by our collaborators (Drs. J. Gonzalez, J. Martinez and W. Makalowski, this issue). They estimated the frequencies of 28 TEs (transposon elements) in fruit flies (...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/s13062-015-0074-5

    authors: Beiles A,Raz S,Ben-Abu Y,Nevo E

    更新日期:2015-10-14 00:00:00

  • A novel superfamily containing the beta-grasp fold involved in binding diverse soluble ligands.

    abstract:BACKGROUND:Domains containing the beta-grasp fold are utilized in a great diversity of physiological functions but their role, if any, in soluble or small molecule ligand recognition is poorly studied. RESULTS:Using sensitive sequence and structure similarity searches we identify a novel superfamily containing the bet...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/1745-6150-2-4

    authors: Burroughs AM,Balaji S,Iyer LM,Aravind L

    更新日期:2007-01-24 00:00:00

  • PEPstrMOD: structure prediction of peptides containing natural, non-natural and modified residues.

    abstract:BACKGROUND:In the past, many methods have been developed for peptide tertiary structure prediction but they are limited to peptides having natural amino acids. This study describes a method PEPstrMOD, which is an updated version of PEPstr, developed specifically for predicting the structure of peptides containing natur...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/s13062-015-0103-4

    authors: Singh S,Singh H,Tuknait A,Chaudhary K,Singh B,Kumaran S,Raghava GP

    更新日期:2015-12-21 00:00:00

  • Global analyses of Chromosome 17 and 18 genes of lung telocytes compared with mesenchymal stem cells, fibroblasts, alveolar type II cells, airway epithelial cells, and lymphocytes.

    abstract:BACKGROUND:Telocytes (TCs) is an interstitial cell with extremely long and thin telopodes (Tps) with thin segments (podomers) and dilations (podoms) to interact with neighboring cells. TCs have been found in different organs, while there is still a lack of TCs-specific biomarkers to distinguish TCs from the other cells...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/s13062-015-0042-0

    authors: Wang J,Ye L,Jin M,Wang X

    更新日期:2015-03-11 00:00:00

  • Evolution before genes.

    abstract:BACKGROUND:Our current understanding of evolution is so tightly linked to template-dependent replication of DNA and RNA molecules that the old idea from Oparin of a self-reproducing 'garbage bag' ('coacervate') of chemicals that predated fully-fledged cell-like entities seems to be farfetched to most scientists today. ...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/1745-6150-7-1

    authors: Vasas V,Fernando C,Santos M,Kauffman S,Szathmáry E

    更新日期:2012-01-05 00:00:00

  • Insights into archaeal evolution and symbiosis from the genomes of a nanoarchaeon and its inferred crenarchaeal host from Obsidian Pool, Yellowstone National Park.

    abstract:BACKGROUND:A single cultured marine organism, Nanoarchaeum equitans, represents the Nanoarchaeota branch of symbiotic Archaea, with a highly reduced genome and unusual features such as multiple split genes. RESULTS:The first terrestrial hyperthermophilic member of the Nanoarchaeota was collected from Obsidian Pool, a ...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/1745-6150-8-9

    authors: Podar M,Makarova KS,Graham DE,Wolf YI,Koonin EV,Reysenbach AL

    更新日期:2013-04-22 00:00:00

  • Comparative genomic analysis of the DUF71/COG2102 family predicts roles in diphthamide biosynthesis and B12 salvage.

    abstract:BACKGROUND:The availability of over 3000 published genome sequences has enabled the use of comparative genomic approaches to drive the biological function discovery process. Classically, one used to link gene with function by genetic or biochemical approaches, a lengthy process that often took years. Phylogenetic distr...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/1745-6150-7-32

    authors: de Crécy-Lagard V,Forouhar F,Brochier-Armanet C,Tong L,Hunt JF

    更新日期:2012-09-26 00:00:00

  • Pseudo-chaotic oscillations in CRISPR-virus coevolution predicted by bifurcation analysis.

    abstract:BACKGROUND:The CRISPR-Cas systems of adaptive antivirus immunity are present in most archaea and many bacteria, and provide resistance to specific viruses or plasmids by inserting fragments of foreign DNA into the host genome and then utilizing transcripts of these spacers to inactivate the cognate foreign genome. The ...

    journal_title:Biology direct

    pub_type: 杂志文章

    doi:10.1186/1745-6150-9-13

    authors: Berezovskaya FS,Wolf YI,Koonin EV,Karev GP

    更新日期:2014-07-02 00:00:00