Abstract:
BACKGROUND:An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt on such a classification was implemented in Clusters of Orthologous Groups of proteins (COGs). Rapid accumulation of genome sequences creates opportunities for refining COGs but also represents a challenge because of error amplification. One of the practical strategies involves construction of refined COGs for phylogenetically compact subsets of genomes. RESULTS:New Archaeal Clusters of Orthologous Genes (arCOGs) were constructed for 41 archaeal genomes (13 Crenarchaeota, 27 Euryarchaeota and one Nanoarchaeon) using an improved procedure that employs a similarity tree between smaller, group-specific clusters, semi-automatically partitions orthology domains in multidomain proteins, and uses profile searches for identification of remote orthologs. The annotation of arCOGs is a consensus between three assignments based on the COGs, the CDD database, and the annotations of homologs in the NR database. The 7538 arCOGs, on average, cover approximately 88% of the genes in a genome compared to a approximately 76% coverage in COGs. The finer granularity of ortholog identification in the arCOGs is apparent from the fact that 4538 arCOGs correspond to 2362 COGs; approximately 40% of the arCOGs are new. The archaeal gene core (protein-coding genes found in all 41 genome) consists of 166 arCOGs. The arCOGs were used to reconstruct gene loss and gene gain events during archaeal evolution and gene sets of ancestral forms. The Last Archaeal Common Ancestor (LACA) is conservatively estimated to possess 996 genes compared to 1245 and 1335 genes for the last common ancestors of Crenarchaeota and Euryarchaeota, respectively. It is inferred that LACA was a chemoautotrophic hyperthermophile that, in addition to the core archaeal functions, encoded more idiosyncratic systems, e.g., the CASS systems of antivirus defense and some toxin-antitoxin systems. CONCLUSION:The arCOGs provide a convenient, flexible framework for functional annotation of archaeal genomes, comparative genomics and evolutionary reconstructions. Genomic reconstructions suggest that the last common ancestor of archaea might have been (nearly) as advanced as the modern archaeal hyperthermophiles. ArCOGs and related information are available at: ftp://ftp.ncbi.nih.gov/pub/koonin/arCOGs/.
journal_name
Biol Directjournal_title
Biology directauthors
Makarova KS,Sorokin AV,Novichkov PS,Wolf YI,Koonin EVdoi
10.1186/1745-6150-2-33subject
Has Abstractpub_date
2007-11-27 00:00:00pages
33issn
1745-6150pii
1745-6150-2-33journal_volume
2pub_type
杂志文章相关文献
Biology Direct文献大全abstract:BACKGROUND:H. sapiens-M. tuberculosis H37Rv protein-protein interaction (PPI) data are essential for understanding the infection mechanism of the formidable pathogen M. tuberculosis H37Rv. Computational prediction is an important strategy to fill the gap in experimental H. sapiens-M. tuberculosis H37Rv PPI data. Homolo...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/1745-6150-9-5
更新日期:2014-04-08 00:00:00
abstract::In this article, I review the results of studies on the origin of life distinct from the popular RNA world hypothesis. The alternate scenario postulates the origin of the first bimolecular genetic system (a polynucleotide gene and a polypeptide processive polymerase) with simultaneous replication and translation and i...
journal_title:Biology direct
pub_type: 杂志文章,评审
doi:10.1186/s13062-015-0096-z
更新日期:2015-11-26 00:00:00
abstract:BACKGROUND:In genetics it is customary to refer to double-stranded DNA as containing a "Watson strand" and a "Crick strand." However, there seems to be no consensus in the literature on the exact meaning of these two terms, and the many usages contradict one another as well as the original definition. Here, we review t...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/1745-6150-6-7
更新日期:2011-02-08 00:00:00
abstract:BACKGROUND:The transition from premalignant to invasive tumour growth is a prolonged multistep process governed by phenotypic adaptation to changing microenvironmental selection pressures. Cancer prevention strategies are required to interrupt or delay somatic evolution of the malignant invasive phenotype. Empirical st...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/1745-6150-5-22
更新日期:2010-04-20 00:00:00
abstract:BACKGROUND:The gammadelta T cells serve as early immune defense against certain encountered microbes. Only a few gammadelta T cell-recognized ligands from microbial antigens have been identified so far and the mechanisms by which gammadelta T cells recognize these ligands remain unknown. Here we explored the mechanism ...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/1745-6150-4-47
更新日期:2009-12-01 00:00:00
abstract:BACKGROUND:The elucidation of the dominant role of horizontal gene transfer (HGT) in the evolution of prokaryotes led to a severe crisis of the Tree of Life (TOL) concept and intense debates on this subject. CONCEPT:Prompted by the crisis of the TOL, we attempt to define the primary units and the fundamental patterns ...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/1745-6150-4-33
更新日期:2009-09-29 00:00:00
abstract:BACKGROUND:A dramatic increase in the prevalence of autism and Autistic Spectrum Disorders (ASD) has been observed over the last two decades in USA, Europe and Asia. Given the accumulating data on the possible role of translation in the etiology of ASD, we analyzed potential effects of rare synonymous substitutions ass...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/1745-6150-9-16
更新日期:2014-07-10 00:00:00
abstract:BACKGROUND:Phagocytosis, that is, engulfment of large particles by eukaryotic cells, is found in diverse organisms and is often thought to be central to the very origin of the eukaryotic cell, in particular, for the acquisition of bacterial endosymbionts including the ancestor of the mitochondrion. RESULTS:Comparisons...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/1745-6150-4-9
更新日期:2009-02-26 00:00:00
abstract::tRNA-derived RNA fragments (tRFs) are 19mer small RNAs that associate with Argonaute (AGO) proteins in humans. However, in plants, it is unknown if tRFs bind with AGO proteins. Here, using public deep sequencing libraries of immunoprecipitated Argonaute proteins (AGO-IP) and bioinformatics approaches, we identified th...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/1745-6150-8-6
更新日期:2013-02-12 00:00:00
abstract::LINE-1 (L1) retrotransposons are repetitive elements in mammalian genomes. They are capable of synthesizing DNA on their own RNA templates by harnessing reverse transcriptase (RT) that they encode. Abundantly expressed full-length L1s and their RT are found to globally influence gene expression profiles, differentiati...
journal_title:Biology direct
pub_type: 杂志文章,评审
doi:10.1186/1745-6150-8-22
更新日期:2013-09-13 00:00:00
abstract:BACKGROUND:Glycogen synthase kinase-3 (GSK-3) is a ubiquitously expressed serine/threonine (Ser/Thr) kinase comprising two isoforms, GSK-3α and GSK-3β. Both enzymes are similarly inactivated by serine phosphorylation (GSK-3α at Ser21 and GSK-3β at Ser9) and activated by tyrosine phosphorylation (GSK-3α at Tyr279 and GS...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/1745-6150-6-4
更新日期:2011-01-24 00:00:00
abstract:BACKGROUND:In the past, many methods have been developed for peptide tertiary structure prediction but they are limited to peptides having natural amino acids. This study describes a method PEPstrMOD, which is an updated version of PEPstr, developed specifically for predicting the structure of peptides containing natur...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/s13062-015-0103-4
更新日期:2015-12-21 00:00:00
abstract:BACKGROUND:Our current understanding of evolution is so tightly linked to template-dependent replication of DNA and RNA molecules that the old idea from Oparin of a self-reproducing 'garbage bag' ('coacervate') of chemicals that predated fully-fledged cell-like entities seems to be farfetched to most scientists today. ...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/1745-6150-7-1
更新日期:2012-01-05 00:00:00
abstract:BACKGROUND:The N-end rule pathway is a part of the ubiquitin-dependent proteolytic system wherein N-recognin proteins recognize the amino terminal degradation signals (N-degrons) of the substrate. The type 1 N-degron recognizing UBR-box domain of the eukaryotic Arg/N-end rule pathway is known to possess a novel three-z...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/s13062-015-0066-5
更新日期:2015-07-17 00:00:00
abstract:BACKGROUND:Microscopic examination of living cells often reveals that cells from some cell strains appear to be in a permanent state of disarray without obvious reason. In all probability such a disorderly state affects cell functioning. The aim of this study was to establish whether a disorderly state could occur that...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/1745-6150-1-9
更新日期:2006-04-02 00:00:00
abstract::The provenance and biochemical roles of eukaryotic MORC proteins have remained poorly understood since the discovery of their prototype MORC1, which is required for meiotic nuclear division in animals. The MORC family contains a combination of a gyrase, histidine kinase, and MutL (GHKL) and S5 domains that together co...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/1745-6150-3-8
更新日期:2008-03-17 00:00:00
abstract:BACKGROUND:Accurate estimation of the isoelectric point (pI) based on the amino acid sequence is useful for many analytical biochemistry and proteomics techniques such as 2-D polyacrylamide gel electrophoresis, or capillary isoelectric focusing used in combination with high-throughput mass spectrometry. Additionally, p...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/s13062-016-0159-9
更新日期:2016-10-21 00:00:00
abstract:BACKGROUND:Telocytes (TCs) is an interstitial cell with extremely long and thin telopodes (Tps) with thin segments (podomers) and dilations (podoms) to interact with neighboring cells. TCs have been found in different organs, while there is still a lack of TCs-specific biomarkers to distinguish TCs from the other cells...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/s13062-015-0042-0
更新日期:2015-03-11 00:00:00
abstract:BACKGROUND:BLAST is a commonly-used software package for comparing a query sequence to a database of known sequences; in this study, we focus on protein sequences. Position-specific-iterated BLAST (PSI-BLAST) iteratively searches a protein sequence database, using the matches in round i to construct a position-specific...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/1745-6150-7-12
更新日期:2012-04-17 00:00:00
abstract::Chron's Disease is a chronic inflammatory intestinal disease, first described at the beginning of the last century. The disease is characterized by the alternation of periods of flares and remissions influenced by a complex pathogenesis in which inflammation plays a key role. Crohn's disease evolution is mediated by a...
journal_title:Biology direct
pub_type: 杂志文章,评审
doi:10.1186/s13062-020-00280-5
更新日期:2020-11-07 00:00:00
abstract::The concept of developmental constraints has been central to understand the role of development in morphological evolution. Developmental constraints are classically defined as biases imposed by development on the distribution of morphological variation.This opinion article argues that the concepts of developmental co...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/s13062-020-00289-w
更新日期:2021-01-09 00:00:00
abstract:BACKGROUND:The origin of eukaryotic cells was an important transition in evolution. The factors underlying the origin and evolutionary success of the eukaryote lineage are still discussed. One camp argues that mitochondria were essential for eukaryote origin because of the unique configuration of internalized bioenerge...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/s13062-018-0221-x
更新日期:2018-10-03 00:00:00
abstract:BACKGROUND:The current analysis of transposon elements (TE) in Drosophila melanogaster at Evolution Canyon, (EC), Israel, is based on data and analysis done by our collaborators (Drs. J. Gonzalez, J. Martinez and W. Makalowski, this issue). They estimated the frequencies of 28 TEs (transposon elements) in fruit flies (...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/s13062-015-0074-5
更新日期:2015-10-14 00:00:00
abstract:BACKGROUND:Drug-induced liver injury (DILI) is a major concern in drug development, as hepatotoxicity may not be apparent at early stages but can lead to life threatening consequences. The ability to predict DILI from in vitro data would be a crucial advantage. In 2018, the Critical Assessment Massive Data Analysis gro...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/s13062-020-0259-4
更新日期:2020-02-13 00:00:00
abstract:BACKGROUND:Several computational candidate gene selection and prioritization methods have recently been developed. These in silico selection and prioritization techniques are usually based on two central approaches--the examination of similarities to known disease genes and/or the evaluation of functional annotation of...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/1745-6150-6-30
更新日期:2011-06-13 00:00:00
abstract:BACKGROUND:One of the major challenges in the field of system biology is to understand the interaction between a wide range of proteins and ligands. In the past, methods have been developed for predicting binding sites in a protein for a limited number of ligands. RESULTS:In order to address this problem, we developed...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/s13062-016-0118-5
更新日期:2016-03-25 00:00:00
abstract:BACKGROUND:Knowledge of the protein structure is a pre-requisite for improved understanding of molecular function. The gap in the sequence-structure space has increased in the post-genomic era. Grouping related protein sequences into families can aid in narrowing the gap. In the Pfam database, structure description is ...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/s13062-018-0209-6
更新日期:2018-05-09 00:00:00
abstract::All modern cells are bounded by cell membranes best described by the fluid mosaic model. This statement is so widely accepted by biologists that little attention is generally given to the theoretical importance of cell membranes in describing the cell. This has not always been the case. When the Cell Theory was first ...
journal_title:Biology direct
pub_type: 历史文章,杂志文章,评审
doi:10.1186/s13062-014-0032-7
更新日期:2014-12-19 00:00:00
abstract:BACKGROUND:In eukaryotes, RNA interference (RNAi) is a major mechanism of defense against viruses and transposable elements as well of regulating translation of endogenous mRNAs. The RNAi systems recognize the target RNA molecules via small guide RNAs that are completely or partially complementary to a region of the ta...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/1745-6150-4-29
更新日期:2009-08-25 00:00:00
abstract:BACKGROUND:The costs and benefits of spliceosomal introns in eukaryotes have not been established. One recognized effect of intron splicing is its known enhancement of gene expression. However, the mechanism regulating such splicing-mediated expression enhancement has not been defined. Previous studies have shown that ...
journal_title:Biology direct
pub_type: 杂志文章
doi:10.1186/1745-6150-6-24
更新日期:2011-05-18 00:00:00