Abstract:
Increased sequencing of microbial genomes has revealed that prevailing prokaryotic species assignments can be inconsistent with whole genome information for a significant number of species. The long-standing need for a systematic and scalable species assignment technique can be met by the genome-wide Average Nucleotide Identity (gANI) metric, which is widely acknowledged as a robust measure of genomic relatedness. In this work, we demonstrate that the combination of gANI and the alignment fraction (AF) between two genomes accurately reflects their genomic relatedness. We introduce an efficient implementation of AF,gANI and discuss its successful application to 86.5M genome pairs between 13,151 prokaryotic genomes assigned to 3,032 species. Subsequently, by comparing the genome clusters obtained from complete linkage clustering of these pairs to existing taxonomy, we observed that nearly 18% of all prokaryotic species suffer from anomalies in species definition. Our results can be used to explore central questions such as whether microorganisms form a continuum of genetic diversity or distinct species represented by distinct genetic signatures. We propose that this precise and objective AF,gANI-based species definition, the MiSI (Microbial Species Identifier) method, be used to address previous inconsistencies in species classification and as the primary guide for new taxonomic species assignment, supplemented by the traditional polyphasic approach, as required.
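The pair count and the clustering step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the cutoffs `AF_MIN` and `GANI_MIN`, the toy `PAIRS` table, and the helper names `n_pairs`, `related`, and `complete_linkage` are all assumptions made for illustration, and the abstract itself does not state the thresholds used. The first function checks that 13,151 genomes yield roughly 86.5M unordered pairs; the second shows the defining property of complete linkage, namely that two clusters merge only when every cross-cluster pair passes the cutoffs.

```python
from itertools import combinations
from math import comb

# Placeholder cutoffs for illustration only; the abstract does not state them.
AF_MIN, GANI_MIN = 0.6, 96.5

# Hypothetical toy data: (AF, gANI) for a few genome pairs; missing pairs count as unrelated.
PAIRS = {
    frozenset("AB"): (0.9, 98.0),
    frozenset("AC"): (0.8, 97.0),
    frozenset("BC"): (0.5, 97.5),  # fails the AF cutoff
}


def n_pairs(n_genomes: int) -> int:
    """Number of unordered genome pairs among n_genomes."""
    return comb(n_genomes, 2)


def related(a: str, b: str) -> bool:
    """True when the pair passes both placeholder cutoffs."""
    af, gani = PAIRS.get(frozenset((a, b)), (0.0, 0.0))
    return af >= AF_MIN and gani >= GANI_MIN


def complete_linkage(genomes, is_related):
    """Greedy complete-linkage clustering under a pass/fail relation.

    Two clusters are merged only if every cross-cluster pair is related.
    """
    clusters = [[g] for g in genomes]
    merged = True
    while merged:
        merged = False
        for i, j in combinations(range(len(clusters)), 2):
            if all(is_related(a, b) for a in clusters[i] for b in clusters[j]):
                clusters[i] = clusters[i] + clusters[j]
                del clusters[j]
                merged = True
                break  # cluster indices changed, so restart the scan
    return clusters


if __name__ == "__main__":
    print(n_pairs(13151))  # 86467825, i.e. about 86.5M pairs
    print(complete_linkage(list("ABCD"), related))
```

In the toy data, A is related to both B and C, but B and C fail the AF cutoff, so complete linkage keeps C out of the A/B cluster. A single-linkage scheme would have chained all three together, which is one reason a stricter linkage suits species-level grouping.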
journal_name: Nucleic Acids Res
journal_title: Nucleic acids research
authors: Varghese NJ, Mukherjee S, Ivanova N, Konstantinidis KT, Mavrommatis K, Kyrpides NC, Pati A
doi: 10.1093/nar/gkv657
subject: Has Abstract
pub_date: 2015-08-18 00:00:00
pages: 6761-71
issue: 14
eissn: 0305-1048
issn: 1362-4962
pii: gkv657
journal_volume: 43
pub_type: Journal article
abstract::The three dimensional crystal structure of T5 5'-3' exonuclease was compared with that of two other members of the 5'-3' exonuclease family: T4 ribonuclease H and the N-terminal domain of Thermus aquaticus DNA polymerase I. Though these structures were largely similar, some regions of these enzymes show evidence of si...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/25.21.4224
update_date:1997-11-01 00:00:00
abstract::Transcription factor IIIA (TFIIIA) is specifically required for transcription of 5S rRNA genes and is the archetypal C2H2 zinc finger protein. All known vertebrate TFIIIAs have a similar organization: nine zinc fingers, followed by a C-terminal domain of unknown structure. The zinc fingers of Saccharomyces cerevisiae ...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkf385
update_date:2002-07-01 00:00:00
abstract::The genomic distribution of the abundant eukaryotic d(GA x TC)(n) DNA microsatellite suggests that it could contribute to DNA recombination. Here, it is shown that this type of microsatellite DNA sequence enhances DNA recombination in SV40 minichromosomes, the rate of homologous DNA recombination increasing by as much...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/28.23.4617
update_date:2000-12-01 00:00:00
abstract::The SRPDB (signal recognition particle database) provides annotated SRP RNA sequences from Eucaryotes and Archaea, phylogenetically ordered and aligned with their bacterial equivalents. We also make available representative RNA secondary structure diagrams, where each base pair is proven by comparative sequence analys...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/21.13.3019
update_date:1993-07-01 00:00:00
abstract::Group II introns are self-splicing RNAs and retroelements found in bacteria and lower eukaryotic organelles. During the past several years, they have been uncovered in surprising numbers in bacteria due to the genome sequencing projects; however, most of the newly sequenced introns are not correctly identified. We hav...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkg049
update_date:2003-01-01 00:00:00
abstract::The EcoCyc database describes the genome and gene products of Escherichia coli, its metabolic and signal-transduction pathways, and its tRNAs. The database describes 4391 genes of E.coli, 695 enzymes encoded by a subset of these genes, 904 metabolic reactions that occur in E.coli, and the organization of these reactio...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/27.1.55
update_date:1999-01-01 00:00:00
abstract::PPARγ2 is a critical lineage-determining transcription factor that is essential for adipogenic differentiation. Here we report characterization of the three-dimensional structure of the PPARγ2 locus after the onset of adipogenic differentiation and the mechanisms by which it forms. We identified a differentiation-depe...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkw129
update_date:2016-06-20 00:00:00
abstract::The nucleotide sequence of 5.8S rRNA from the Chinese silkworm Philosamia cynthia ricini has been determined by gel sequencing and mobility shift methods. The complete primary structure is (sequence in text). This is one of the largest known 5.8S rRNAs. As compared to Bombyx 5.8S rRNA, it is two nucleotides longer; t...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/10.20.6383
update_date:1982-10-25 00:00:00
abstract::The covalent attachment of thiol-modified DNA oligomers to self-assembled monolayer silane films on fused silica and oxidized silicon substrates is described. A heterobifunctional crosslinking molecule bearing both thiol- and amino-reactive moieties was used to tether a DNA oligomer (modified at its terminus with a t...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/24.15.3031
update_date:1996-08-01 00:00:00
abstract::Nucleic acid triplexes may regulate many important biological processes. Persistent accumulation of the oncogenic 7-kb long noncoding RNA MALAT1 is dependent on an unusually long intramolecular triple helix. This triplex structure is positioned within a conserved ENE (element for nuclear expression) motif at the lncRN...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gky1171
update_date:2019-02-20 00:00:00
abstract::The reaction of the tetranucleotide, pA-A(2)-A, with 2'(3')-O-(alpha-methoxyethyl)uridine 5'-diphosphate, Mg(2+) ions, and M. luteus polynucleotide phosphorylase followed by mild acid treatment to remove the blocking groups results in a 49% yield of the desired single addition product, pA-A(3)-U, together with smaller...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/1.12.1665
update_date:1974-12-01 00:00:00
abstract::Escherichia coli has long been regarded as a model organism in the study of codon usage bias (CUB). However, most studies in this organism regarding this topic have been computational or, when experimental, restricted to small datasets; particularly poor attention has been given to genes with low CUB. In this work, co...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkg897
update_date:2003-12-01 00:00:00
abstract::We have analyzed the molecular mechanism that makes translation of the MS2 replicase cistron dependent on the translation of the upstream coat cistron. Deletion mapping on cloned cDNA of the phage shows that the ribosomal binding site of the replicase cistron is masked by a long distance basepairing to an internal coa...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/13.19.6955
update_date:1985-10-11 00:00:00
abstract::We describe the unique features of an aberrantly rearranged mu immunoglobulin heavy chain gene isolated from MPC-11 cells (a gamma 2b producing Balb/c plasmacytoma). A novel rearrangement has occurred 1.5 Kb 5' of the MPC-11 mu gene (denoted 18b mu) resulting in the deletion of the majority of the repetitive switch re...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/10.23.7751
update_date:1982-12-11 00:00:00
abstract::Superhelical PM2 DNA can be photochemically modified by u.v. irradiation. The variation of S20,w with dose shows the following characteristics. There is a linear increase from 28 to 31s produced by a low dose of u.v. irradiation (4,000 ergs/mm2). A plateau in S20,w occurs between 4,000 and 10,000 ergs/mm2. The S20,w t...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/4.5.1243
update_date:1977-01-01 00:00:00
abstract::Small nucleolar RNAs (snoRNAs) are among the first discovered and most extensively studied group of small non-coding RNA. However, most studies focused on a small subset of snoRNAs that guide the modification of ribosomal RNA. In this study, we annotated the expression pattern of all box C/D snoRNAs in normal and canc...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gku664
update_date:2014-09-01 00:00:00
abstract::To investigate the feasibility of conducting a genomic-scale protein labeling and localization study in Escherichia coli, a representative subset of 23 coding DNA sequences (CDSs) was selected for chromosomal tagging with one or more fluorescent protein genes (EGFP, EYFP, mRFP1, DsRed2). We used lambda-Red recombinati...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkl1158
update_date:2007-01-01 00:00:00
abstract::The availability of the complete sequence of the Saccharomyces cerevisiae genome has allowed a comprehensive analysis of the genes encoding cytoplasmic ribosomal proteins in this organism. On the basis of this complete inventory a new nomenclature for the yeast ribosomal proteins is presented. ...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/25.24.4872
update_date:1997-12-15 00:00:00
abstract::The large terminase subunit is a central component of the genome packaging motor from tailed bacteriophages and herpes viruses. This two-domain enzyme has an N-terminal ATPase activity that fuels DNA translocation during packaging and a C-terminal nuclease activity required for initiation and termination of the packag...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gks974
update_date:2013-01-07 00:00:00
abstract::Telomere G-quadruplex is emerging as a promising anti-cancer target due to its inhibition of telomerase, an enzyme expressed in more than 85% of tumors. Telomerase-mediated telomere extension and some other reactions require a free 3' telomere end in single-stranded form. G-quadruplex formation near the 3' end of telomer...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkr164
update_date:2011-08-01 00:00:00
abstract::Maintaining genome integrity is important for cells and damaged DNA triggers autoimmunity. Previous studies have reported that Three-prime repair exonuclease 1 (TREX1), an endogenous DNA exonuclease, prevents immune activation by depleting damaged DNA, thus preventing the development of certain autoimmune diseases. Con...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkx178
update_date:2017-05-05 00:00:00
abstract::Motif3D is a web-based protein structure viewer designed to allow sequence motifs, and in particular those contained in the fingerprints of the PRINTS database, to be visualised on three-dimensional (3D) structures. Additional functionality is provided for the rhodopsin-like G protein-coupled receptors, enabling finge...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkg534
update_date:2003-07-01 00:00:00
abstract::Fully protected diastereoisomers of deoxyguanylyl (3' leads to 5') deoxyadenosine stereospecifically labelled on phosphorus with oxygen-18 have been synthesized by oxidation of phosphite triester intermediates in the presence of 18O-labelled water. The diastereoisomers have been chromatographically separated and their...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/11.20.7087
update_date:1983-10-25 00:00:00
abstract::The allele frequency net database (http://www.allelefrequencies.net) is an online repository that contains information on the frequencies of immune genes and their corresponding alleles in different populations. The extensive variability observed in genes and alleles related to the immune system response and its signi...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkq1128
update_date:2011-01-01 00:00:00
abstract::Cells internalized synthetic oligonucleotides (oligos) in culture. The hybridization of these molecules to target RNA in the living cell was subsequently detected and characterized after fixation of the cells, with or without previous detergent extraction. Hybridized oligo was distinguished from free oligo in the cell...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/23.24.4946
update_date:1995-12-25 00:00:00
abstract::The use of bacterial artificial chromosomes (BACs) provides a consistent and high targeting efficiency of homologous recombination in embryonic stem (ES) cells, facilitated by long stretches of sequence homology. Here, we introduce a BAC targeting method which employs restriction fragment length polymorphisms (RFLPs) ...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkr550
update_date:2011-10-01 00:00:00
abstract::Replication Protein A (RPA) is a critical complex that acts in replication and promotes homologous recombination by allowing recombinase recruitment to processed DSB ends. Most organisms possess three RPA subunits (RPA1, RPA2, RPA3) that form a trimeric complex critical for viability. The Caenorhabditis elegans genome...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkaa1293
update_date:2021-01-21 00:00:00
abstract::The Genomes Online Database (GOLD) (https://gold.jgi.doe.gov) is an open online resource, which maintains an up-to-date catalog of genome and metagenome projects in the context of a comprehensive list of associated metadata. Information in GOLD is organized into four levels: Study, Biosample/Organism, Sequencing Proje...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gky977
update_date:2019-01-08 00:00:00
abstract::Proteomic and RNomic approaches have identified many components of different ribonucleoprotein particles (RNPs), yet still little is known about the organization and protein proximities within these heterogeneous and highly dynamic complexes. Here we describe a targeted cross-linking approach, which combines cross-lin...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkv1366
update_date:2016-02-18 00:00:00
abstract::We describe a novel 3'-OH unblocked reversible terminator with the potential to improve accuracy and read-lengths in next-generation sequencing (NGS) technologies. This terminator is based on 5-hydroxymethyl-2'-deoxyuridine triphosphate (HOMedUTP), a hypermodified nucleotide found naturally in the genomes of numerous ...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkq1293
update_date:2011-03-01 00:00:00