Microbial species delineation using whole genome sequences.

Abstract:

:Increased sequencing of microbial genomes has revealed that prevailing prokaryotic species assignments can be inconsistent with whole genome information for a significant number of species. The long-standing need for a systematic and scalable species assignment technique can be met by the genome-wide Average Nucleotide Identity (gANI) metric, which is widely acknowledged as a robust measure of genomic relatedness. In this work, we demonstrate that the combination of gANI and the alignment fraction (AF) between two genomes accurately reflects their genomic relatedness. We introduce an efficient implementation of AF,gANI and discuss its successful application to 86.5M genome pairs between 13,151 prokaryotic genomes assigned to 3032 species. Subsequently, by comparing the genome clusters obtained from complete linkage clustering of these pairs to existing taxonomy, we observed that nearly 18% of all prokaryotic species suffer from anomalies in species definition. Our results can be used to explore central questions such as whether microorganisms form a continuum of genetic diversity or distinct species represented by distinct genetic signatures. We propose that this precise and objective AF,gANI-based species definition: the MiSI (Microbial Species Identifier) method, be used to address previous inconsistencies in species classification and as the primary guide for new taxonomic species assignment, supplemented by the traditional polyphasic approach, as required.

journal_name

Nucleic Acids Res

journal_title

Nucleic acids research

authors

Varghese NJ,Mukherjee S,Ivanova N,Konstantinidis KT,Mavrommatis K,Kyrpides NC,Pati A

doi

10.1093/nar/gkv657

subject

Has Abstract

pub_date

2015-08-18 00:00:00

pages

6761-71

issue

14

eissn

0305-1048

issn

1362-4962

pii

gkv657

journal_volume

43

pub_type

杂志文章
  • Prokaryotic 5'-3' exonucleases share a common core structure with gamma-delta resolvase.

    abstract::The three dimensional crystal structure of T5 5'-3' exonuclease was compared with that of two other members of the 5'-3' exonuclease family: T4 ribonuclease H and the N-terminal domain of Thermus aquaticus DNA polymerase I. Though these structures were largely similar, some regions of these enzymes show evidence of si...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/25.21.4224

    authors: Artymiuk PJ,Ceska TA,Suck D,Sayers JR

    更新日期:1997-11-01 00:00:00

  • Identification and characterization of transcription factor IIIA from Schizosaccharomyces pombe.

    abstract::Transcription factor IIIA (TFIIIA) is specifically required for transcription of 5S rRNA genes and is the archetypal C2H2 zinc finger protein. All known vertebrate TFIIIAs have a similar organization: nine zinc fingers, followed by a C-terminal domain of unknown structure. The zinc fingers of Saccharomyces cerevisiae ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkf385

    authors: Schulman DB,Setzer DR

    更新日期:2002-07-01 00:00:00

  • d(GA x TC)(n) microsatellite DNA sequences enhance homologous DNA recombination in SV40 minichromosomes.

    abstract::The genomic distribution of the abundant eukaryotic d(GA x TC)(n) DNA microsatellite suggests that it could contribute to DNA recombination. Here, it is shown that this type of microsatellite DNA sequence enhances DNA recombination in SV40 minichromosomes, the rate of homologous DNA recombination increasing by as much...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/28.23.4617

    authors: Benet A,Mollà G,Azorín F

    更新日期:2000-12-01 00:00:00

  • The signal recognition particle database (SRPDB).

    abstract::The SRPDB (signal recognition particle database) provides annotated SRP RNA sequences from Eucaryotes and Archaea, phylogenetically ordered and aligned with their bacterial equivalents. We also make available representative RNA secondary structure diagrams, where each base pair is proven by comparative sequence analys...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/21.13.3019

    authors: Larsen N,Zwieb C

    更新日期:1993-07-01 00:00:00

  • Database for mobile group II introns.

    abstract::Group II introns are self-splicing RNAs and retroelements found in bacteria and lower eukaryotic organelles. During the past several years, they have been uncovered in surprising numbers in bacteria due to the genome sequencing projects; however, most of the newly sequenced introns are not correctly identified. We hav...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkg049

    authors: Dai L,Toor N,Olson R,Keeping A,Zimmerly S

    更新日期:2003-01-01 00:00:00

  • Eco Cyc: encyclopedia of Escherichia coli genes and metabolism.

    abstract::The EcoCyc database describes the genome and gene products of Escherichia coli, its metabolic and signal-transduction pathways, and its tRNAs. The database describes 4391 genes of E.coli, 695 enzymes encoded by a subset of these genes, 904 metabolic reactions that occur in E.coli, and the organization of these reactio...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/27.1.55

    authors: Karp PD,Riley M,Paley SM,Pellegrini-Toole A,Krummenacker M

    更新日期:1999-01-01 00:00:00

  • Promoter-enhancer looping at the PPARγ2 locus during adipogenic differentiation requires the Prmt5 methyltransferase.

    abstract::PPARγ2 is a critical lineage-determining transcription factor that is essential for adipogenic differentiation. Here we report characterization of the three-dimensional structure of the PPARγ2 locus after the onset of adipogenic differentiation and the mechanisms by which it forms. We identified a differentiation-depe...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkw129

    authors: LeBlanc SE,Wu Q,Lamba P,Sif S,Imbalzano AN

    更新日期:2016-06-20 00:00:00

  • The nucleotide sequence of 5.8S rRNA from the posterior silk gland of the silkworm Philosamia cynthia ricini.

    abstract::The nucleotide sequence of 5.8S rRNA from the Chinese silkworm Philosamia cynthia ricini has been determined by gel sequencing and mobility shift methods. The complete primary structure is (sequence in text). This is one of the largest known 5.8S rRNAs. As compared to Bombyx 5.8S rRNA, it is two nucleotides longer; t...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/10.20.6383

    authors: Feng YX,Krupp G,Gross HJ

    更新日期:1982-10-25 00:00:00

  • Covalent attachment of synthetic DNA to self-assembled monolayer films.

    abstract::The covalent attachment of thiol-modified DNA oligomers; to self-assembled monolayer silane films on fused silica and oxidized silicon substrates is described. A heterobifunctional crosslinking molecule bearing both thiol- and amino-reactive moieties was used to tether a DNA oligomer (modified at its terminus with a t...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/24.15.3031

    authors: Chrisey LA,Lee GU,O'Ferrall CE

    更新日期:1996-08-01 00:00:00

  • Finely tuned conformational dynamics regulate the protective function of the lncRNA MALAT1 triple helix.

    abstract::Nucleic acid triplexes may regulate many important biological processes. Persistent accumulation of the oncogenic 7-kb long noncoding RNA MALAT1 is dependent on an unusually long intramolecular triple helix. This triplex structure is positioned within a conserved ENE (element for nuclear expression) motif at the lncRN...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gky1171

    authors: Ageeli AA,McGovern-Gooch KR,Kaminska MM,Baird NJ

    更新日期:2019-02-20 00:00:00

  • 'Single addition' and 'transnucleotidation' reactions catalyzed by polynucleotide phosphorylase. Effect of enzymatic removal of inorganic phosphate during reaction.

    abstract::The reaction of the tetranucleotide, pA-A(2)-A, with 2'(3')-0-(alpha-methoxyethyl)uridine 5'-diphosphate, Mg(2+) ions, and M. luteus polynucleotide phosphorylase followed by mild acid treatment to remove the blocking groups results in a 49% yield of the desired single addition product, pA-A(3)-U, together with smaller...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/1.12.1665

    authors: Sninsky JJ,Bennett GN,Gilham PT

    更新日期:1974-12-01 00:00:00

  • Unexpected correlations between gene expression and codon usage bias from microarray data for the whole Escherichia coli K-12 genome.

    abstract::Escherichia coli has long been regarded as a model organism in the study of codon usage bias (CUB). However, most studies in this organism regarding this topic have been computational or, when experimental, restricted to small datasets; particularly poor attention has been given to genes with low CUB. In this work, co...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkg897

    authors: dos Reis M,Wernisch L,Savva R

    更新日期:2003-12-01 00:00:00

  • Mechanism of translational coupling between coat protein and replicase genes of RNA bacteriophage MS2.

    abstract::We have analyzed the molecular mechanism that makes translation of the MS2 replicase cistron dependent on the translation of the upstream coat cistron. Deletion mapping on cloned cDNA of the phage shows that the ribosomal binding site of the replicase cistron is masked by a long distance basepairing to an internal coa...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/13.19.6955

    authors: Berkhout B,van Duin J

    更新日期:1985-10-11 00:00:00

  • A switch region inversion contributes to the aberrant rearrangement of a mu immunoglobulin heavy chain gene in MPC-11 cells.

    abstract::We describe the unique features of an aberrantly rearranged mu immunoglobulin heavy chain gene isolated from MPC-11 cells (a gamma 2b producing Balb/c plasmacytoma). A novel rearrangement has occurred 1.5 Kb 5' of the MPC-11 mu gene (denoted 18b mu) resulting in the deletion of the majority of the repetitive switch re...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/10.23.7751

    authors: Greenberg R,Lang RB,Diamond MS,Marcu KB

    更新日期:1982-12-11 00:00:00

  • Ultraviolet light irradiation of PM2 superhelical DNA.

    abstract::Superhelical PM2 DNA can be photochemically modified by u.v. irradiation. The variation of S20,w with dose shows the following characteristics. There is a linear increase from 28 to 31s produced by a low dose of u.v. irradiation (4,000 ergs/mm2). A plateau in S20,w occurs between 4,000 and 10,000 ergs/mm2. The S20,w t...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/4.5.1243

    authors: Woodworth-Gutai M,Lebowitz J,Kato AC,Denhardt DT

    更新日期:1977-01-01 00:00:00

  • Identification of discrete classes of small nucleolar RNA featuring different ends and RNA binding protein dependency.

    abstract::Small nucleolar RNAs (snoRNAs) are among the first discovered and most extensively studied group of small non-coding RNA. However, most studies focused on a small subset of snoRNAs that guide the modification of ribosomal RNA. In this study, we annotated the expression pattern of all box C/D snoRNAs in normal and canc...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gku664

    authors: Deschamps-Francoeur G,Garneau D,Dupuis-Sandoval F,Roy A,Frappier M,Catala M,Couture S,Barbe-Marcoux M,Abou-Elela S,Scott MS

    更新日期:2014-09-01 00:00:00

  • Visualizing the proteome of Escherichia coli: an efficient and versatile method for labeling chromosomal coding DNA sequences (CDSs) with fluorescent protein genes.

    abstract::To investigate the feasibility of conducting a genomic-scale protein labeling and localization study in Escherichia coli, a representative subset of 23 coding DNA sequences (CDSs) was selected for chromosomal tagging with one or more fluorescent protein genes (EGFP, EYFP, mRFP1, DsRed2). We used lambda-Red recombinati...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkl1158

    authors: Watt RM,Wang J,Leong M,Kung HF,Cheah KS,Liu D,Danchin A,Huang JD

    更新日期:2007-01-01 00:00:00

  • A new nomenclature for the cytoplasmic ribosomal proteins of Saccharomyces cerevisiae.

    abstract::The availability of the complete sequence of the Saccharomyces cerevisiae genome has allowed a comprehensive analysis of the genes encoding cytoplasmic ribosomal proteins in this organism. On the basis of this complete inventory a new nomenclature for the yeast ribosomal proteins is presented. ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/25.24.4872

    authors: Mager WH,Planta RJ,Ballesta JG,Lee JC,Mizuta K,Suzuki K,Warner JR,Woolford J

    更新日期:1997-12-15 00:00:00

  • The nuclease domain of the SPP1 packaging motor coordinates DNA cleavage and encapsidation.

    abstract::The large terminase subunit is a central component of the genome packaging motor from tailed bacteriophages and herpes viruses. This two-domain enzyme has an N-terminal ATPase activity that fuels DNA translocation during packaging and a C-terminal nuclease activity required for initiation and termination of the packag...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gks974

    authors: Cornilleau C,Atmane N,Jacquet E,Smits C,Alonso JC,Tavares P,Oliveira L

    更新日期:2013-01-07 00:00:00

  • G-quadruplex formation at the 3' end of telomere DNA inhibits its extension by telomerase, polymerase and unwinding by helicase.

    abstract::Telomere G-quadruplex is emerging as a promising anti-cancer target due to its inhibition to telomerase, an enzyme expressed in more than 85% tumors. Telomerase-mediated telomere extension and some other reactions require a free 3' telomere end in single-stranded form. G-quadruplex formation near the 3' end of telomer...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkr164

    authors: Wang Q,Liu JQ,Chen Z,Zheng KW,Chen CY,Hao YH,Tan Z

    更新日期:2011-08-01 00:00:00

  • Aicardi-Goutières syndrome protein TREX1 suppresses L1 and maintains genome integrity through exonuclease-independent ORF1p depletion.

    abstract::Maintaining genome integrity is important for cells and damaged DNA triggers autoimmunity. Previous studies have reported that Three-prime repair exonuclease 1(TREX1), an endogenous DNA exonuclease, prevents immune activation by depleting damaged DNA, thus preventing the development of certain autoimmune diseases. Con...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkx178

    authors: Li P,Du J,Goodier JL,Hou J,Kang J,Kazazian HH Jr,Zhao K,Yu XF

    更新日期:2017-05-05 00:00:00

  • Motif3D: Relating protein sequence motifs to 3D structure.

    abstract::Motif3D is a web-based protein structure viewer designed to allow sequence motifs, and in particular those contained in the fingerprints of the PRINTS database, to be visualised on three-dimensional (3D) structures. Additional functionality is provided for the rhodopsin-like G protein-coupled receptors, enabling finge...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkg534

    authors: Gaulton A,Attwood TK

    更新日期:2003-07-01 00:00:00

  • A stereospecifically 18O-labelled deoxydinucleoside phosphate block for incorporation into an oligonucleotide.

    abstract::Fully protected diastereoisomers of deoxyguanylyl (3' leads to 5') deoxyadenosine stereospecifically labelled on phosphorus with oxygen-18 have been synthesized by oxidation of phosphite triester intermediates in the presence of 18O-labelled water. The diastereoisomers have been chromatographically separated and their...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/11.20.7087

    authors: Potter BV,Eckstein F,Uznański B

    更新日期:1983-10-25 00:00:00

  • Allele frequency net: a database and online repository for immune gene frequencies in worldwide populations.

    abstract::The allele frequency net database (http://www.allelefrequencies.net) is an online repository that contains information on the frequencies of immune genes and their corresponding alleles in different populations. The extensive variability observed in genes and alleles related to the immune system response and its signi...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkq1128

    authors: Gonzalez-Galarza FF,Christmas S,Middleton D,Jones AR

    更新日期:2011-01-01 00:00:00

  • Characterization of hybridization between synthetic oligodeoxynucleotides and RNA in living cells.

    abstract::Cells internalized synthetic oligonucleotides (oligos) in culture. The hybridization of these molecules to target RNA in the living cell was subsequently detected and characterized after fixation of the cells, with or without previous detergent extraction. Hybridized oligo was distinguished from free oligo in the cell...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/23.24.4946

    authors: Politz JC,Taneja KL,Singer RH

    更新日期:1995-12-25 00:00:00

  • Precise BAC targeting of genetically polymorphic mouse ES cells.

    abstract::The use of bacterial artificial chromosomes (BACs) provides a consistent and high targeting efficiency of homologous recombination in embryonic stem (ES) cells, facilitated by long stretches of sequence homology. Here, we introduce a BAC targeting method which employs restriction fragment length polymorphisms (RFLPs) ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkr550

    authors: Barakat TS,Rentmeester E,Sleutels F,Grootegoed JA,Gribnau J

    更新日期:2011-10-01 00:00:00

  • RPA complexes in Caenorhabditis elegans meiosis; unique roles in replication, meiotic recombination and apoptosis.

    abstract::Replication Protein A (RPA) is a critical complex that acts in replication and promotes homologous recombination by allowing recombinase recruitment to processed DSB ends. Most organisms possess three RPA subunits (RPA1, RPA2, RPA3) that form a trimeric complex critical for viability. The Caenorhabditis elegans genome...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkaa1293

    authors: Hefel A,Honda M,Cronin N,Harrell K,Patel P,Spies M,Smolikove S

    更新日期:2021-01-21 00:00:00

  • Genomes OnLine database (GOLD) v.7: updates and new features.

    abstract::The Genomes Online Database (GOLD) (https://gold.jgi.doe.gov) is an open online resource, which maintains an up-to-date catalog of genome and metagenome projects in the context of a comprehensive list of associated metadata. Information in GOLD is organized into four levels: Study, Biosample/Organism, Sequencing Proje...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gky977

    authors: Mukherjee S,Stamatis D,Bertsch J,Ovchinnikova G,Katta HY,Mojica A,Chen IA,Kyrpides NC,Reddy T

    更新日期:2019-01-08 00:00:00

  • Targeted cross-linking-mass spectrometry determines vicinal interactomes within heterogeneous RNP complexes.

    abstract::Proteomic and RNomic approaches have identified many components of different ribonucleoprotein particles (RNPs), yet still little is known about the organization and protein proximities within these heterogeneous and highly dynamic complexes. Here we describe a targeted cross-linking approach, which combines cross-lin...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkv1366

    authors: Trahan C,Oeffinger M

    更新日期:2016-02-18 00:00:00

  • Improved nucleotide selectivity and termination of 3'-OH unblocked reversible terminators by molecular tuning of 2-nitrobenzyl alkylated HOMedU triphosphates.

    abstract::We describe a novel 3'-OH unblocked reversible terminator with the potential to improve accuracy and read-lengths in next-generation sequencing (NGS) technologies. This terminator is based on 5-hydroxymethyl-2'-deoxyuridine triphosphate (HOMedUTP), a hypermodified nucleotide found naturally in the genomes of numerous ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkq1293

    authors: Litosh VA,Wu W,Stupi BP,Wang J,Morris SE,Hersh MN,Metzker ML

    更新日期:2011-03-01 00:00:00