A new computational method for the detection of horizontal gene transfer events.

Abstract:

:In recent years, the increase in the amounts of available genomic data has made it easier to appreciate the extent by which organisms increase their genetic diversity through horizontally transferred genetic material. Such transfers have the potential to give rise to extremely dynamic genomes where a significant proportion of their coding DNA has been contributed by external sources. Because of the impact of these horizontal transfers on the ecological and pathogenic character of the recipient organisms, methods are continuously sought that are able to computationally determine which of the genes of a given genome are products of transfer events. In this paper, we introduce and discuss a novel computational method for identifying horizontal transfers that relies on a gene's nucleotide composition and obviates the need for knowledge of codon boundaries. In addition to being applicable to individual genes, the method can be easily extended to the case of clusters of horizontally transferred genes. With the help of an extensive and carefully designed set of experiments on 123 archaeal and bacterial genomes, we demonstrate that the new method exhibits significant improvement in sensitivity when compared to previously published approaches. In fact, it achieves an average relative improvement across genomes of between 11 and 41% compared to the Codon Adaptation Index method in distinguishing native from foreign genes. Our method's horizontal gene transfer predictions for 123 microbial genomes are available online at http://cbcsrv.watson.ibm.com/HGT/.

journal_name

Nucleic Acids Res

journal_title

Nucleic acids research

authors

Tsirigos A,Rigoutsos I

doi

10.1093/nar/gki187

keywords:

subject

Has Abstract

pub_date

2005-02-16 00:00:00

pages

922-33

issue

3

eissn

0305-1048

issn

1362-4962

pii

33/3/922

journal_volume

33

pub_type

杂志文章
  • The characterisation of 1SF monomer nucleosomes from hen oviduct and the partial characterisation of a third HMG14/17-like in such nucleosomes.

    abstract::Nucleosomes released from oviduct nuclei during brief micrococcal nuclease digestions are enriched in transcribed sequences (bloom K.S. and Anderson, J.N. (1978) Cell, 15, 141-150). Such nucleosomes released into this 1Sf supernatant fraction are enriched in proteins HMG14, 17 and a third lower molecular weight protei...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/9.12.2761

    authors: Goodwin GH,Wright CA,Johns EW

    更新日期:1981-06-25 00:00:00

  • Complexes of the arginine-rich histone tetramer (H3)2(H4)2 with negatively supercoiled DNA: electron microscopy and chemical cross-linking.

    abstract::Tetramers of the arginine-rich histones H3 and H4 associate with supercoiled SV40 DNA either singly, giving tetrameric nucleoprotein complexes or in pairs giving octameric complexes, both of which are visualized as beads in the electron microscope. The relative amounts of the two complexes may be revealed by complete ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/7.3.611

    authors: Thomas JO,Oudet P

    更新日期:1979-10-10 00:00:00

  • Reorganizing the protein space at the Universal Protein Resource (UniProt).

    abstract::The mission of UniProt is to support biological research by providing a freely accessible, stable, comprehensive, fully classified, richly and accurately annotated protein sequence knowledgebase, with extensive cross-references and querying interfaces. UniProt is comprised of four major components, each optimized for ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkr981

    authors: UniProt Consortium.

    更新日期:2012-01-01 00:00:00

  • Intracellular inhibition of hepatitis C virus (HCV) internal ribosomal entry site (IRES)-dependent translation by peptide nucleic acids (PNAs) and locked nucleic acids (LNAs).

    abstract::Hepatitis C virus (HCV) is the major etiological agent of non-A, non-B hepatitis. Current therapies are not effective in all patients and can result in the generation of resistant mutants, leading to a need for new therapeutic options. HCV has an RNA genome that contains a well-defined and highly conserved secondary s...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkh706

    authors: Nulf CJ,Corey D

    更新日期:2004-07-19 00:00:00

  • Identifying DNA-binding proteins using structural motifs and the electrostatic potential.

    abstract::Robust methods to detect DNA-binding proteins from structures of unknown function are important for structural biology. This paper describes a method for identifying such proteins that (i) have a solvent accessible structural motif necessary for DNA-binding and (ii) a positive electrostatic potential in the region of ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkh803

    authors: Shanahan HP,Garcia MA,Jones S,Thornton JM

    更新日期:2004-09-08 00:00:00

  • Subtle structural alterations in G-quadruplex DNA regulate site specificity of fluorescence light-up probes.

    abstract::G-quadruplex (G4) DNA structures are linked to key biological processes and human diseases. Small molecules that target specific G4 DNA structures and signal their presence would therefore be of great value as chemical research tools with potential to further advance towards diagnostic and therapeutic developments. Ho...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkz1205

    authors: Kumar R,Chand K,Bhowmik S,Das RN,Bhattacharjee S,Hedenström M,Chorell E

    更新日期:2020-02-20 00:00:00

  • Quantitative analysis of specific labelled RNA'S using DNA covalently linked to diazobenzyloxymethyl-paper.

    abstract::Substantial amounts of DNA (at least 25 microgram per cm2) can be stably bound to diazobenzyloxymethyl (DBM)-paper. Complementary RNA will hybridize to the DNA paper almost completely in 24 hours. Using several different conditions of hybridization and washing, the background of RNA bound non-specifically is very low ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/6.1.195

    authors: Stark GR,Williams JG

    更新日期:1979-01-01 00:00:00

  • Separation and assembly of deep sequencing data into discrete sub-population genomes.

    abstract::Sequence heterogeneity is a common characteristic of RNA viruses that is often referred to as sub-populations or quasispecies. Traditional techniques used for assembly of short sequence reads produced by deep sequencing, such as de-novo assemblers, ignore the underlying diversity. Here, we introduce a novel algorithm ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkx755

    authors: Karagiannis K,Simonyan V,Chumakov K,Mazumder R

    更新日期:2017-11-02 00:00:00

  • Single-molecule kinetics reveal microscopic mechanism by which High-Mobility Group B proteins alter DNA flexibility.

    abstract::Eukaryotic High-Mobility Group B (HMGB) proteins alter DNA elasticity while facilitating transcription, replication and DNA repair. We developed a new single-molecule method to probe non-specific DNA interactions for two HMGB homologs: the human HMGB2 box A domain and yeast Nhp6Ap, along with chimeric mutants replacin...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gks1031

    authors: McCauley MJ,Rueter EM,Rouzina I,Maher LJ 3rd,Williams MC

    更新日期:2013-01-07 00:00:00

  • Assembly, nuclear import and function of U7 snRNPs studied by microinjection of synthetic U7 RNA into Xenopus oocytes.

    abstract::In Xenopus oocytes in vitro transcribed mouse U7 RNA is assembled into small nuclear ribonucleoproteins (snRNPs) that are functional in histone RNA 3' processing. If the special Sm binding site of U7 (AAUUUGUCUAG, U7 Sm WT) is converted into the canonical Sm sequence derived from the major snRNAs (AAUUUUUGGAG, U7 Sm O...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/23.16.3141

    authors: Stefanovic B,Hackl W,Lührmann R,Schümperli D

    更新日期:1995-08-25 00:00:00

  • The integrase family of tyrosine recombinases: evolution of a conserved active site domain.

    abstract::The integrases are a diverse family of tyrosine recombinases which rearrange DNA duplexes by means of conservative site-specific recombination reactions. Members of this family, of which the well-studied lambda Int protein is the prototype, were previously found to share four strongly conserved residues, including an ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/25.18.3605

    authors: Esposito D,Scocca JJ

    更新日期:1997-09-15 00:00:00

  • Primary organization of nucleosomal core particles is invariable in repressed and active nuclei from animal, plant and yeast cells.

    abstract::A refined map for the linear arrangement of histones along DNA in nucleosomal core particles has been determined by DNA-protein crosslinking. On one strand of 145-bp core DNA, histones are aligned in the following order: (5') H2B25,35-H455,65-H375,85,95/H488-H2B105,11 5-H2A118-H3135,145/H2A145 (3') (the subscripts giv...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/13.10.3439

    authors: Bavykin SG,Usachenko SI,Lishanskaya AI,Shick VV,Belyavsky AV,Undritsov IM,Strokov AA,Zalenskaya IA,Mirzabekov AD

    更新日期:1985-05-24 00:00:00

  • Pyrophosphate-condensing activity linked to nucleic acid synthesis.

    abstract::In some preparations of DNA dependent RNA polymerase a new enzymatic activity has been found which catalyzes the condensation of two pyrophosphate molecules, liberated in the process of RNA synthesis, to one molecule of orthophosphate and one molecule of Mg (or Mn) - chelate complex with trimetaphosphate. This activit...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/6.4.1521

    authors: Volloch VZ,Rits S,Tumerman L

    更新日期:1979-04-01 00:00:00

  • Cryptic transcripts from a ubiquitous plasmid origin of replication confound tests for cis-regulatory function.

    abstract::A vast amount of research on the regulation of gene expression has relied on plasmid reporter assays. In this study, we show that plasmids widely used for this purpose constitutively produce substantial amounts of RNA from a TATA-containing cryptic promoter within the origin of replication. Readthrough of these RNAs i...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gks451

    authors: Lemp NA,Hiraoka K,Kasahara N,Logg CR

    更新日期:2012-08-01 00:00:00

  • Consensus inverted terminal repeat sequence of Paramecium IESs: resemblance to termini of Tc1-related and Euplotes Tec transposons.

    abstract::During the formation of a transcriptionally active macronucleus, ciliated protozoa excise large numbers of interstitial segments of DNA (internal eliminated sequences; IESs) from their chromosomes. In this study we analyze the published sequences of 20 IESs that interrupt surface protein genes of Paramecium and identi...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/23.11.2006

    authors: Klobutcher LA,Herrick G

    更新日期:1995-06-11 00:00:00

  • Heterogeneous base distribution in mitochondrial DNA of Neurospora crassa.

    abstract::The mitochondrial DNA of Neurospora crassa has a heterogeneous intramolecular base distribution. A contiguous piece, representing at least 30% of the total genome, has a G+C content that is 6% lower than the overall G+C content of the DNA. The genes for both ribosomal RNAs are contained in the remaining, relatively G+...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/4.1.129

    authors: Terpstra P,Holtrop M,Kroon A

    更新日期:1977-01-01 00:00:00

  • A sequence-specific core promoter-binding transcription factor recruits TRF2 to coordinately transcribe ribosomal protein genes.

    abstract::Ribosomal protein (RP) genes must be coordinately expressed for proper assembly of the ribosome yet the mechanisms that control expression of RP genes in metazoans are poorly understood. Recently, TATA-binding protein-related factor 2 (TRF2) rather than the TATA-binding protein (TBP) was found to function in transcrip...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkx676

    authors: Baumann DG,Gilmour DS

    更新日期:2017-10-13 00:00:00

  • Insertion of an LrDNA gene fragment and of filler DNA at a mitochondrial exon-intron junction in Podospora.

    abstract::A rearrangement of the mitochondrial genome of a long lived mutant of Podospora anserina is presented. It consists in the insertion of 191 bp of the LrDNA gene (coding for the large ribosomal RNA) at the junction between exon1 and intron alpha of gene co1 (coding for subunit 1 of cytochrome oxidase). This insertion is...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/18.4.779

    authors: Sainsard-Chanet A,Begel O

    更新日期:1990-02-25 00:00:00

  • Cloning of two novel forms of human acidic fibroblast growth factor (aFGF) mRNA.

    abstract::We have previously isolated two different aFGF cDNA clones from kidney and brain. The two corresponding mRNA, designated aFGF 1.A and 1.B, are the predominant species in kidney and brain, respectively. During the characterization of aFGF mRNA in glioblastoma cells, we demonstrated that aFGF mRNA in U1242MG and D65MG g...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/21.3.489

    authors: Payson RA,Canatan H,Chotani MA,Wang WP,Harris SE,Myers RL,Chiu IM

    更新日期:1993-02-11 00:00:00

  • Chemical synthesis of a gene for somatomedin C.

    abstract::A synthetic gene for somatomedin C, a human growth factor, has been assembled by a single ligation of 23 oligodeoxyribonucleotides, which were chemically synthesized by an improved solid phase phosphotriester method. ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/13.8.2959

    authors: Sproat BS,Gait MJ

    更新日期:1985-04-25 00:00:00

  • The static and dynamic structural heterogeneities of B-DNA: extending Calladine-Dickerson rules.

    abstract::We present a multi-laboratory effort to describe the structural and dynamical properties of duplex B-DNA under physiological conditions. By processing a large amount of atomistic molecular dynamics simulations, we determine the sequence-dependent structural properties of DNA as expressed in the equilibrium distributio...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkz905

    authors: Dans PD,Balaceanu A,Pasi M,Patelli AS,Petkevičiūtė D,Walther J,Hospital A,Bayarri G,Lavery R,Maddocks JH,Orozco M

    更新日期:2019-12-02 00:00:00

  • Long non-coding RNAs function annotation: a global prediction method based on bi-colored networks.

    abstract::More and more evidences demonstrate that the long non-coding RNAs (lncRNAs) play many key roles in diverse biological processes. There is a critical need to annotate the functions of increasing available lncRNAs. In this article, we try to apply a global network-based strategy to tackle this issue for the first time. ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gks967

    authors: Guo X,Gao L,Liao Q,Xiao H,Ma X,Yang X,Luo H,Zhao G,Bu D,Jiao F,Shao Q,Chen R,Zhao Y

    更新日期:2013-01-01 00:00:00

  • 2'-O-methyl, 2'-O-ethyl oligoribonucleotides and phosphorothioate oligodeoxyribonucleotides as inhibitors of the in vitro U7 snRNP-dependent mRNA processing event.

    abstract::We describe the synthesis of 2'-O-methyl, 2'-O-ethyl oligoribonucleotides and phosphorothioate oligodeoxyribonucleotides and demonstrate their utility as inhibitors of the in vitro U7 snRNP-dependent mRNA processing event. These 2'-O-modified compounds were designed to possess the binding affinity of an RNA molecule t...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/19.10.2629

    authors: Cotten M,Oberhauser B,Brunar H,Holzner A,Issakides G,Noe CR,Schaffner G,Wagner E,Birnstiel ML

    更新日期:1991-05-25 00:00:00

  • Isolation and genome-wide characterization of cellular DNA:RNA triplex structures.

    abstract::RNA can directly bind to purine-rich DNA via Hoogsteen base pairing, forming a DNA:RNA triple helical structure that anchors the RNA to specific sequences and allows guiding of transcription regulators to distinct genomic loci. To unravel the prevalence of DNA:RNA triplexes in living cells, we have established a fast ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gky1305

    authors: Sentürk Cetin N,Kuo CC,Ribarska T,Li R,Costa IG,Grummt I

    更新日期:2019-03-18 00:00:00

  • Does the 'non-coding' strand code?

    abstract::The hypothesis that DNA strands complementary to the coding strand contain in phase coding sequences has been investigated. Statistical analysis of the 50 genes of bacteriophage T7 shows no significant correlation between patterns of codon usage on the coding and non-coding strands. In Bacillus and yeast genes the cor...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/13.4.1389

    authors: Sharp PM

    更新日期:1985-02-25 00:00:00

  • Localized structural frustration for evaluating the impact of sequence variants.

    abstract::Population-scale sequencing is increasingly uncovering large numbers of rare single-nucleotide variants (SNVs) in coding regions of the genome. The rarity of these variants makes it challenging to evaluate their deleteriousness with conventional phenotype-genotype associations. Protein structures provide a way of addr...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkw927

    authors: Kumar S,Clarke D,Gerstein M

    更新日期:2016-12-01 00:00:00

  • Spermine-DNA complexes build up metastable structures. Small-angle X-ray scattering and circular dichroism studies.

    abstract::Spermine-DNA complexes have been examined by small-angle and wide-angle X-ray scattering as well as by circular dichroism studies. Condensed complexes are building up below a critical ionic strength. We have found that at one and the same ionic strength condensed complexes having two different supramolecular structure...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/7.5.1297

    authors: Becker M,Misselwitz R,Damaschun H,Damaschun G,Zirwer D

    更新日期:1979-11-10 00:00:00

  • Differences in the phosphate oxygen requirements for self-cleavage by the extended and prototypical hammerhead forms.

    abstract::The hammerhead self-cleaving motif occurs in a variety of RNAs that infect plants and consists of three non-conserved helices connected by a highly conserved central core. A variant hammerhead, called the extended hammerhead, is found in satellite 2 transcripts from a variety of caudate amphibians. The extended hammer...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/25.11.2189

    authors: Mitrasinovic O,Epstein LM

    更新日期:1997-06-01 00:00:00

  • Comparative whole genome transcriptome analysis of three Plasmodium falciparum strains.

    abstract::Gene expression patterns have been demonstrated to be highly variable between similar cell types, for example lab strains and wild strains of Saccharomyces cerevisiae cultured under identical growth conditions exhibit a wide range of expression differences. We have used a genome-wide approach to characterize transcrip...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkj517

    authors: Llinás M,Bozdech Z,Wong ED,Adai AT,DeRisi JL

    更新日期:2006-02-21 00:00:00

  • The human M creatine kinase gene enhancer contains multiple functional interacting domains.

    abstract::Cis-elements (-933 to -641) upstream of the human M creatine kinase gene cap site contain an enhancer that confers developmental and tissue-specific expression to the chloramphenicol acetyltransferase gene in C2C12 myogenic cells transfected in culture. Division of the enhancer at -770 into a 5' fragment that includes...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/20.9.2313

    authors: Trask RV,Koster JC,Ritchie ME,Billadello JJ

    更新日期:1992-05-11 00:00:00