Abstract:
:In recent years, the increase in the amounts of available genomic data has made it easier to appreciate the extent by which organisms increase their genetic diversity through horizontally transferred genetic material. Such transfers have the potential to give rise to extremely dynamic genomes where a significant proportion of their coding DNA has been contributed by external sources. Because of the impact of these horizontal transfers on the ecological and pathogenic character of the recipient organisms, methods are continuously sought that are able to computationally determine which of the genes of a given genome are products of transfer events. In this paper, we introduce and discuss a novel computational method for identifying horizontal transfers that relies on a gene's nucleotide composition and obviates the need for knowledge of codon boundaries. In addition to being applicable to individual genes, the method can be easily extended to the case of clusters of horizontally transferred genes. With the help of an extensive and carefully designed set of experiments on 123 archaeal and bacterial genomes, we demonstrate that the new method exhibits significant improvement in sensitivity when compared to previously published approaches. In fact, it achieves an average relative improvement across genomes of between 11 and 41% compared to the Codon Adaptation Index method in distinguishing native from foreign genes. Our method's horizontal gene transfer predictions for 123 microbial genomes are available online at http://cbcsrv.watson.ibm.com/HGT/.
journal_name
Nucleic Acids Resjournal_title
Nucleic acids researchauthors
Tsirigos A,Rigoutsos Idoi
10.1093/nar/gki187keywords:
subject
Has Abstractpub_date
2005-02-16 00:00:00pages
922-33issue
3eissn
0305-1048issn
1362-4962pii
33/3/922journal_volume
33pub_type
杂志文章abstract::Nucleosomes released from oviduct nuclei during brief micrococcal nuclease digestions are enriched in transcribed sequences (bloom K.S. and Anderson, J.N. (1978) Cell, 15, 141-150). Such nucleosomes released into this 1Sf supernatant fraction are enriched in proteins HMG14, 17 and a third lower molecular weight protei...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/9.12.2761
更新日期:1981-06-25 00:00:00
abstract::Tetramers of the arginine-rich histones H3 and H4 associate with supercoiled SV40 DNA either singly, giving tetrameric nucleoprotein complexes or in pairs giving octameric complexes, both of which are visualized as beads in the electron microscope. The relative amounts of the two complexes may be revealed by complete ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/7.3.611
更新日期:1979-10-10 00:00:00
abstract::The mission of UniProt is to support biological research by providing a freely accessible, stable, comprehensive, fully classified, richly and accurately annotated protein sequence knowledgebase, with extensive cross-references and querying interfaces. UniProt is comprised of four major components, each optimized for ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkr981
更新日期:2012-01-01 00:00:00
abstract::Hepatitis C virus (HCV) is the major etiological agent of non-A, non-B hepatitis. Current therapies are not effective in all patients and can result in the generation of resistant mutants, leading to a need for new therapeutic options. HCV has an RNA genome that contains a well-defined and highly conserved secondary s...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkh706
更新日期:2004-07-19 00:00:00
abstract::Robust methods to detect DNA-binding proteins from structures of unknown function are important for structural biology. This paper describes a method for identifying such proteins that (i) have a solvent accessible structural motif necessary for DNA-binding and (ii) a positive electrostatic potential in the region of ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkh803
更新日期:2004-09-08 00:00:00
abstract::G-quadruplex (G4) DNA structures are linked to key biological processes and human diseases. Small molecules that target specific G4 DNA structures and signal their presence would therefore be of great value as chemical research tools with potential to further advance towards diagnostic and therapeutic developments. Ho...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkz1205
更新日期:2020-02-20 00:00:00
abstract::Substantial amounts of DNA (at least 25 microgram per cm2) can be stably bound to diazobenzyloxymethyl (DBM)-paper. Complementary RNA will hybridize to the DNA paper almost completely in 24 hours. Using several different conditions of hybridization and washing, the background of RNA bound non-specifically is very low ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/6.1.195
更新日期:1979-01-01 00:00:00
abstract::Sequence heterogeneity is a common characteristic of RNA viruses that is often referred to as sub-populations or quasispecies. Traditional techniques used for assembly of short sequence reads produced by deep sequencing, such as de-novo assemblers, ignore the underlying diversity. Here, we introduce a novel algorithm ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkx755
更新日期:2017-11-02 00:00:00
abstract::Eukaryotic High-Mobility Group B (HMGB) proteins alter DNA elasticity while facilitating transcription, replication and DNA repair. We developed a new single-molecule method to probe non-specific DNA interactions for two HMGB homologs: the human HMGB2 box A domain and yeast Nhp6Ap, along with chimeric mutants replacin...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gks1031
更新日期:2013-01-07 00:00:00
abstract::In Xenopus oocytes in vitro transcribed mouse U7 RNA is assembled into small nuclear ribonucleoproteins (snRNPs) that are functional in histone RNA 3' processing. If the special Sm binding site of U7 (AAUUUGUCUAG, U7 Sm WT) is converted into the canonical Sm sequence derived from the major snRNAs (AAUUUUUGGAG, U7 Sm O...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/23.16.3141
更新日期:1995-08-25 00:00:00
abstract::The integrases are a diverse family of tyrosine recombinases which rearrange DNA duplexes by means of conservative site-specific recombination reactions. Members of this family, of which the well-studied lambda Int protein is the prototype, were previously found to share four strongly conserved residues, including an ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/25.18.3605
更新日期:1997-09-15 00:00:00
abstract::A refined map for the linear arrangement of histones along DNA in nucleosomal core particles has been determined by DNA-protein crosslinking. On one strand of 145-bp core DNA, histones are aligned in the following order: (5') H2B25,35-H455,65-H375,85,95/H488-H2B105,11 5-H2A118-H3135,145/H2A145 (3') (the subscripts giv...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/13.10.3439
更新日期:1985-05-24 00:00:00
abstract::In some preparations of DNA dependent RNA polymerase a new enzymatic activity has been found which catalyzes the condensation of two pyrophosphate molecules, liberated in the process of RNA synthesis, to one molecule of orthophosphate and one molecule of Mg (or Mn) - chelate complex with trimetaphosphate. This activit...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/6.4.1521
更新日期:1979-04-01 00:00:00
abstract::A vast amount of research on the regulation of gene expression has relied on plasmid reporter assays. In this study, we show that plasmids widely used for this purpose constitutively produce substantial amounts of RNA from a TATA-containing cryptic promoter within the origin of replication. Readthrough of these RNAs i...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gks451
更新日期:2012-08-01 00:00:00
abstract::During the formation of a transcriptionally active macronucleus, ciliated protozoa excise large numbers of interstitial segments of DNA (internal eliminated sequences; IESs) from their chromosomes. In this study we analyze the published sequences of 20 IESs that interrupt surface protein genes of Paramecium and identi...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/23.11.2006
更新日期:1995-06-11 00:00:00
abstract::The mitochondrial DNA of Neurospora crassa has a heterogeneous intramolecular base distribution. A contiguous piece, representing at least 30% of the total genome, has a G+C content that is 6% lower than the overall G+C content of the DNA. The genes for both ribosomal RNAs are contained in the remaining, relatively G+...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/4.1.129
更新日期:1977-01-01 00:00:00
abstract::Ribosomal protein (RP) genes must be coordinately expressed for proper assembly of the ribosome yet the mechanisms that control expression of RP genes in metazoans are poorly understood. Recently, TATA-binding protein-related factor 2 (TRF2) rather than the TATA-binding protein (TBP) was found to function in transcrip...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkx676
更新日期:2017-10-13 00:00:00
abstract::A rearrangement of the mitochondrial genome of a long lived mutant of Podospora anserina is presented. It consists in the insertion of 191 bp of the LrDNA gene (coding for the large ribosomal RNA) at the junction between exon1 and intron alpha of gene co1 (coding for subunit 1 of cytochrome oxidase). This insertion is...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/18.4.779
更新日期:1990-02-25 00:00:00
abstract::We have previously isolated two different aFGF cDNA clones from kidney and brain. The two corresponding mRNA, designated aFGF 1.A and 1.B, are the predominant species in kidney and brain, respectively. During the characterization of aFGF mRNA in glioblastoma cells, we demonstrated that aFGF mRNA in U1242MG and D65MG g...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/21.3.489
更新日期:1993-02-11 00:00:00
abstract::A synthetic gene for somatomedin C, a human growth factor, has been assembled by a single ligation of 23 oligodeoxyribonucleotides, which were chemically synthesized by an improved solid phase phosphotriester method. ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/13.8.2959
更新日期:1985-04-25 00:00:00
abstract::We present a multi-laboratory effort to describe the structural and dynamical properties of duplex B-DNA under physiological conditions. By processing a large amount of atomistic molecular dynamics simulations, we determine the sequence-dependent structural properties of DNA as expressed in the equilibrium distributio...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkz905
更新日期:2019-12-02 00:00:00
abstract::More and more evidences demonstrate that the long non-coding RNAs (lncRNAs) play many key roles in diverse biological processes. There is a critical need to annotate the functions of increasing available lncRNAs. In this article, we try to apply a global network-based strategy to tackle this issue for the first time. ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gks967
更新日期:2013-01-01 00:00:00
abstract::We describe the synthesis of 2'-O-methyl, 2'-O-ethyl oligoribonucleotides and phosphorothioate oligodeoxyribonucleotides and demonstrate their utility as inhibitors of the in vitro U7 snRNP-dependent mRNA processing event. These 2'-O-modified compounds were designed to possess the binding affinity of an RNA molecule t...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/19.10.2629
更新日期:1991-05-25 00:00:00
abstract::RNA can directly bind to purine-rich DNA via Hoogsteen base pairing, forming a DNA:RNA triple helical structure that anchors the RNA to specific sequences and allows guiding of transcription regulators to distinct genomic loci. To unravel the prevalence of DNA:RNA triplexes in living cells, we have established a fast ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gky1305
更新日期:2019-03-18 00:00:00
abstract::The hypothesis that DNA strands complementary to the coding strand contain in phase coding sequences has been investigated. Statistical analysis of the 50 genes of bacteriophage T7 shows no significant correlation between patterns of codon usage on the coding and non-coding strands. In Bacillus and yeast genes the cor...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/13.4.1389
更新日期:1985-02-25 00:00:00
abstract::Population-scale sequencing is increasingly uncovering large numbers of rare single-nucleotide variants (SNVs) in coding regions of the genome. The rarity of these variants makes it challenging to evaluate their deleteriousness with conventional phenotype-genotype associations. Protein structures provide a way of addr...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkw927
更新日期:2016-12-01 00:00:00
abstract::Spermine-DNA complexes have been examined by small-angle and wide-angle X-ray scattering as well as by circular dichroism studies. Condensed complexes are building up below a critical ionic strength. We have found that at one and the same ionic strength condensed complexes having two different supramolecular structure...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/7.5.1297
更新日期:1979-11-10 00:00:00
abstract::The hammerhead self-cleaving motif occurs in a variety of RNAs that infect plants and consists of three non-conserved helices connected by a highly conserved central core. A variant hammerhead, called the extended hammerhead, is found in satellite 2 transcripts from a variety of caudate amphibians. The extended hammer...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/25.11.2189
更新日期:1997-06-01 00:00:00
abstract::Gene expression patterns have been demonstrated to be highly variable between similar cell types, for example lab strains and wild strains of Saccharomyces cerevisiae cultured under identical growth conditions exhibit a wide range of expression differences. We have used a genome-wide approach to characterize transcrip...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkj517
更新日期:2006-02-21 00:00:00
abstract::Cis-elements (-933 to -641) upstream of the human M creatine kinase gene cap site contain an enhancer that confers developmental and tissue-specific expression to the chloramphenicol acetyltransferase gene in C2C12 myogenic cells transfected in culture. Division of the enhancer at -770 into a 5' fragment that includes...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/20.9.2313
更新日期:1992-05-11 00:00:00