Comparison of five methods for finding conserved sequences in multiple alignments of gene regulatory regions.

Abstract:

:Conserved segments in DNA or protein sequences are strong candidates for functional elements and thus appropriate methods for computing them need to be developed and compared. We describe five methods and computer programs for finding highly conserved blocks within previously computed multiple alignments, primarily for DNA sequences. Two of the methods are already in common use; these are based on good column agreement and high information content. Three additional methods find blocks with minimal evolutionary change, blocks that differ in at most k positions per row from a known center sequence and blocks that differ in at most k positions per row from a center sequence that is unknown a priori. The center sequence in the latter two methods is a way to model potential binding sites for known or unknown proteins in DNA sequences. The efficacy of each method was evaluated by analysis of three extensively analyzed regulatory regions in mammalian beta-globin gene clusters and the control region of bacterial arabinose operons. Although all five methods have quite different theoretical underpinnings, they produce rather similar results on these data sets when their parameters are adjusted to best approximate the experimental data. The optimal parameters for the method based on information content varied little for different regulatory regions of the beta-globin gene cluster and hence may be extrapolated to many other regulatory regions. The programs based on maximum allowed mismatches per row have simple parameters whose values can be chosen a priori and thus they may be more useful than the other methods when calibration against known functional sites is not available.

journal_name

Nucleic Acids Res

journal_title

Nucleic acids research

authors

Stojanovic N,Florea L,Riemer C,Gumucio D,Slightom J,Goodman M,Miller W,Hardison R

doi

10.1093/nar/27.19.3899

keywords:

subject

Has Abstract

pub_date

1999-10-01 00:00:00

pages

3899-910

issue

19

eissn

0305-1048

issn

1362-4962

pii

gkc580

journal_volume

27

pub_type

杂志文章
  • 'Solo' large terminal repeats (LTR) of an endogenous retrovirus-like gene family (VL30) in the mouse genome.

    abstract::VL30 genetic elements constitute a murine multicopy gene family that is retrovirus-like, despite the lack of sequence homology with any known retrovirus. Over one hundred copies of VL30 units are dispersed throughout the mouse genome. We report here that the mouse genome also contains 'solo' VL30 long terminal repeats...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/12.5.2273

    authors: Rotman G,Itin A,Keshet E

    更新日期:1984-03-12 00:00:00

  • Role of PCNA-dependent stimulation of 3'-phosphodiesterase and 3'-5' exonuclease activities of human Ape2 in repair of oxidative DNA damage.

    abstract::Human Ape2 protein has 3' phosphodiesterase activity for processing 3'-damaged DNA termini, 3'-5' exonuclease activity that supports removal of mismatched nucleotides from the 3'-end of DNA, and a somewhat weak AP-endonuclease activity. However, very little is known about the role of Ape2 in DNA repair processes. Here...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkp357

    authors: Burkovics P,Hajdú I,Szukacsov V,Unk I,Haracska L

    更新日期:2009-07-01 00:00:00

  • The antibiotic Furvina® targets the P-site of 30S ribosomal subunits and inhibits translation initiation displaying start codon bias.

    abstract::Furvina®, also denominated G1 (MW 297), is a synthetic nitrovinylfuran [2-bromo-5-(2-bromo-2-nitrovinyl)-furan] antibiotic with a broad antimicrobial spectrum. An ointment (Dermofural®) containing G1 as the only active principle is currently marketed in Cuba and successfully used to treat dermatological infections. He...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gks822

    authors: Fabbretti A,Brandi L,Petrelli D,Pon CL,Castañedo NR,Medina R,Gualerzi CO

    更新日期:2012-11-01 00:00:00

  • Isolation and developmental expression of a rat cDNA encoding a cysteine-rich zinc finger protein.

    abstract::A number of cysteine-rich proteins have recently been isolated by homology screening, differential library screens, and association with other proteins. In this report, we describe the isolation of the rat cysteine-rich protein from a rat brain library during a search for clones with homology to the delta-opioid recep...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/22.24.5477

    authors: McLaughlin CR,Tao Q,Abood ME

    更新日期:1994-12-11 00:00:00

  • A dnaB-like protein of Pseudomonas aeruginosa.

    abstract::A dnaB-like protein from P. aeruginosa was purified to near homogeneity using as an assay the immunoprecipitation by E. coli dnaB antiserum in a solid-phase. In the chromatographic characteristics including the affinity to immobilized ATP the dnaB-like protein of P. aeruginosa is similar to the dnaB protein of E. coli...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/15.2.385

    authors: Dreiseikelmann B,Riedel HD,Schuster H

    更新日期:1987-01-26 00:00:00

  • RPI-Pred: predicting ncRNA-protein interaction using sequence and structural information.

    abstract::RNA-protein complexes are essential in mediating important fundamental cellular processes, such as transport and localization. In particular, ncRNA-protein interactions play an important role in post-transcriptional gene regulation like mRNA localization, mRNA stabilization, poly-adenylation, splicing and translation....

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkv020

    authors: Suresh V,Liu L,Adjeroh D,Zhou X

    更新日期:2015-02-18 00:00:00

  • Photosnesitization of DNA by gold.

    abstract::Au (III) reacts with DNA at pH 5.6 to form a complex which is sensitive to mid-UV radiation. Cyclobutane pyrimidine dimers are produced at some 15 to 30 times the rate that they are in untreated DNA. The mechanism of photosensitization appears to involve energy absorption by Au-urine and Au-cytosine adducts which can ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/5.10.3731

    authors: Wilkins RJ

    更新日期:1978-10-01 00:00:00

  • Redesign of the monomer-monomer interface of Cre recombinase yields an obligate heterotetrameric complex.

    abstract::Cre recombinase catalyzes the cleavage and religation of DNA at loxP sites. The enzyme is a homotetramer in its functional state, and the symmetry of the protein complex enforces a pseudo-palindromic symmetry upon the loxP sequence. The Cre-lox system is a powerful tool for many researchers. However, broader applicati...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkv901

    authors: Zhang C,Myers CA,Qi Z,Mitra RD,Corbo JC,Havranek JJ

    更新日期:2015-10-15 00:00:00

  • Binding site density enables paralog-specific activity of SLM2 and Sam68 proteins in Neurexin2 AS4 splicing control.

    abstract::SLM2 and Sam68 are splicing regulator paralogs that usually overlap in function, yet only SLM2 and not Sam68 controls the Neurexin2 AS4 exon important for brain function. Herein we find that SLM2 and Sam68 similarly bind to Neurexin2 pre-mRNA, both within the mouse cortex and in vitro. Protein domain-swap experiments ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkw1277

    authors: Danilenko M,Dalgliesh C,Pagliarini V,Naro C,Ehrmann I,Feracci M,Kheirollahi-Chadegani M,Tyson-Capper A,Clowry GJ,Fort P,Dominguez C,Sette C,Elliott DJ

    更新日期:2017-04-20 00:00:00

  • Vitamin D receptor contains multiple dimerization interfaces that are functionally different.

    abstract::The vitamin D receptor mediates the signal of 1 alpha, 25-dihydroxyvitamin D3 by binding to vitamin D responsive elements in DNA as a homodimer or as a heterodimer composed of one vitamin D receptor subunit and one retinoid X receptor subunit. We have mapped the dimerization interfaces of the vitamin D receptor that i...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/23.4.606

    authors: Nishikawa J,Kitaura M,Imagawa M,Nishihara T

    更新日期:1995-02-25 00:00:00

  • Signatures of accelerated somatic evolution in gene promoters in multiple cancer types.

    abstract::Cancer-associated somatic mutations outside protein-coding regions remain largely unexplored. Analyses of the TERT locus have indicated that non-coding regulatory mutations can be more frequent than previously suspected and play important roles in oncogenesis. Using a computational method called SASE-hunter, developed...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkv419

    authors: Smith KS,Yadav VK,Pedersen BS,Shaknovich R,Geraci MW,Pollard KS,De S

    更新日期:2015-06-23 00:00:00

  • DotKnot: pseudoknot prediction using the probability dot plot under a refined energy model.

    abstract::RNA pseudoknots are functional structure elements with key roles in viral and cellular processes. Prediction of a pseudoknotted minimum free energy structure is an NP-complete problem. Practical algorithms for RNA structure prediction including restricted classes of pseudoknots suffer from high runtime and poor accura...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkq021

    authors: Sperschneider J,Datta A

    更新日期:2010-04-01 00:00:00

  • A non-canonical multisubunit RNA polymerase encoded by the AR9 phage recognizes the template strand of its uracil-containing promoters.

    abstract::AR9 is a giant Bacillus subtilis phage whose uracil-containing double-stranded DNA genome encodes distant homologs of β and β' subunits of bacterial RNA polymerase (RNAP). The products of these genes are thought to assemble into two non-canonical multisubunit RNAPs - a virion RNAP (vRNAP) that is injected into the hos...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkx264

    authors: Sokolova M,Borukhov S,Lavysh D,Artamonova T,Khodorkovskii M,Severinov K

    更新日期:2017-06-02 00:00:00

  • Site-specific mutagenesis by triple helix-forming oligonucleotides containing a reactive nucleoside analog.

    abstract::The specific recognition of homopurine-homo pyrimidine regions in duplex DNA by triplex-forming oligonucleotides (TFOs) provides an attractive strategy for genetic manipulation. Alkylation of nucleobases with functionalized TFOs would have the potential for site-directed mutagenesis. Recently, we demonstrated that a T...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gng031

    authors: Nagatsugi F,Sasaki S,Miller PS,Seidman MM

    更新日期:2003-03-15 00:00:00

  • DNA polymerase δ stalls on telomeric lagging strand templates independently from G-quadruplex formation.

    abstract::Previous evidence indicates that telomeres resemble common fragile sites and present a challenge for DNA replication. The precise impediments to replication fork progression at telomeric TTAGGG repeats are unknown, but are proposed to include G-quadruplexes (G4) on the G-rich strand. Here we examined DNA synthesis and...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkt813

    authors: Lormand JD,Buncher N,Murphy CT,Kaur P,Lee MY,Burgers P,Wang H,Kunkel TA,Opresko PL

    更新日期:2013-12-01 00:00:00

  • Cleavage of the sarcin-ricin loop of 23S rRNA differentially affects EF-G and EF-Tu binding.

    abstract::Ribotoxins are potent inhibitors of protein biosynthesis and inactivate ribosomes from a variety of organisms. The ribotoxin alpha-sarcin cleaves the large 23S ribosomal RNA (rRNA) at the universally conserved sarcin-ricin loop (SRL) leading to complete inactivation of the ribosome and cellular death. The SRL interact...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkq151

    authors: García-Ortega L,Alvarez-García E,Gavilanes JG,Martínez-del-Pozo A,Joseph S

    更新日期:2010-07-01 00:00:00

  • Genome-wide de novo prediction of cis-regulatory binding sites in prokaryotes.

    abstract::Although cis-regulatory binding sites (CRBSs) are at least as important as the coding sequences in a genome, our general understanding of them in most sequenced genomes is very limited due to the lack of efficient and accurate experimental and computational methods for their characterization, which has largely hindere...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkp248

    authors: Zhang S,Xu M,Li S,Su Z

    更新日期:2009-06-01 00:00:00

  • Identification and characterization of CRT10 as a novel regulator of Saccharomyces cerevisiae ribonucleotide reductase genes.

    abstract::The CRT10 gene was identified through screening of the Saccharomyces cerevisiae deletion library for hydroxyurea (HU) resistance. CRT10 encodes a putative 957 amino acid, 110 kDa protein with a leucine repeat and a WD40 repeat near the N-terminus. Deletion of CRT10 resulted in an enhanced resistance to HU reminiscent ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkl100

    authors: Fu Y,Xiao W

    更新日期:2006-04-05 00:00:00

  • The GAA*TTC triplet repeat expanded in Friedreich's ataxia impedes transcription elongation by T7 RNA polymerase in a length and supercoil dependent manner.

    abstract::Large expansions of the trinucleotide repeat GAA*TTC within the first intron of the X25 (frataxin) gene cause Friedreich's ataxia, the most common inherited ataxia. Expansion leads to reduced levels of frataxin mRNA in affected individuals. Here we show that GAA*TTC tracts, in the absence of any other frataxin gene se...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/28.14.2815

    authors: Grabczyk E,Usdin K

    更新日期:2000-07-15 00:00:00

  • IndelFR: a database of indels in protein structures and their flanking regions.

    abstract::Insertion/deletion (indel) is one of the most common methods of protein sequence variation. Recent studies showed that indels could affect their flanking regions and they are important for protein function and evolution. Here, we describe the Indel Flanking Region Database (IndelFR, http://indel.bioinfo.sdu.edu.cn), w...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkr1107

    authors: Zhang Z,Xing C,Wang L,Gong B,Liu H

    更新日期:2012-01-01 00:00:00

  • BRENDA in 2015: exciting developments in its 25th year of existence.

    abstract::The BRENDA enzyme information system (http://www.brenda-enzymes.org/) has developed into an elaborate system of enzyme and enzyme-ligand information obtained from different sources, combined with flexible query systems and evaluation tools. The information is obtained by manual extraction from primary literature, text...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gku1068

    authors: Chang A,Schomburg I,Placzek S,Jeske L,Ulbrich M,Xiao M,Sensen CW,Schomburg D

    更新日期:2015-01-01 00:00:00

  • Analysis of pooled DNA samples on high density arrays without prior knowledge of differential hybridization rates.

    abstract::Array based DNA pooling techniques facilitate genome-wide scale genotyping of large samples. We describe a structured analysis method for pooled data using internal replication information in large scale genotyping sets. The method takes advantage of information from single nucleotide polymorphisms (SNPs) typed in par...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkl136

    authors: Macgregor S,Visscher PM,Montgomery G

    更新日期:2006-04-20 00:00:00

  • The PARP promoter of Trypanosoma brucei is developmentally regulated in a chromosomal context.

    abstract::African trypanosomes are extracellular protozoan parasites that are transmitted from one mammalian host to the next by tsetse flies. Bloodstream forms express variant surface glycoprotein (VSG); the tsetse fly (procyclic) forms express instead the procyclic acidic repetitive protein (PARP). PARP mRNA is abundant in pr...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/24.7.1202

    authors: Biebinger S,Rettenmaier S,Flaspohler J,Hartmann C,Peña-Diaz J,Wirtz LE,Hotz HR,Barry JD,Clayton C

    更新日期:1996-04-01 00:00:00

  • Cellular senescence mediated by p16INK4A-coupled miRNA pathways.

    abstract::p16 is a key regulator of cellular senescence, yet the drivers of this stable state of proliferative arrest are not well understood. Here, we identify 22 senescence-associated microRNAs (SA-miRNAs) in normal human mammary epithelial cells. We show that SA-miRNAs-26b, 181a, 210 and 424 function in concert to directly r...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkt1096

    authors: Overhoff MG,Garbe JC,Koh J,Stampfer MR,Beach DH,Bishop CL

    更新日期:2014-02-01 00:00:00

  • The Universal Protein Resource (UniProt).

    abstract::The ability to store and interconnect all available information on proteins is crucial to modern biological research. Accordingly, the Universal Protein Resource (UniProt) plays an increasingly important role by providing a stable, comprehensive, freely accessible central resource on protein sequences and functional a...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkl929

    authors: UniProt Consortium.

    更新日期:2007-01-01 00:00:00

  • In vitro construction of deletion mutants of the bacteriocinogenic plasmid Clo DF13.

    abstract::The isolation and characterization of deletion mutants of the bacteriocinogenic plasmid Clo DF13 is described. To construct these deletion mutants, DNA of Clo DF13::Tn901 and Clo DF13-rep3::Tn901 plasmids was digested with restriction endonucleases, ligated with T4 ligase and introduced by transformation into Escheric...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/5.6.1801

    authors: Stuitje AR,Veltkamp E,van den Elzen PJ,Nijkamp HJ

    更新日期:1978-06-01 00:00:00

  • Conserved elements with potential to form polymorphic G-quadruplex structures in the first intron of human genes.

    abstract::To understand how potential for G-quadruplex formation might influence regulation of gene expression, we examined the 2 kb spanning the transcription start sites (TSS) of the 18 217 human RefSeq genes, distinguishing contributions of template and nontemplate strands. Regions both upstream and downstream of the TSS are...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkm1138

    authors: Eddy J,Maizels N

    更新日期:2008-03-01 00:00:00

  • Molecular dissection of the domain architecture and catalytic activities of human PrimPol.

    abstract::PrimPol is a primase-polymerase involved in nuclear and mitochondrial DNA replication in eukaryotic cells. Although PrimPol is predicted to possess an archaeo-eukaryotic primase and a UL52-like zinc finger domain, the role of these domains has not been established. Here, we report that the proposed zinc finger domain ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gku214

    authors: Keen BA,Jozwiakowski SK,Bailey LJ,Bianchi J,Doherty AJ

    更新日期:2014-05-01 00:00:00

  • Non-specific binding of Na+ and Mg2+ to RNA determined by force spectroscopy methods.

    abstract::RNA duplex stability depends strongly on ionic conditions, and inside cells RNAs are exposed to both monovalent and multivalent ions. Despite recent advances, we do not have general methods to quantitatively account for the effects of monovalent and multivalent ions on RNA stability, and the thermodynamic parameters f...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gks289

    authors: Bizarro CV,Alemany A,Ritort F

    更新日期:2012-08-01 00:00:00

  • Evolutionary Conserved Motif Finder (ECMFinder) for genome-wide identification of clustered YY1- and CTCF-binding sites.

    abstract::We have developed a new bioinformatics approach called ECMFinder (Evolutionary Conserved Motif Finder). This program searches for a given DNA motif within the entire genome of one species and uses the gene association information of a potential transcription factor-binding site (TFBS) to screen the homologous regions ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkp077

    authors: Kang K,Chung JH,Kim J

    更新日期:2009-04-01 00:00:00