Correlation between sequence conservation and the genomic context after gene duplication.

Abstract:

A key complication in comparative genomics for reliable gene function prediction is the existence of duplicated genes. To study the effect of gene duplication on function prediction, we analyze orthologs between pairs of genomes where in one genome the orthologous gene has duplicated after the speciation of the two genomes (i.e. inparalogs). For these duplicated genes we investigate whether the gene that is most similar on the sequence level is also the gene that has retained the ancestral gene-neighborhood. Although the majority of investigated cases show a consistent pattern between sequence similarity and gene-neighborhood conservation, a substantial fraction, 29-38%, is inconsistent. The observation of inconsistency is not the result of a chance outcome owing to a lack of divergence time between inparalogs, but rather it seems to be the result of a chance outcome caused by very similar rates of sequence evolution of both inparalogs relative to their ortholog. If one-to-one orthologous relationships are required, it is advisable to combine contextual information (i.e. gene-neighborhood in prokaryotes and co-expression in eukaryotes) with protein sequence information to predict the most probable functional equivalent ortholog in the presence of inparalogs.

journal_name

Nucleic Acids Res

journal_title

Nucleic acids research

authors

Notebaart RA,Huynen MA,Teusink B,Siezen RJ,Snel B

doi

10.1093/nar/gki913

pub_date

2005-10-27 00:00:00

pages

6164-71

issue

19

eissn

1362-4962

issn

0305-1048

pii

33/19/6164

journal_volume

33

pub_type

Journal Article
  • Nucleotide sequence of pS194, a streptomycin-resistance plasmid from Staphylococcus aureus.

    abstract::pS194 is a naturally occurring Staphylococcus aureus plasmid encoding streptomycin resistance. The plasmid has a copy number of about 25 per cell, and belongs to the inc5 incompatibility group. The nucleotide sequence of pS194 has been determined and consists of 4397 base pairs including four open reading frames poten...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/16.5.2179

    authors: Projan SJ,Moghazeh S,Novick RP

    update_date:1988-03-25 00:00:00

  • The Kluyveromyces gene encoding the general transcription factor IIB: structural analysis and expression in Saccharomyces cerevisiae.

    abstract::The Kluyveromyces lactis gene encoding the general transcription factor IIB (TFIIB) was isolated from a genomic library by complementation of the cold-sensitive phenotype conferred by a mutation in the SUA7 gene, which encodes TFIIB in Saccharomyces cerevisiae. DNA sequence analysis of the KI-SUA7 gene revealed a 357 ...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/21.15.3413

    authors: Na JG,Hampsey M

    update_date:1993-07-25 00:00:00

  • Building promoter aware transcriptional regulatory networks using siRNA perturbation and deepCAGE.

    abstract::Perturbation and time-course data sets, in combination with computational approaches, can be used to infer transcriptional regulatory networks which ultimately govern the developmental pathways and responses of cells. Here, we individually knocked down the four transcription factors PU.1, IRF8, MYB and SP1 in the huma...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkq729

    authors: Vitezic M,Lassmann T,Forrest AR,Suzuki M,Tomaru Y,Kawai J,Carninci P,Suzuki H,Hayashizaki Y,Daub CO

    update_date:2010-12-01 00:00:00

  • Polynucleotides. XXIV. Synthesis and properties of a dinucleoside monophosphate derived from uridine 6,2'-cyclonucleoside.

    abstract::A dinucleoside monophosphate, 6,2'-anhydro-6-oxy-1-beta-D-arabinofuranosyluracil-phosphoryl- (3'-5')-6,2'-anhydro-6-oxy-1-beta-D-arabinofuranosyluracil (I) was synthesized by the condensation reaction using DCC from 5'-monomethoxytrityl derivative(VII) and 3'-acetyl-5'-phosphate(X) of the monomer units. Yield was ca. ...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/1.3.479

    authors: Ikehara M,Tezuka T

    update_date:1974-03-01 00:00:00

  • Distinct regions of RPB11 are required for heterodimerization with RPB3 in human and yeast RNA polymerase II.

    abstract::In Saccharomyces cerevisiae, RNA polymerase II assembly is probably initiated by the formation of the RPB3-RPB11 heterodimer. RPB3 is encoded by a single copy gene in the yeast, mouse and human genomes. The RPB11 gene is also unique in yeast and mouse, but in humans a gene family has been identified that potentially e...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gki672

    authors: Benga WJ,Grandemange S,Shpakovski GV,Shematorova EK,Kedinger C,Vigneron M

    update_date:2005-06-24 00:00:00

  • A Thermus phage protein inhibits host RNA polymerase by preventing template DNA strand loading during open promoter complex formation.

    abstract::RNA polymerase (RNAP) is a major target of gene regulation. Thermus thermophilus bacteriophage P23-45 encodes two RNAP binding proteins, gp39 and gp76, which shut off host gene transcription while allowing orderly transcription of phage genes. We previously reported the structure of the T. thermophilus RNAP•σA holoenz...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkx1162

    authors: Ooi WY,Murayama Y,Mekler V,Minakhin L,Severinov K,Yokoyama S,Sekine SI

    update_date:2018-01-09 00:00:00

  • Gene-specific mutagenesis enables rapid continuous evolution of enzymes in vivo.

    abstract::Various in vivo mutagenesis methods have been developed to facilitate fast and efficient continuous evolution of proteins in cells. However, they either modify the DNA region that does not match the target gene, or suffer from low mutation rates. Here, we report a mutator, eMutaT7 (enhanced MutaT7), with very fast in ...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkaa1231

    authors: Park H,Kim S

    update_date:2021-01-06 00:00:00

  • High-throughput single-molecule mapping links subtelomeric variants and long-range haplotypes with specific telomeres.

    abstract::Accurate maps and DNA sequences for human subtelomere regions, along with detailed knowledge of subtelomere variation and long-range telomere-terminal haplotypes in individuals, are critical for understanding telomere function and its roles in human biology. Here, we use a highly automated whole genome mapping technol...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkx017

    authors: Young E,Pastor S,Rajagopalan R,McCaffrey J,Sibert J,Mak ACY,Kwok PY,Riethman H,Xiao M

    update_date:2017-05-19 00:00:00

  • Tissue-specific and imprinted epigenetic modifications of the human NDN gene.

    abstract::Allele-specific DNA methylation, histone acetylation and histone methylation are recognized as epigenetic characteristics of imprinted genes and imprinting centers (ICs). These epigenetic modifications are also used to regulate tissue-specific gene expression. Epigenetic differences between alleles can be significant ...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkh671

    authors: Lau JC,Hanel ML,Wevrick R

    update_date:2004-06-24 00:00:00

  • Diverse transcription influences can be insulated by the Drosophila SF1 chromatin boundary.

    abstract::Chromatin boundaries regulate gene expression by modulating enhancer-promoter interactions and insulating transcriptional influences from organized chromatin. However, mechanistic distinctions between these two aspects of boundary function are not well understood. Here we show that SF1, a chromatin boundary located in...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkp362

    authors: Majumder P,Roy S,Belozerov VE,Bosu D,Puppali M,Cai HN

    update_date:2009-07-01 00:00:00

  • The MIGenAS integrated bioinformatics toolkit for web-based sequence analysis.

    abstract::We describe a versatile and extensible integrated bioinformatics toolkit for the analysis of biological sequences over the Internet. The web portal offers convenient interactive access to a growing pool of chainable bioinformatics software tools and databases that are centrally installed and maintained by the RZG. Cur...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkl254

    authors: Rampp M,Soddemann T,Lederer H

    update_date:2006-07-01 00:00:00

  • The imprinted gene and parent-of-origin effect database.

    abstract::The database of imprinted genes and parent-of-origin effects in animals (http://www.otago.ac.nz/IGC ) is a collation of genes and phenotypes for which parent-of-origin effects have been reported. The database currently includes over 220 entries, which describe over 40 imprinted genes in human, mouse and other animals....

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/29.1.275

    authors: Morison IM,Paton CJ,Cleverley SD

    update_date:2001-01-01 00:00:00

  • Anti-HIV-1 activity of anti-TAR polyamide nucleic acid conjugated with various membrane transducing peptides.

    abstract::The transactivator responsive region (TAR) present in the 5'-NTR of the HIV-1 genome represents a potential target for antiretroviral intervention and a model system for the development of specific inhibitors of RNA-protein interaction. Earlier, we have shown that an anti-TAR polyamide nucleotide analog (PNA(TAR)) con...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gki743

    authors: Tripathi S,Chaubey B,Ganguly S,Harris D,Casale RA,Pandey VN

    update_date:2005-08-02 00:00:00

  • NRPSpredictor2--a web server for predicting NRPS adenylation domain specificity.

    abstract::The products of many bacterial non-ribosomal peptide synthetases (NRPS) are highly important secondary metabolites, including vancomycin and other antibiotics. The ability to predict substrate specificity of newly detected NRPS Adenylation (A-) domains by genome sequencing efforts is of great importance to identify an...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkr323

    authors: Röttig M,Medema MH,Blin K,Weber T,Rausch C,Kohlbacher O

    update_date:2011-07-01 00:00:00

  • Bacteriophage T4 regA protein binds to the Shine-Dalgarno region of gene 44 mRNA.

    abstract::We have overproduced and purified wild type regA protein, a translational repressor encoded by bacteriophage T4. The repressor activity of the cloned regA protein has been tested on four known regA target genes (T4 genes: 44, 45, rpbA and regA) using in vitro coupled transcription-translation reactions. We have demons...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/17.23.10047

    authors: Webster KR,Adari HY,Spicer EK

    update_date:1989-12-11 00:00:00

  • MolliGen, a database dedicated to the comparative genomics of Mollicutes.

    abstract::Bacteria belonging to the class Mollicutes were among the first ones to be selected for complete genome sequencing because of the minimal size of their genomes and their pathogenicity for humans and a broad range of animals and plants. At this time six genome sequences have been publicly released (Mycoplasma genitaliu...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkh114

    authors: Barré A,de Daruvar A,Blanchard A

    update_date:2004-01-01 00:00:00

  • AVPpred: collection and prediction of highly effective antiviral peptides.

    abstract::In the battle against viruses, antiviral peptides (AVPs) had demonstrated the immense potential. Presently, more than 15 peptide-based drugs are in various stages of clinical trials. Emerging and re-emerging viruses further emphasize the efforts to accelerate antiviral drug discovery efforts. Despite, huge importance ...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gks450

    authors: Thakur N,Qureshi A,Kumar M

    update_date:2012-07-01 00:00:00

  • Quantitation of supercoiled circular content in plasmid DNA solutions using a fluorescence-based method.

    abstract::A method for quantifying the proportion of supercoiled circular (SC) forms in DNA solutions is described. The method (SCFluo) takes advantage of the reversible denaturation property of SC forms and the high specificity of the PicoGreen fluorochrome for double-stranded (ds)DNA. Fluorescence values of forms capable of r...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/28.12.e57

    authors: Levy MS,Lotfian P,O'Kennedy R,Lo-Yim MY,Shamlou PA

    update_date:2000-06-15 00:00:00

  • jpHMM: improving the reliability of recombination prediction in HIV-1.

    abstract::Previously, we developed jumping profile hidden Markov model (jpHMM), a new method to detect recombinations in HIV-1 genomes. The jpHMM predicts recombination breakpoints in a query sequence and assigns to each position of the sequence one of the major HIV-1 subtypes. Since incorrect subtype assignment or recombinatio...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkp371

    authors: Schultz AK,Zhang M,Bulla I,Leitner T,Korber B,Morgenstern B,Stanke M

    update_date:2009-07-01 00:00:00

  • WorfDB: the Caenorhabditis elegans ORFeome Database.

    abstract::WorfDB (Worm ORFeome DataBase; http://worfdb.dfci.harvard.edu) was created to integrate and disseminate the data from the cloning of complete set of approximately 19 000 predicted protein-encoding Open Reading Frames (ORFs) of Caenorhabditis elegans (also referred to as the 'worm ORFeome'). WorfDB serves as a central ...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkg092

    authors: Vaglio P,Lamesch P,Reboul J,Rual JF,Martinez M,Hill D,Vidal M

    update_date:2003-01-01 00:00:00

  • Incorporation of 2'-amido-nucleosides in oligodeoxynucleotides and oligoribonucleotides as a model for 2'-linked conjugates.

    abstract::The functionalisation of oligodeoxynucleotides and oligoribonucleotides by incorporation of 2'-amido-2'-deoxyribonucleosides, possibly containing a reporter group via the 2'-amido bond, was examined. Therefore 2'-acetamido-ribonucleosides containing a small methyl group at the 2'-amido bond were synthesized as model c...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/23.1.51

    authors: Hendrix C,Devreese B,Rozenski J,van Aerschot A,De Bruyn A,Van Beeumen J,Herdewijn P

    update_date:1995-01-11 00:00:00

  • High accuracy operon prediction method based on STRING database scores.

    abstract::We present a simple and highly accurate computational method for operon prediction, based on intergenic distances and functional relationships between the protein products of contiguous genes, as defined by STRING database (Jensen,L.J., Kuhn,M., Stark,M., Chaffron,S., Creevey,C., Muller,J., Doerks,T., Julien,P., Roth,...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkq254

    authors: Taboada B,Verde C,Merino E

    update_date:2010-07-01 00:00:00

  • Protein binding sites on Escherichia coli 16S ribosomal RNA; RNA regions that are protected by proteins S7, S9 and S19, and by proteins S8, S15 and S17.

    abstract::Selected groups of isolated 14C-labelled proteins from E. coli 30S ribosomal subunits were reconstituted with 32P-labelled 16S RNA, and the reconstituted complexes were partially digested with ribonuclease A. RNA fragments protected by the proteins were separated by gel electrophoresis and subjected to sequence analys...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/16.4.1233

    authors: Wiener L,Schüler D,Brimacombe R

    update_date:1988-02-25 00:00:00

  • PARN deadenylase is involved in miRNA-dependent degradation of TP53 mRNA in mammalian cells.

    abstract::mRNA deadenylation is under the control of cis-acting regulatory elements, which include AU-rich elements (AREs) and microRNA (miRNA) targeting sites, within the 3' untranslated region (3' UTRs) of eukaryotic mRNAs. Deadenylases promote miRNA-induced mRNA decay through their interaction with miRNA-induced silencing co...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkv959

    authors: Zhang X,Devany E,Murphy MR,Glazman G,Persaud M,Kleiman FE

    update_date:2015-12-15 00:00:00

  • Dissecting the function of the adult β-globin downstream promoter region using an artificial zinc finger DNA-binding domain.

    abstract::Developmental stage-specific expression of the β-type globin genes is regulated by many cis- and trans-acting components. The adult β-globin gene contains an E-box located 60 bp downstream of the transcription start site that has been shown to bind transcription factor upstream stimulatory factor (USF) and to contribu...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gku107

    authors: Barrow JJ,Li Y,Hossain M,Huang S,Bungert J

    update_date:2014-04-01 00:00:00

  • Subtractive hybridization identifies novel differentially expressed ncRNA species in EBV-infected human B cells.

    abstract::Non-protein-coding RNAs (ncRNAs) fulfill a wide range of cellular functions from protein synthesis to regulation of gene expression. Identification of novel regulatory ncRNAs by experimental approaches commonly includes the generation of specialized cDNA libraries encoding small ncRNA species. However, such identifica...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkm244

    authors: Mrázek J,Kreutmayer SB,Grässer FA,Polacek N,Hüttenhofer A

    update_date:2007-01-01 00:00:00

  • Sequences affecting the V(D)J recombinational activity of the IgH intronic enhancer in a transgenic substrate.

    abstract::The immunoglobulin heavy chain intronic transcriptional enhancer (E mu) is part of a complex cis-regulatory DNA region which has notably been shown to modulate V(D)J rearrangements of associated variable gene segments. We have used recombination substrates comprised of the E mu enhancer together with various lengths o...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/22.5.792

    authors: Fernex C,Caillol D,Capone M,Krippl B,Ferrier P

    update_date:1994-03-11 00:00:00

  • Maturation of a hypermodified nucleoside in transfer RNA.

    abstract::E. coli C6 rel- met- cys- was cultured in a fully supplemented medium and in media lacking cysteine or methionine. tRNA isolated from the three cultures contained, respectively, a normal complement of modified nucleosides; a deficiency in thiolated nucleosides and a deficiency in methylated nucleosides. Both sulfur-d...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/2.5.691

    authors: Agris PF,Armstrong DJ,Schäfer KP,Söll D

    update_date:1975-05-01 00:00:00

  • In vivo cleavage rules and target repertoire of RNase III in Escherichia coli.

    abstract::Bacterial RNase III plays important roles in the processing and degradation of RNA transcripts. A major goal is to identify the cleavage targets of this endoribonuclease at a transcriptome-wide scale and delineate its in vivo cleavage rules. Here we applied to Escherichia coli grown to either exponential or stationary...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gky684

    authors: Altuvia Y,Bar A,Reiss N,Karavani E,Argaman L,Margalit H

    update_date:2018-11-02 00:00:00

  • SAHG, a comprehensive database of predicted structures of all human proteins.

    abstract::Most proteins from higher organisms are known to be multi-domain proteins and contain substantial numbers of intrinsically disordered (ID) regions. To analyse such protein sequences, those from human for instance, we developed a special protein-structure-prediction pipeline and accumulated the products in the Structur...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkq1057

    authors: Motono C,Nakata J,Koike R,Shimizu K,Shirota M,Amemiya T,Tomii K,Nagano N,Sakaya N,Misoo K,Sato M,Kidera A,Hiroaki H,Shirai T,Kinoshita K,Noguchi T,Ota M

    update_date:2011-01-01 00:00:00