Vespucci: a system for building annotated databases of nascent transcripts.

Abstract:

:Global run-on sequencing (GRO-seq) is a recent addition to the series of high-throughput sequencing methods that enables new insights into transcriptional dynamics within a cell. However, GRO-sequencing presents new algorithmic challenges, as existing analysis platforms for ChIP-seq and RNA-seq do not address the unique problem of identifying transcriptional units de novo from short reads located all across the genome. Here, we present a novel algorithm for de novo transcript identification from GRO-sequencing data, along with a system that determines transcript regions, stores them in a relational database and associates them with known reference annotations. We use this method to analyze GRO-sequencing data from primary mouse macrophages and derive novel quantitative insights into the extent and characteristics of non-coding transcription in mammalian cells. In doing so, we demonstrate that Vespucci expands existing annotations for mRNAs and lincRNAs by defining the primary transcript beyond the polyadenylation site. In addition, Vespucci generates assemblies for un-annotated non-coding RNAs such as those transcribed from enhancer-like elements. Vespucci thereby provides a robust system for defining, storing and analyzing diverse classes of primary RNA transcripts that are of increasing biological interest.

journal_name

Nucleic Acids Res

journal_title

Nucleic acids research

authors

Allison KA,Kaikkonen MU,Gaasterland T,Glass CK

doi

10.1093/nar/gkt1237

subject

Has Abstract

pub_date

2014-02-01 00:00:00

pages

2433-47

issue

4

eissn

0305-1048

issn

1362-4962

pii

gkt1237

journal_volume

42

pub_type

杂志文章
  • Stable complex formation of CENP-B with the CENP-A nucleosome.

    abstract::CENP-A and CENP-B are major components of centromeric chromatin. CENP-A is the histone H3 variant, which forms the centromere-specific nucleosome. CENP-B specifically binds to the CENP-B box DNA sequence on the centromere-specific repetitive DNA. In the present study, we found that the CENP-A nucleosome more stably re...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkv405

    authors: Fujita R,Otake K,Arimura Y,Horikoshi N,Miya Y,Shiga T,Osakabe A,Tachiwana H,Ohzeki J,Larionov V,Masumoto H,Kurumizaka H

    更新日期:2015-05-26 00:00:00

  • Ultraviolet resonance Raman study of DNA and of its interaction with actinomycin D.

    abstract::The DNA-Actinomycin D interaction has been studied by resonance Raman effect using DNA as chromophore. First, the resonance Raman spectra of DNA obtained with a U.V. excitation at wavelengths of 300 nm and 280 nm are presented. The main Raman hands are assigned to the convenient nucleic bases by comparison with the sp...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/5.8.2969

    authors: Chinsky L,Turpin PY

    更新日期:1978-08-01 00:00:00

  • iGNM 2.0: the Gaussian network model database for biomolecular structural dynamics.

    abstract::Gaussian network model (GNM) is a simple yet powerful model for investigating the dynamics of proteins and their complexes. GNM analysis became a broadly used method for assessing the conformational dynamics of biomolecular structures with the development of a user-friendly interface and database, iGNM, in 2005. We pr...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkv1236

    authors: Li H,Chang YY,Yang LW,Bahar I

    更新日期:2016-01-04 00:00:00

  • Cyclical regulation of the insulin-like growth factor binding protein 3 gene in response to 1alpha,25-dihydroxyvitamin D3.

    abstract::The nuclear receptor vitamin D receptor (VDR) is known to associate with two vitamin D response element (VDRE) containing chromatin regions of the insulin-like growth factor binding protein 3 (IGFBP3) gene. In non-malignant MCF-10A human mammary cells, we show that the natural VDR ligand 1α,25-dihydroxyvitamin D(3) (1...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkq820

    authors: Malinen M,Ryynänen J,Heinäniemi M,Väisänen S,Carlberg C

    更新日期:2011-01-01 00:00:00

  • Capping of vesicular stomatitis virus pre-mRNA is required for accurate selection of transcription stop-start sites and virus propagation.

    abstract::The multifunctional RNA-dependent RNA polymerase L protein of vesicular stomatitis virus catalyzes unconventional pre-mRNA capping via the covalent enzyme-pRNA intermediate formation, which requires the histidine-arginine (HR) motif in the polyribonucleotidyltransferase domain. Here, the effects of cap-defective mutat...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gku901

    authors: Ogino T

    更新日期:2014-10-29 00:00:00

  • Differential transcription of the orphan receptor RORbeta in nuclear extracts derived from Neuro2A and HeLa cells.

    abstract::An important model system for studying the process leading to productive transcription is provided by the superfamily of nuclear receptors, which are for the most part ligand-controlled transcription factors. Over the past years several 'orphan' nuclear receptors have been isolated for which no ligand has yet been ide...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/29.16.3424

    authors: Gawlas K,Stunnenberg HG

    更新日期:2001-08-15 00:00:00

  • AlleleHMM: a data-driven method to identify allele specific differences in distributed functional genomic marks.

    abstract::How DNA sequence variation influences gene expression remains poorly understood. Diploid organisms have two homologous copies of their DNA sequence in the same nucleus, providing a rich source of information about how genetic variation affects a wealth of biochemical processes. However, few computational methods have ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkz176

    authors: Chou SP,Danko CG

    更新日期:2019-06-20 00:00:00

  • Gene F of plasmid RSF1010 codes for a low-molecular-weight repressor protein that autoregulates expression of the repAC operon.

    abstract::The repAC operon of plasmid RSF1010 consists of the genes for proteins E, F, RepA (DNA helicase), and RepC (origin-binding initiator protein) and is transcriptionally initiated by a promoter called P4. We have studied the expression of the repAC operon in vivo by using fusions to the lacZ reporter gene. The results sh...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/18.21.6215

    authors: Maeser S,Scholz P,Otto S,Scherzinger E

    更新日期:1990-11-11 00:00:00

  • Characterization of a novel T lymphocyte protein which binds to a site related to steroid/thyroid hormone receptor response elements in the negative regulatory sequence of the human immunodeficiency virus long terminal repeat.

    abstract::We have previously identified a T lymphocyte protein which binds to a site within the LTR of the human immunodeficiency virus type 1 (HIV-1) and exerts an inhibitory effect on virus gene expression. The palindromic site (site B) recognized by this protein is related to the palindromic binding sites of members of the s...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/20.20.5429

    authors: Orchard K,Lang G,Collins M,Latchman D

    更新日期:1992-10-25 00:00:00

  • Linker phosphoramidite reagents for the attachment of the first nucleoside to underivatized solid-phase supports.

    abstract::New linker phosphoramidite reagents containing a cleavable 3'-ester linkage are used for attaching the first nucleoside to the surface of a solid- phase support. Inexpensive, underivatized amino supports, such as long chain alkylamine controlled-pore glass, can serve as universal supports. No modifications to phosphor...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkh222

    authors: Pon RT,Yu S

    更新日期:2004-01-29 00:00:00

  • The MH1 domain of Smad3 interacts with Pax6 and represses autoregulation of the Pax6 P1 promoter.

    abstract::Pax6 transcription is under the control of two main promoters (P0 and P1), and these are autoregulated by Pax6. Additionally, Pax6 expression is under the control of the TGFbeta superfamily, although the precise mechanisms of such regulation are not understood. The effect of TGFbeta on Pax6 expression was studied in t...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkl1105

    authors: Grocott T,Frost V,Maillard M,Johansen T,Wheeler GN,Dawes LJ,Wormstone IM,Chantry A

    更新日期:2007-01-01 00:00:00

  • The major form of MeCP2 has a novel N-terminus generated by alternative splicing.

    abstract::MeCP2 is a methyl-CpG binding protein that can repress transcription of nearby genes. In humans, mutations in the MECP2 gene are the major cause of Rett syndrome. By searching expressed sequence tag (EST) databases we have found a novel MeCP2 splice isoform (MeCP2alpha) which encodes a distinct N-terminus. We demonstr...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkh349

    authors: Kriaucionis S,Bird A

    更新日期:2004-03-19 00:00:00

  • Pathbase: a database of mutant mouse pathology.

    abstract::Pathbase is a database that stores images of the abnormal histology associated with spontaneous and induced mutations of both embryonic and adult mice including those produced by transgenesis, targeted mutagenesis and chemical mutagenesis. Images of normal mouse histology and strain-dependent background lesions are al...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkh124

    authors: Schofield PN,Bard JB,Booth C,Boniver J,Covelli V,Delvenne P,Ellender M,Engstrom W,Goessner W,Gruenberger M,Hoefler H,Hopewell J,Mancuso M,Mothersill C,Potten CS,Quintanilla-Fend L,Rozell B,Sariola H,Sundberg JP,Ward

    更新日期:2004-01-01 00:00:00

  • RNIE: genome-wide prediction of bacterial intrinsic terminators.

    abstract::Bacterial Rho-independent terminators (RITs) are important genomic landmarks involved in gene regulation and terminating gene expression. In this investigation we present RNIE, a probabilistic approach for predicting RITs. The method is based upon covariance models which have been known for many years to be the most a...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkr168

    authors: Gardner PP,Barquist L,Bateman A,Nawrocki EP,Weinberg Z

    更新日期:2011-08-01 00:00:00

  • NPGPx modulates CPEB2-controlled HIF-1α RNA translation in response to oxidative stress.

    abstract::Non-selenocysteine-containing phospholipid hydroperoxide glutathione peroxidase (NPGPx or GPx7) is an oxidative stress sensor that modulates the antioxidative activity of its target proteins through intermolecular disulfide bond formation. Given NPGPx's role in protecting cells from oxidative damage, identification of...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkv1010

    authors: Chen PJ,Weng JY,Hsu PH,Shew JY,Huang YS,Lee WH

    更新日期:2015-10-30 00:00:00

  • Uracil-DNA glycosylase affects mismatch repair efficiency in transformation and bisulfite-induced mutagenesis in Streptococcus pneumoniae.

    abstract::The generalized mismatch repair system of Streptococcus pneumoniae (the Hex system) can eliminate base pair mismatches arising in heteroduplex DNA during transformation or by DNA polymerase errors during replication. Mismatch repair is most likely initiated at nicks or gaps. The present work was started to examine the...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/19.20.5525

    authors: Méjean V,Devedjian JC,Rives I,Alloing G,Claverys JP

    更新日期:1991-10-25 00:00:00

  • Downstream signaling mechanism of the C-terminal activation domain of transcriptional coactivator CoCoA.

    abstract::The coiled-coil coactivator (CoCoA) is a transcriptional coactivator for nuclear receptors and enhances nuclear receptor function by the interaction with the bHLH-PAS domain (AD3) of p160 coactivators. The C-terminal activation domain (AD) of CoCoA possesses strong transactivation activity and is required for the coac...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkl361

    authors: Kim JH,Yang CK,Stallcup MR

    更新日期:2006-05-22 00:00:00

  • New observations concerning the chloroacetaldehyde reaction with some tRNA constituents. Stable intermediates, kinetics and selectivity of the reaction.

    abstract::The stable intermediates formed in the reaction of cytosine, cytidine and adenosine with chloracetaldehyde were isolated. The -CH2CH/OH/- bridge between the exo and endo nitrogen atoms of the parent base was found in these compounds by means of PMR spectroscopy. Their acid-induced dehydration resulted in formation of ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/5.3.789

    authors: Biernat J,Ciesiołka J,Górnicki P,Adamiak RW,Kryzosiak WJ,Wiewiórowski M

    更新日期:1978-03-01 00:00:00

  • Real-time single-molecule imaging reveals a direct interaction between UvrC and UvrB on DNA tightropes.

    abstract::Nucleotide excision DNA repair is mechanistically conserved across all kingdoms of life. In prokaryotes, this multi-enzyme process requires six proteins: UvrA-D, DNA polymerase I and DNA ligase. To examine how UvrC locates the UvrB-DNA pre-incision complex at a site of damage, we have labeled UvrB and UvrC with differ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkt177

    authors: Hughes CD,Wang H,Ghodke H,Simons M,Towheed A,Peng Y,Van Houten B,Kad NM

    更新日期:2013-05-01 00:00:00

  • Enzymatic processing of DNA containing tandem dihydrouracil by endonucleases III and VIII.

    abstract::Endonuclease III from Escherichia coli, yeast (yNtg1p and yNtg2p) and human and E.coli endonuclease VIII have a wide substrate specificity, and recognize oxidation products of both thymine and cytosine. DNA containing single dihydrouracil (DHU) and tandem DHU lesions were used as substrates for these repair enzymes. I...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/29.2.407

    authors: Venkhataraman R,Donald CD,Roy R,You HJ,Doetsch PW,Kow YW

    更新日期:2001-01-15 00:00:00

  • Nucleotide sequence analysis of the linked left and right hand terminal regions of adenovirus type 5 DNA present in the transformed rat cell line 5RK20.

    abstract::A peculiar phenomenon is observed in several adenovirus type 2 or 5 (Ad2 or Ad5) transformed cell lines: the right hand and left hand terminal regions of the viral genome present in the viral DNA insertions of these cell lines are found to be linked together. A large part of the viral DNA insertion present in the Ad5 ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/10.7.2189

    authors: Visser L,Reemst AC,van Mansfeld AD,Rozijn TH

    更新日期:1982-04-10 00:00:00

  • Synthesis and hydrolysis of oligodeoxyribonucleotides containing 2-aminopurine.

    abstract::A new method is reported for the synthesis of oligodeoxyribonucleotides containing 2-aminopurine residues at selected sites. This method involves protection of the 2-aminopurine ribonucleoside, reduction to the deoxyribonucleoside and standard preparation of the 5'-0- (4,4'-dimethoxytrityl)-3'-O-(2-cyanoethyl)-N,N- di...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/24.4.754

    authors: Fujimoto J,Nuesca Z,Mazurek M,Sowers LC

    更新日期:1996-02-15 00:00:00

  • A tRNA gene of Xenopus laevis contains at least two sites promoting transcription.

    abstract::A small cloned DNA segment previously shown to contain all genetic information for the expression of the tRNA1met gene of Xenopus laevis was cleaved into an anterior and posterior portion by Hae III restriction. Both restriction fragments were cloned in pCR1 using EcoRI linkers. Starting from these tDNA subclones, a s...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/7.7.1749

    authors: Kressmann A,Hofstetter H,Di Capua E,Grosschedl R,Birnstiel ML

    更新日期:1979-12-11 00:00:00

  • Ancestral Genomes: a resource for reconstructed ancestral genes and genomes across the tree of life.

    abstract::A growing number of whole genome sequencing projects, in combination with development of phylogenetic methods for reconstructing gene evolution, have provided us with a window into genomes that existed millions, and even billions, of years ago. Ancestral Genomes (http://ancestralgenomes.org) is a resource for comprehe...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gky1009

    authors: Huang X,Albou LP,Mushayahama T,Muruganujan A,Tang H,Thomas PD

    更新日期:2019-01-08 00:00:00

  • CellBase, a comprehensive collection of RESTful web services for retrieving relevant biological information from heterogeneous sources.

    abstract::During the past years, the advances in high-throughput technologies have produced an unprecedented growth in the number and size of repositories and databases storing relevant biological data. Today, there is more biological information than ever but, unfortunately, the current status of many of these repositories is ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gks575

    authors: Bleda M,Tarraga J,de Maria A,Salavert F,Garcia-Alonso L,Celma M,Martin A,Dopazo J,Medina I

    更新日期:2012-07-01 00:00:00

  • An ES cell system for rapid, spatial and temporal analysis of gene function in vitro and in vivo.

    abstract::We describe a versatile genetic system for rapid analysis of mammalian gene function. In this, loss of reporter activity in a novel embryonic stem (ES) cell line enables rapid identification of targeting to the ubiquitously expressed Rosa26 locus. Subsequent regulation of gene activity is governed by a dual regulatory...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gni146

    authors: Mao J,Barrow J,McMahon J,Vaughan J,McMahon AP

    更新日期:2005-10-12 00:00:00

  • IL-3A virus infection of a Chlorella-like green alga induces a DNA restriction endonuclease with novel sequence specificity.

    abstract::A type II restriction endonuclease, named CviJI, was isolated from a eukaryotic Chlorella-like green alga infected with the dsDNA containing virus IL-3A. CviJI is the first restriction endonuclease to recognize the sequence PuGCPy; CviJI cleaves DNA between the G and C. Methylation of the cytosine in PuGCPy sequences ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/15.15.6075

    authors: Xia YN,Burbank DE,Uher L,Rabussay D,Van Etten JL

    更新日期:1987-08-11 00:00:00

  • Heteroduplexes of phiX174 and G4 DNAs: orientation to genetic map and comparison with predictions from nucleotide sequences.

    abstract::Heteroduplexes between the viral DNA of phiX174 and DNA from the replicative form (RF) of phage G4 were examined by electron microscopy. The single Eco RI site of G4-RF was utilized as a physical marker by preparing the heteroduplexes from the denatured, linear DNA obtained by restricting G4-RF with Eco RI endonucleas...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/6.5.1979

    authors: Smiley BL,Warner RC

    更新日期:1979-01-01 00:00:00

  • Organization, inducible-expression and chromosome localization of the human HMG-I(Y) nonhistone protein gene.

    abstract::Members of the HMG-I(Y) family of mammalian nonhistone proteins are of importance because they have been demonstrated to bind specifically to the minor groove of A.T-rich sequences both in vitro and in vivo and to function as gene transcriptional regulatory proteins in vivo. Here we report the cloning, sequencing, cha...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/21.18.4259

    authors: Friedmann M,Holth LT,Zoghbi HY,Reeves R

    更新日期:1993-09-11 00:00:00

  • Assays for DNA double-strand break repair by microhomology-based end-joining repair mechanisms.

    abstract::DNA double stranded breaks (DSBs) are one of the most deleterious types of DNA lesions. The main pathways responsible for repairing these breaks in eukaryotic cells are homologous recombination (HR) and non-homologous end-joining (NHEJ). However, a third group of still poorly characterized DSB repair pathways, collect...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkv1349

    authors: Kostyrko K,Mermod N

    更新日期:2016-04-07 00:00:00