OLego: fast and sensitive mapping of spliced mRNA-Seq reads using small seeds.

Abstract:

:A crucial step in analyzing mRNA-Seq data is to accurately and efficiently map hundreds of millions of reads to the reference genome and exon junctions. Here we present OLego, an algorithm specifically designed for de novo mapping of spliced mRNA-Seq reads. OLego adopts a multiple-seed-and-extend scheme, and does not rely on a separate external aligner. It achieves high sensitivity of junction detection by strategic searches with small seeds (~14 nt for mammalian genomes). To improve accuracy and resolve ambiguous mapping at junctions, OLego uses a built-in statistical model to score exon junctions by splice-site strength and intron size. Burrows-Wheeler transform is used in multiple steps of the algorithm to efficiently map seeds, locate junctions and identify small exons. OLego is implemented in C++ with fully multithreaded execution, and allows fast processing of large-scale data. We systematically evaluated the performance of OLego in comparison with published tools using both simulated and real data. OLego demonstrated better sensitivity, higher or comparable accuracy and substantially improved speed. OLego also identified hundreds of novel micro-exons (<30 nt) in the mouse transcriptome, many of which are phylogenetically conserved and can be validated experimentally in vivo. OLego is freely available at http://zhanglab.c2b2.columbia.edu/index.php/OLego.

journal_name

Nucleic Acids Res

journal_title

Nucleic acids research

authors

Wu J,Anczuków O,Krainer AR,Zhang MQ,Zhang C

doi

10.1093/nar/gkt216

subject

Has Abstract

pub_date

2013-05-01 00:00:00

pages

5149-63

issue

10

eissn

0305-1048

issn

1362-4962

pii

gkt216

journal_volume

41

pub_type

杂志文章
  • Decomposition of time-dependent fluorescence signals reveals codon-specific kinetics of protein synthesis.

    abstract::During protein synthesis, the nascent peptide chain traverses the peptide exit tunnel of the ribosome. We monitor the co-translational movement of the nascent peptide using a fluorescent probe attached to the N-terminus of the nascent chain. Due to fluorophore quenching, the time-dependent fluorescence signal emitted ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gky740

    authors: Haase N,Holtkamp W,Lipowsky R,Rodnina M,Rudorf S

    更新日期:2018-12-14 00:00:00

  • Stacking self-association of pyrimidine nucleosides and of cytosines: effects of methylation and thiolation.

    abstract::Stacking self-association equilibria in aqueous solutions of m3uridine, m34,2',3',5'uridine, 2'-deoxyuridine, m13,4,4cytosine, m14,4,4,5cytosine, s2cytidine and s4thymidine were studied at various temperatures by vapour-pressure osmometry. Equilibrium constants Kst's were computed on the assumption of the isodesmic mo...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/3.5.1295

    authors: Plesiewicz E,Stepień E,Bolewska K,Wierzchowski KL

    更新日期:1976-05-01 00:00:00

  • Hydration of single-stranded phosphodiester and phosphorothioate oligodeoxyribonucleotides.

    abstract::Infrared spectroscopy was used to identify hydration-sensitive structural differences between single- stranded phosphorothioate (PS) and phosphodiester (PO) oligodeoxyribonucleotides. Spectra were recorded in the mid-infrared region, 500-1800 cm-1, at relative humidities between 0 and 98%; the PS and PO spectra are su...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/24.16.3261

    authors: White AP,Reeves KK,Snyder E,Farrell J,Powell JW,Mohan V,Griffey RH

    更新日期:1996-08-15 00:00:00

  • Cloning and expression of cDNA encoding mouse tyrosinase.

    abstract::We have isolated a pigment cell-specific cDNA clone from a B16 mouse melanoma cDNA library by differential hybridization. The mRNA of isolated cDNA is highly expressed in B16 melanoma cells and in black mouse (C57BL/6) skin, but is not detectable in mouse neuroblastoma cells nor in K1735 mouse amelanotic melanoma cell...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/14.6.2413

    authors: Shibahara S,Tomita Y,Sakakura T,Nager C,Chaudhuri B,Müller R

    更新日期:1986-03-25 00:00:00

  • A search for base analogs to enhance third-strand binding to 'inverted' target base pairs of triplexes in the pyrimidine/parallel motif.

    abstract::Eight base analogs were tested as third strand residues in otherwise homopyrimidine strands opposite each of the 'direct' (A.T and G.C) and 'inverted' (T.A and C.G) Watson-Crick base pairs, using UV melting profiles to assess triplex stability. The target duplexes contained 20 A.T base pairs and a central test base pa...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/27.23.4632

    authors: Amosova OA,Fresco JR

    更新日期:1999-12-01 00:00:00

  • Identification of sample-specific sequences in mammalian cDNA and genomic DNA by the novel ligation-mediated subtraction (Limes).

    abstract::The representational difference analysis (RDA) and other subtraction techniques are used to enrich sample-specific sequences by elimination of ubiquitous sequences existing in both the sample of interest (tester) and the subtraction partner (driver). While applying the RDA to genomic DNA of cutaneous lymphoma cells in...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/29.4.e20

    authors: Hansen-Hagge TE,Trefzer U,zu Reventlow AS,Kaltoft K,Sterry W

    更新日期:2001-02-15 00:00:00

  • Structural features and stability of an RNA triple helix in solution.

    abstract::A 30 nt RNA with a sequence designed to form an intramolecular triple helix was analyzed by one-and two-dimensional NMR spectroscopy and UV absorption measurements. NMR data show that the RNA contains seven pyrimidine-purine-pyrimidine base triples stabilized by Watson-Crick and Hoogsteen interactions. The temperature...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/24.14.2841

    authors: Holland JA,Hoffman DW

    更新日期:1996-07-15 00:00:00

  • Requirements for selective recruitment of Ets proteins and activation of mb-1/Ig-alpha gene transcription by Pax-5 (BSAP).

    abstract::Pax-5, a member of the paired domain family of transcription factors, is a key regulator of B lymphocyte-specific transcription and differentiation. A major target of Pax-5-mediated activation is the mb-1 gene, which encodes the essential transmembrane signaling protein Ig-alpha. Pax-5 recruits three members of the Et...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkg785

    authors: Maier H,Ostraat R,Parenti S,Fitzsimmons D,Abraham LJ,Garvie CW,Hagman J

    更新日期:2003-10-01 00:00:00

  • DNA nanostructures constructed with multi-stranded motifs.

    abstract::Earlier studies in DNA self-assembly have foretold the feasibility of building addressable nanostructures with multi-stranded motifs, which is fully validated in this study. In realizing this feasibility in DNA nanotechnology, a diversified set of motifs of modified domain lengths is extended from a classic type. The ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkx121

    authors: Yang D,Tan Z,Mi Y,Wei B

    更新日期:2017-04-07 00:00:00

  • Reactivity and selectivity in light-induced free radical reactions of 2-propanol with purine and pyrimidine mononucleotides and dinucleoside monophosphates.

    abstract::Photoalkylation reactions with 2-propanol, initiated with di-tert-butyl peroxide, of a variety of purine and pyrimidine mononucleotides and dinucleoside monophosphates lead to the substitution of an alpha-hydroxyisopropyl group for the H-8 atom of adenosine and the addition of the alcohol across the 5,6-double bond of...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/3.7.1715

    authors: Havron A,Sperling J,Elad D

    更新日期:1976-07-01 00:00:00

  • Satellite DNA from Xenopus laevis: comparative analysis of 745 and 1037 base pair Hind III tandem repeats.

    abstract::Highly repetitive Hind III restriction fragments of 0.72-0.76 KBP from total Xenopus laevis genomic DNA are organized in a tandem like arrangement. Cloning of these fragments in pBR 322 with subsequent restriction site mapping and nucleotide sequence analysis of some selected clones showed two different types of seque...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/11.20.6997

    authors: Meyerhof W,Tappeser B,Korge E,Knöchel W

    更新日期:1983-10-25 00:00:00

  • Localization of mitochondrial DNA base excision repair to an inner membrane-associated particulate fraction.

    abstract::Mitochondrial DNA (mtDNA) contains high levels of oxidative damage relative to nuclear DNA. A full, functional DNA base excision repair (BER) pathway is present in mitochondria, to repair oxidative DNA lesions. However, little is known about the organization of this pathway within mitochondria. Here, we provide eviden...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gki683

    authors: Stuart JA,Mayard S,Hashiguchi K,Souza-Pinto NC,Bohr VA

    更新日期:2005-07-08 00:00:00

  • Electronic fingerprinting of RNA.

    abstract::Software has been developed to assist RNA fingerprinting analysis. One program generates, from a DNA sequence data file, the oligonucleotides resulting from digestion of an RNA transcript labeled with any specified nucleotide(s). Oligonucleotides are sorted according to their position on the fingerprint. Expected mola...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/16.5.1799

    authors: Gegenheimer P

    更新日期:1988-03-11 00:00:00

  • Sequences within and flanking hypersensitive sites 3 and 2 of the beta-globin locus control region required for synergistic versus additive interaction with the epsilon-globin gene promoter.

    abstract::The locus control region is required for high-level, position-independent expression of mammalian beta-globin genes. It is marked by five major DNase hypersensitive sites (HSs) in a 16 kb region of chromatin, and the protein-DNA complexes that form these HSs may interact in a holocomplex that carries out the full func...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/24.21.4327

    authors: Jackson JD,Miller W,Hardison RC

    更新日期:1996-11-01 00:00:00

  • Synthesis and restriction enzyme analysis of oligodeoxyribonucleotides containing the anti-cancer drug 2',2'-difluoro-2'-deoxycytidine.

    abstract::The anti-cancer drug 2',2'-difluoro-2'-deoxycytidine (dFdC) is internally incorporated into DNA in vitro. To determine the effects of this incorporation on DNA structure and function, the beta-cyanoethyl phosphoramidite of dFdC was synthesized and oligodeoxyribonucleotides containing dFdC were made using automated sol...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/20.7.1763

    authors: Richardson FC,Richardson KK,Kroin JS,Hertel LW

    更新日期:1992-04-11 00:00:00

  • SDM: a server for predicting effects of mutations on protein stability.

    abstract::Here, we report a webserver for the improved SDM, used for predicting the effects of mutations on protein stability. As a pioneering knowledge-based approach, SDM has been highlighted as the most appropriate method to use in combination with many other approaches. We have updated the environment-specific amino-acid su...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkx439

    authors: Pandurangan AP,Ochoa-Montaño B,Ascher DB,Blundell TL

    更新日期:2017-07-03 00:00:00

  • Analysis of pooled DNA samples on high density arrays without prior knowledge of differential hybridization rates.

    abstract::Array based DNA pooling techniques facilitate genome-wide scale genotyping of large samples. We describe a structured analysis method for pooled data using internal replication information in large scale genotyping sets. The method takes advantage of information from single nucleotide polymorphisms (SNPs) typed in par...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkl136

    authors: Macgregor S,Visscher PM,Montgomery G

    更新日期:2006-04-20 00:00:00

  • MAGNET: MicroArray Gene expression and Network Evaluation Toolkit.

    abstract::MicroArray Gene expression and Network Evaluation Toolkit (MAGNET) is a web-based application that provides tools to generate and score both protein-protein interaction networks and coexpression networks. MAGNET integrates user-provided experimental measurements with high-throughput proteomic datasets, generating weig...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gks526

    authors: Linderman GC,Chance MR,Bebek G

    更新日期:2012-07-01 00:00:00

  • A database of macromolecular motions.

    abstract::We describe a database of macromolecular motions meant to be of general use to the structural community. The database, which is accessible on the World Wide Web with an entry point at http://bioinfo.mbb.yale.edu/MolMovDB , attempts to systematize all instances of protein and nucleic acid movement for which there is at...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/26.18.4280

    authors: Gerstein M,Krebs W

    更新日期:1998-09-15 00:00:00

  • The COP9 signalosome is vital for timely repair of DNA double-strand breaks.

    abstract::The DNA damage response is vigorously activated by DNA double-strand breaks (DSBs). The chief mobilizer of the DSB response is the ATM protein kinase. We discovered that the COP9 signalosome (CSN) is a crucial player in the DSB response and an ATM target. CSN is a protein complex that regulates the activity of cullin ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkv270

    authors: Meir M,Galanty Y,Kashani L,Blank M,Khosravi R,Fernández-Ávila MJ,Cruz-García A,Star A,Shochot L,Thomas Y,Garrett LJ,Chamovitz DA,Bodine DM,Kurz T,Huertas P,Ziv Y,Shiloh Y

    更新日期:2015-05-19 00:00:00

  • A nucleobase-binding pocket in a viral RNA-dependent RNA polymerase contributes to elongation complex stability.

    abstract::The enterovirus 71 (EV71) 3Dpol is an RNA-dependent RNA polymerase (RdRP) that plays the central role in the viral genome replication, and is an important target in antiviral studies. Here, we report a crystal structure of EV71 3Dpol elongation complex (EC) at 1.8 Å resolution. The structure reveals that the 5'-end gu...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkz1170

    authors: Shi W,Ye HQ,Deng CL,Li R,Zhang B,Gong P

    更新日期:2020-02-20 00:00:00

  • Heat-inducible translational coupling in Bacillus subtilis.

    abstract::Bacillus subtilis plasmid pGR71 is a promoter-probe shuttle vector derived from pUB110. The expression of the cat gene on pGR71 in B. subtilis requires the insertion of a Bacillus promoter and a ribosomal binding site (RBS) into the HindIII cloning site immediately upstream from the cat gene. A recombinant plasmid of ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/18.4.739

    authors: Fujiwara S,Tsubokura N,Kurusu Y,Minami K,Kobayashi Y

    更新日期:1990-02-25 00:00:00

  • Identification of essential domains for Escherichia coli tRNA(leu) aminoacylation and amino acid editing using minimalist RNA molecules.

    abstract::Escherichia coli leucyl-tRNA synthetase (LeuRS) aminoacylates up to six different class II tRNA(leu) molecules. Each has a distinct anticodon and varied nucleotides in other regions of the tRNA. Attempts to construct a minihelix RNA that can be aminoacylated with leucine have been unsuccessful. Herein, we describe the...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/30.10.2103

    authors: Larkin DC,Williams AM,Martinis SA,Fox GE

    更新日期:2002-05-15 00:00:00

  • Haplotyping by CRISPR-mediated DNA circularization (CRISPR-hapC) broadens allele-specific gene editing.

    abstract::Allele-specific protospacer adjacent motif (asPAM)-positioning SNPs and CRISPRs are valuable resources for gene therapy of dominant disorders. However, one technical hurdle is to identify the haplotype comprising the disease-causing allele and the distal asPAM SNPs. Here, we describe a novel CRISPR-based method (CRISP...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkz1233

    authors: Yu J,Xiang X,Huang J,Liang X,Pan X,Dong Z,Petersen TS,Qu K,Yang L,Zhao X,Li S,Zheng T,Xu Z,Liu C,Han P,Xu F,Yang H,Liu X,Zhang X,Bolund L,Luo Y,Lin L

    更新日期:2020-03-18 00:00:00

  • Binding site requirements and differential representation of TGF factors in nuclear ASF-1 activity.

    abstract::Activating sequence factor 1 (ASF-1) is a nuclear DNA-binding activity that is found in monocots and dicots. It interacts with several TGACG-containing elements that have been characterized from viral and T-DNA genes, the prototypes of which are the as-1 element of the CaMV 35S promoter and the ocs element from the oc...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/23.18.3778

    authors: Lam E,Lam YK

    更新日期:1995-09-25 00:00:00

  • Identification of a distal regulatory element in the 5' flanking region of the bovine prolactin gene.

    abstract::The 5'-flanking region of the bovine prolactin gene was cloned and sequenced. The expression of chimeric gene constructs containing 5'-flanking DNA fragments from the prolactin gene joined to a reporter gene encoding human growth hormone (hGH) was examined using transiently transfected rat pituitary cells. Prolactin n...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:

    authors: Wolf JB,David VA,Deutch AH

    更新日期:1990-08-25 00:00:00

  • Human DNA topoisomerase IIbeta binds and cleaves four-way junction DNA in vitro.

    abstract::We have used gel retardation analysis to show that human DNA topoisomerase IIbeta can bind a 40 bp linear duplex containing a single DNA topoisomerase IIbeta cleavage site. Furthermore, we demonstrate for the first time that human DNA topoisomerase IIbeta binds to four-way junction DNA. This supports previous suggesti...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/27.4.984

    authors: West KL,Austin CA

    更新日期:1999-02-15 00:00:00

  • Crystal structure of Hfq from Bacillus subtilis in complex with SELEX-derived RNA aptamer: insight into RNA-binding properties of bacterial Hfq.

    abstract::Bacterial Hfq is a protein that plays an important role in the regulation of genes in cooperation with sRNAs. Escherichia coli Hfq (EcHfq) has two or more sites that bind RNA(s) including U-rich and/or the poly(A) tail of mRNA. However, functional and structural information about Bacillus subtilis Hfq (BsHfq) includin...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkr892

    authors: Someya T,Baba S,Fujimoto M,Kawai G,Kumasaka T,Nakamura K

    更新日期:2012-02-01 00:00:00

  • RPAN: rice pan-genome browser for ∼3000 rice genomes.

    abstract::A pan-genome is the union of the gene sets of all the individuals of a clade or a species and it provides a new dimension of genome complexity with the presence/absence variations (PAVs) of genes among these genomes. With the progress of sequencing technologies, pan-genome study is becoming affordable for eukaryotes w...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkw958

    authors: Sun C,Hu Z,Zheng T,Lu K,Zhao Y,Wang W,Shi J,Wang C,Lu J,Zhang D,Li Z,Wei C

    更新日期:2017-01-25 00:00:00

  • Two distinct overstretched DNA states.

    abstract::The DNA double helix undergoes an 'overstretching' transition in a narrow force range near 65 pN. Despite numerous studies the basic question of whether the strands are separated or not remains controversial. Here we show that overstretching in fact involves two distinct types of double-helix reorganization: slow hyst...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkq309

    authors: Fu H,Chen H,Marko JF,Yan J

    更新日期:2010-09-01 00:00:00