sRNAnalyzer-a flexible and customizable small RNA sequencing data analysis pipeline.

Abstract:

:Although many tools have been developed to analyze small RNA sequencing (sRNA-Seq) data, it remains challenging to accurately analyze the small RNA population, mainly due to multiple sequence ID assignment caused by short read length. Additional issues in small RNA analysis include low consistency of microRNA (miRNA) measurement results across different platforms, miRNA mapping associated with miRNA sequence variation (isomiR) and RNA editing, and the origin of those unmapped reads after screening against all endogenous reference sequence databases. To address these issues, we built a comprehensive and customizable sRNA-Seq data analysis pipeline-sRNAnalyzer, which enables: (i) comprehensive miRNA profiling strategies to better handle isomiRs and summarization based on each nucleotide position to detect potential SNPs in miRNAs, (ii) different sequence mapping result assignment approaches to simulate results from microarray/qRT-PCR platforms and a local probabilistic model to assign mapping results to the most-likely IDs, (iii) comprehensive ribosomal RNA filtering for accurate mapping of exogenous RNAs and summarization based on taxonomy annotation. We evaluated our pipeline on both artificial samples (including synthetic miRNA and Escherichia coli cultures) and biological samples (human tissue and plasma). sRNAnalyzer is implemented in Perl and available at: http://srnanalyzer.systemsbiology.net/.

journal_name

Nucleic Acids Res

journal_title

Nucleic acids research

authors

Wu X,Kim TK,Baxter D,Scherler K,Gordon A,Fong O,Etheridge A,Galas DJ,Wang K

doi

10.1093/nar/gkx999

subject

Has Abstract

pub_date

2017-12-01 00:00:00

pages

12140-12151

issue

21

eissn

0305-1048

issn

1362-4962

pii

4563315

journal_volume

45

pub_type

杂志文章
  • The Streptococcus thermophilus CRISPR/Cas system provides immunity in Escherichia coli.

    abstract::The CRISPR/Cas adaptive immune system provides resistance against phages and plasmids in Archaea and Bacteria. CRISPR loci integrate short DNA sequences from invading genetic elements that provide small RNA-mediated interference in subsequent exposure to matching nucleic acids. In Streptococcus thermophilus, it was pr...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkr606

    authors: Sapranauskas R,Gasiunas G,Fremaux C,Barrangou R,Horvath P,Siksnys V

    更新日期:2011-11-01 00:00:00

  • SAHG, a comprehensive database of predicted structures of all human proteins.

    abstract::Most proteins from higher organisms are known to be multi-domain proteins and contain substantial numbers of intrinsically disordered (ID) regions. To analyse such protein sequences, those from human for instance, we developed a special protein-structure-prediction pipeline and accumulated the products in the Structur...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkq1057

    authors: Motono C,Nakata J,Koike R,Shimizu K,Shirota M,Amemiya T,Tomii K,Nagano N,Sakaya N,Misoo K,Sato M,Kidera A,Hiroaki H,Shirai T,Kinoshita K,Noguchi T,Ota M

    更新日期:2011-01-01 00:00:00

  • DNA flap creation by the RarA/MgsA protein of Escherichia coli.

    abstract::We identify a novel activity of the RarA (also MgsA) protein of Escherichia coli, demonstrating that this protein functions at DNA ends to generate flaps. A AAA+ ATPase in the clamp loader clade, RarA protein is part of a highly conserved family of DNA metabolism proteins. We demonstrate that RarA binds to double-stra...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkw1322

    authors: Stanage TH,Page AN,Cox MM

    更新日期:2017-03-17 00:00:00

  • High resolution deletion breakpoint mapping in the DMD gene by whole cosmid hybridization.

    abstract::The locus DXS269 (P20) defines a deletion hotspot in the distal part of the Duchenne Muscular Dystrophy gene. We have cloned over 90 kilobase-pairs of genomic DNA from this region in overlapping cosmids. The use of whole cosmids as probes in a competitive DNA hybridization analysis proves a fast and convenient method ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/17.14.5611

    authors: Blonden LA,den Dunnen JT,van Paassen HM,Wapenaar MC,Grootscholten PM,Ginjaar HB,Bakker E,Pearson PL,van Ommen GJ

    更新日期:1989-07-25 00:00:00

  • Mitome: dynamic and interactive database for comparative mitochondrial genomics in metazoan animals.

    abstract::Mitome is a specialized mitochondrial genome database designed for easy comparative analysis of various features of metazoan mitochondrial genomes such as base frequency, A+T skew, codon usage and gene arrangement pattern. A particular function of the database is the automatic reconstruction of phylogenetic relationsh...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkm763

    authors: Lee YS,Oh J,Kim YU,Kim N,Yang S,Hwang UW

    更新日期:2008-01-01 00:00:00

  • YeastFab: the design and construction of standard biological parts for metabolic engineering in Saccharomyces cerevisiae.

    abstract::It is a routine task in metabolic engineering to introduce multicomponent pathways into a heterologous host for production of metabolites. However, this process sometimes may take weeks to months due to the lack of standardized genetic tools. Here, we present a method for the design and construction of biological part...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkv464

    authors: Guo Y,Dong J,Zhou T,Auxillos J,Li T,Zhang W,Wang L,Shen Y,Luo Y,Zheng Y,Lin J,Chen GQ,Wu Q,Cai Y,Dai J

    更新日期:2015-07-27 00:00:00

  • 2-Thiouracil deprived of thiocarbonyl function preferentially base pairs with guanine rather than adenine in RNA and DNA duplexes.

    abstract::2-Thiouracil-containing nucleosides are essential modified units of natural and synthetic nucleic acids. In particular, the 5-substituted-2-thiouridines (S2Us) present in tRNA play an important role in tuning the translation process through codon-anticodon interactions. The enhanced thermodynamic stability of S2U-cont...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkv109

    authors: Sochacka E,Szczepanowski RH,Cypryk M,Sobczak M,Janicka M,Kraszewska K,Bartos P,Chwialkowska A,Nawrot B

    更新日期:2015-03-11 00:00:00

  • Crosstalk between transposase subunits during cleavage of the mariner transposon.

    abstract::Mariner transposition is a complex reaction that involves three recombination sites and six strand breaking and joining reactions. This requires precise spatial and temporal coordination between the different components to ensure a productive outcome and minimize genomic instability. We have investigated how the cleav...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gku172

    authors: Claeys Bouuaert C,Walker N,Liu D,Chalmers R

    更新日期:2014-05-01 00:00:00

  • miROrtho: computational survey of microRNA genes.

    abstract::MicroRNAs (miRNAs) are short, non-protein coding RNAs that direct the widespread phenomenon of post-transcriptional regulation of metazoan genes. The mature approximately 22-nt long RNA molecules are processed from genome-encoded stem-loop structured precursor genes. Hundreds of such genes have been experimentally val...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkn707

    authors: Gerlach D,Kriventseva EV,Rahman N,Vejnar CE,Zdobnov EM

    更新日期:2009-01-01 00:00:00

  • JBP1 and JBP2 are two distinct thymidine hydroxylases involved in J biosynthesis in genomic DNA of African trypanosomes.

    abstract::Genomic DNA of African trypanosomes contains a hypermodified thymidine residue termed base J (beta-d-glucosyl-HOMedU). This modified base is localized primarily to repetitive DNA, namely the telomeres, and is implicated in the regulation of antigenic variation. The base is synthesized in a two-step pathway. Initially,...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkn1067

    authors: Cliffe LJ,Kieft R,Southern T,Birkeland SR,Marshall M,Sweeney K,Sabatini R

    更新日期:2009-04-01 00:00:00

  • Effect of DNA bases and backbone on sigma70 holoenzyme binding and isomerization using fork junction probes.

    abstract::Abasic substitutions in the non-template strand and promoter sequence changes were made to assess the roles of various promoter features in sigma70 holoenzyme interactions with fork junction probes. Removal of -10 element non-template single strand bases, leaving the phosphodiester backbone intact, did not interfere w...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkg400

    authors: Fenton MS,Gralla JD

    更新日期:2003-06-01 00:00:00

  • 5'-UTR G-quadruplex structures acting as translational repressors.

    abstract::Given that greater than 90% of the human genome is expressed, it is logical to assume that post-transcriptional regulatory mechanisms must be the primary means of controlling the flow of information from mRNA to protein. This report describes a robust approach that includes in silico, in vitro and in cellulo experimen...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkq557

    authors: Beaudoin JD,Perreault JP

    更新日期:2010-11-01 00:00:00

  • Structural requirements of polynucleotides for the activation of (2' - 5')An polymerase and protein kinase.

    abstract::Two enzymatic pathways are involved in the inhibitory effects of double-stranded (ds)RNA on protein synthesis in cell extracts derived from interferon-treated human fibroblasts or HeLa cells, an oligonucleotide polymerase that synthesizes (2'-5')An from ATP and a protein kinase that phosphorylates the alpha subunit of...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/9.19.4939

    authors: Baglioni C,Minks MA,De Clercq E

    更新日期:1981-10-10 00:00:00

  • The missing indels: an estimate of indel variation in a human genome and analysis of factors that impede detection.

    abstract::With the development of High-Throughput Sequencing (HTS) thousands of human genomes have now been sequenced. Whenever different studies analyze the same genome they usually agree on the amount of single-nucleotide polymorphisms, but differ dramatically on the number of insertion and deletion variants (indels). Further...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkv677

    authors: Jiang Y,Turinsky AL,Brudno M

    更新日期:2015-09-03 00:00:00

  • BioModels Database: a free, centralized database of curated, published, quantitative kinetic models of biochemical and cellular systems.

    abstract::BioModels Database (http://www.ebi.ac.uk/biomodels/), part of the international initiative BioModels.net, provides access to published, peer-reviewed, quantitative models of biochemical and cellular systems. Each model is carefully curated to verify that it corresponds to the reference publication and gives the proper...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkj092

    authors: Le Novère N,Bornstein B,Broicher A,Courtot M,Donizelli M,Dharuri H,Li L,Sauro H,Schilstra M,Shapiro B,Snoep JL,Hucka M

    更新日期:2006-01-01 00:00:00

  • A non-canonical multisubunit RNA polymerase encoded by the AR9 phage recognizes the template strand of its uracil-containing promoters.

    abstract::AR9 is a giant Bacillus subtilis phage whose uracil-containing double-stranded DNA genome encodes distant homologs of β and β' subunits of bacterial RNA polymerase (RNAP). The products of these genes are thought to assemble into two non-canonical multisubunit RNAPs - a virion RNAP (vRNAP) that is injected into the hos...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkx264

    authors: Sokolova M,Borukhov S,Lavysh D,Artamonova T,Khodorkovskii M,Severinov K

    更新日期:2017-06-02 00:00:00

  • Synergistic effects of ATP and RNA binding to human DEAD-box protein DDX1.

    abstract::RNA helicases of the DEAD-box protein family form the largest group of helicases. The human DEAD-box protein 1 (DDX1) plays an important role in tRNA and mRNA processing, is involved in tumor progression and is also hijacked by several virus families such as HIV-1 for replication and nuclear export. Although important...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkv106

    authors: Kellner JN,Reinstein J,Meinhart A

    更新日期:2015-03-11 00:00:00

  • The International Gene Trap Consortium Website: a portal to all publicly available gene trap cell lines in mouse.

    abstract::Gene trapping is a method of generating murine embryonic stem (ES) cell lines containing insertional mutations in known and novel genes. A number of international groups have used this approach to create sizeable public cell line repositories available to the scientific community for the generation of mutant mouse str...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkj097

    authors: Nord AS,Chang PJ,Conklin BR,Cox AV,Harper CA,Hicks GG,Huang CC,Johns SJ,Kawamoto M,Liu S,Meng EC,Morris JH,Rossant J,Ruiz P,Skarnes WC,Soriano P,Stanford WL,Stryke D,von Melchner H,Wurst W,Yamamura K,Young SG,

    更新日期:2006-01-01 00:00:00

  • Elastic properties of ribosomal RNA building blocks: molecular dynamics of the GTPase-associated center rRNA.

    abstract::Explicit solvent molecular dynamics (MD) was used to describe the intrinsic flexibility of the helix 42-44 portion of the 23S rRNA (abbreviated as Kt-42+rGAC; kink-turn 42 and GTPase-associated center rRNA). The bottom part of this molecule consists of alternating rigid and flexible segments. The first flexible segmen...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkm245

    authors: Rázga F,Koca J,Mokdad A,Sponer J

    更新日期:2007-01-01 00:00:00

  • Optimizing DNA assembly based on statistical language modelling.

    abstract::By successively assembling genetic parts such as BioBrick according to grammatical models, complex genetic constructs composed of dozens of functional blocks can be built. However, usually every category of genetic parts includes a few or many parts. With increasing quantity of genetic parts, the process of assembling...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkx859

    authors: Fang G,Zhang S,Dong Y

    更新日期:2017-12-15 00:00:00

  • Selective catalysis of A.T base pair proton exchange in DNA complexes: imino proton NMR analysis.

    abstract::Polyaromatic molecules with amino chain substituents, upon binding with DNA, selectively catalyze exchange of the A.T base pair protons with bulk water protons. The amine-catalyzed exchange is mediated by compounds which are A.T and G.C base sequence specific, intercalators, and outside binders. A mechanism for the se...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/15.20.8511

    authors: Strekowski L,Watson RA,Wilson WD

    更新日期:1987-10-26 00:00:00

  • Structural insights into RNA encapsidation and helical assembly of the Toscana virus nucleoprotein.

    abstract::Toscana virus is an emerging bunyavirus in Mediterranean Europe where it accounts for 80% of pediatric meningitis cases during the summer. The negative-strand ribonucleic acid (RNA) genome of the virus is wrapped around the virally encoded nucleoprotein N to form the ribonucleoprotein complex (RNP). We determined crys...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gku229

    authors: Olal D,Dick A,Woods VL Jr,Liu T,Li S,Devignot S,Weber F,Saphire EO,Daumke O

    更新日期:2014-05-01 00:00:00

  • Participation of altered upstream stimulatory factor in the induction of rat heme oxygenase-1 by cadmium.

    abstract::We have reported that an upstream stimulatory factor (USF) binding site is functional in transcription of the heme oxygenase-1 gene. In this study, we examined the role of USF in the induced state. By transient expression analyses with the chloramphenicol acetyl-transferase gene, we found that the USF binding site pla...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/24.15.2959

    authors: Maeshima H,Sato M,Ishikawa K,Katagata Y,Yoshida T

    更新日期:1996-08-01 00:00:00

  • DDBJ in preparation for overview of research activities behind data submissions.

    abstract::In the past year, DDBJ (http://www.ddbj.nig.ac.jp) collected and released 1,956,826 entries or 1,741,313,111 bases. The released data include approximately 90,000 ESTs and cDNAs of Macaca fascicularis, and 280 million bases of mouse GSS. In addition to the data collection, we have indexed the submitted data to the Int...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkj111

    authors: Okubo K,Sugawara H,Gojobori T,Tateno Y

    更新日期:2006-01-01 00:00:00

  • Nucleotide excision repair efficiency in quiescent human fibroblasts is modulated by circadian clock.

    abstract::The efficiency of Nucleotide Excision Repair (NER)process is crucial for maintaining genomic integrity because in many organisms, including humans, it represents the only system able to repair a wide range of DNA damage. The aim of the work was to investigate whether the efficiency of the repair of photoproducts induc...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkv081

    authors: Bee L,Marini S,Pontarin G,Ferraro P,Costa R,Albrecht U,Celotti L

    更新日期:2015-02-27 00:00:00

  • DBD: a transcription factor prediction database.

    abstract::Regulation of gene expression influences almost all biological processes in an organism; sequence-specific DNA-binding transcription factors are critical to this control. For most genomes, the repertoire of transcription factors is only partially known. Hitherto transcription factor identification has been largely bas...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkj131

    authors: Kummerfeld SK,Teichmann SA

    更新日期:2006-01-01 00:00:00

  • Contributions of DNA interstrand cross-links to aging of cells and organisms.

    abstract::Impaired DNA damage repair, especially deficient transcription-coupled nucleotide excision repair, leads to segmental progeroid syndromes in human patients as well as in rodent models. Furthermore, DNA double-strand break signalling has been pinpointed as a key inducer of cellular senescence. Several recent findings s...

    journal_title:Nucleic acids research

    pub_type: 杂志文章,评审

    doi:10.1093/nar/gkm1065

    authors: Grillari J,Katinger H,Voglauer R

    更新日期:2007-01-01 00:00:00

  • The transposable element Mdg3 in Drosophila melanogaster is flanked with the perfect direct and mismatched inverted repeats.

    abstract::MDg3 is a family of mobile dispersed genetic elements represented by 15 copies in the haploid genome of D. melanogaster and flanked, like other similar elements, by the regions of homology. In the present work, these regions of mdg3 have been sequenced. The existence of perfect direct repeats 268 base pairs long has b...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/8.15.3263

    authors: Bayev AA Jr,Krayev AS,Lyubomirskaya NV,Ilyin YV,Skryabin KG,Georgiev GP

    更新日期:1980-08-11 00:00:00

  • Two types of triplicated alpha-globin loci in humans.

    abstract::DNA from healthy Malaysian newborns was studied on gene maps after digestion with different restriction endonucleases. Of 65 newborns, two were found to be carriers of two different variants of triplicated alpha-globin loci. In variant no. 1, found in an Malay, the three alpha-globin genes are in an elongated DNA frag...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/9.15.3707

    authors: Lie-Injo LE,Herrera AR,Kan YW

    更新日期:1981-08-11 00:00:00

  • Atomic resolution structure of CAG RNA repeats: structural insights and implications for the trinucleotide repeat expansion diseases.

    abstract::CAG repeats occur predominantly in the coding regions of human genes, which suggests their functional importance. In some genes, these sequences can undergo pathogenic expansions leading to neurodegenerative polyglutamine (poly-Q) diseases. The mutant transcripts containing expanded CAG repeats possibly contribute to ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkq700

    authors: Kiliszek A,Kierzek R,Krzyzosiak WJ,Rypniewski W

    更新日期:2010-12-01 00:00:00