MasterOfPores: A Workflow for the Analysis of Oxford Nanopore Direct RNA Sequencing Datasets.

Abstract:

:The direct RNA sequencing platform offered by Oxford Nanopore Technologies allows for direct measurement of RNA molecules without the need of conversion to complementary DNA, fragmentation or amplification. As such, it is virtually capable of detecting any given RNA modification present in the molecule that is being sequenced, as well as provide polyA tail length estimations at the level of individual RNA molecules. Although this technology has been publicly available since 2017, the complexity of the raw Nanopore data, together with the lack of systematic and reproducible pipelines, have greatly hindered the access of this technology to the general user. Here we address this problem by providing a fully benchmarked workflow for the analysis of direct RNA sequencing reads, termed MasterOfPores. The pipeline starts with a pre-processing module, which converts raw current intensities into multiple types of processed data including FASTQ and BAM, providing metrics of the quality of the run, quality-filtering, demultiplexing, base-calling and mapping. In a second step, the pipeline performs downstream analyses of the mapped reads, including prediction of RNA modifications and estimation of polyA tail lengths. Four direct RNA MinION sequencing runs can be fully processed and analyzed in 10 h on 100 CPUs. The pipeline can also be executed in GPU locally or in the cloud, decreasing the run time fourfold. The software is written using the NextFlow framework for parallelization and portability, and relies on Linux containers such as Docker and Singularity for achieving better reproducibility. The MasterOfPores workflow can be executed on any Unix-compatible OS on a computer, cluster or cloud without the need of installing any additional software or dependencies, and is freely available in Github (https://github.com/biocorecrg/master_of_pores). This workflow simplifies direct RNA sequencing data analyses, facilitating the study of the (epi)transcriptome at single molecule resolution.

journal_name

Front Genet

journal_title

Frontiers in genetics

authors

Cozzuto L,Liu H,Pryszcz LP,Pulido TH,Delgado-Tejedor A,Ponomarenko J,Novoa EM

doi

10.3389/fgene.2020.00211

subject

Has Abstract

pub_date

2020-03-17 00:00:00

pages

211

issn

1664-8021

journal_volume

11

pub_type

杂志文章
  • Screening and Identification of Potential Prognostic Biomarkers in Adrenocortical Carcinoma.

    abstract::Objective: Adrenocortical carcinoma (ACC) is a rare but aggressive malignant cancer that has been attracting growing attention over recent decades. This study aims to integrate protein interaction networks with gene expression profiles to identify potential biomarkers with prognostic value in silico. Methods: Three mi...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2019.00821

    authors: Xu WH,Wu J,Wang J,Wan FN,Wang HK,Cao DL,Qu YY,Zhang HL,Ye DW

    更新日期:2019-09-11 00:00:00

  • Interaction between oxytocin receptor DNA methylation and genotype is associated with risk of postpartum depression in women without depression in pregnancy.

    abstract::Postpartum depression (PPD) affects up to 19% of women, negatively impacting maternal and infant health. Reductions in plasma oxytocin levels have been associated with PPD and heritability studies have established a genetic contribution. Epigenetic regulation of the oxytocin receptor gene (OXTR) has been demonstrated ...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2015.00243

    authors: Bell AF,Carter CS,Steer CD,Golding J,Davis JM,Steffen AD,Rubin LH,Lillard TS,Gregory SP,Harris JC,Connelly JJ

    更新日期:2015-07-21 00:00:00

  • Genome-Wide Association Study for Milk Protein Composition Traits in a Chinese Holstein Population Using a Single-Step Approach.

    abstract::Genome-wide association studies (GWASs) have been widely used to determine the genetic architecture of quantitative traits in dairy cattle. In this study, with the aim of identifying candidate genes that affect milk protein composition traits, we conducted a GWAS for nine such traits (αs1-casein, αs2-casein, β-casein,...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2019.00072

    authors: Zhou C,Li C,Cai W,Liu S,Yin H,Shi S,Zhang Q,Zhang S

    更新日期:2019-02-19 00:00:00

  • Noncoding RNAs regulate NF-κB signaling to modulate blood vessel inflammation.

    abstract::Cardiovascular diseases such as atherosclerosis are one of the leading causes of morbidity and mortality worldwide. The clinical manifestations of atherosclerosis, which include heart attack and stroke, occur several decades after initiation of the disease and become more severe with age. Inflammation of blood vessels...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章,评审

    doi:10.3389/fgene.2014.00422

    authors: Cheng HS,Njock MS,Khyzha N,Dang LT,Fish JE

    更新日期:2014-12-10 00:00:00

  • Estimation of Gene Expression at Isoform Level from mRNA-Seq Data by Bayesian Hierarchical Modeling.

    abstract::mRNA-Seq is a precise and highly reproducible technique for measurement of transcripts levels and yields sequence information of a transcriptome at a single nucleotide base-level thus enabling us to determine splice junctions and alternative splicing events with high confidence. Often analysis of mRNA-Seq data does no...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2012.00239

    authors: Bhattacharjee M,Gupta R,Davuluri RV

    更新日期:2012-11-27 00:00:00

  • Genomic basis of evolutionary change: evolving immunity.

    abstract::Complex traits are manifestations of intricate gene interaction networks. Evolution of complex traits revolves around the genetic variation in such networks. Genomics has increased our ability to investigate the complex gene interaction networks, and characterize the extent of genetic variation in these networks. Immu...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2015.00222

    authors: Wertheim B

    更新日期:2015-06-19 00:00:00

  • Comprehensive RNA-Seq Data Analysis Identifies Key mRNAs and lncRNAs in Atrial Fibrillation.

    abstract::Long non-coding RNAs (lncRNAs) are an emerging class of RNA species that may play a critical regulatory role in gene expression. However, the association between lncRNAs and atrial fibrillation (AF) is still not fully understood. In this study, we used RNA sequencing data to identify and quantify the both protein codi...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2019.00908

    authors: Wu DM,Zhou ZK,Fan SH,Zheng ZH,Wen X,Han XR,Wang S,Wang YJ,Zhang ZF,Shan Q,Li MQ,Hu B,Lu J,Chen GQ,Hong XW,Zheng YL

    更新日期:2019-10-02 00:00:00

  • Increased Expression of TICRR Predicts Poor Clinical Outcomes: A Potential Therapeutic Target for Papillary Renal Cell Carcinoma.

    abstract::Background: Papillary renal cell carcinoma (PRCC), although the second-most common type of renal cell carcinoma, still lacks specific biomarkers for diagnosis, treatment, and prognosis. TopBP1-interacting checkpoint and replication regulator (TICRR) is a DNA replication initiation regulator upregulated in various canc...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2020.605378

    authors: Xia S,Lin Y,Lin J,Li X,Tan X,Huang Z

    更新日期:2021-01-11 00:00:00

  • DeepLRHE: A Deep Convolutional Neural Network Framework to Evaluate the Risk of Lung Cancer Recurrence and Metastasis From Histopathology Images.

    abstract::It is critical for patients who cannot undergo eradicable surgery to predict the risk of lung cancer recurrence and metastasis; therefore, the physicians can design the appropriate adjuvant therapy plan. However, traditional circulating tumor cell (CTC) detection or next-generation sequencing (NGS)-based methods are u...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2020.00768

    authors: Wu Z,Wang L,Li C,Cai Y,Liang Y,Mo X,Lu Q,Dong L,Liu Y

    更新日期:2020-08-25 00:00:00

  • Association of Fibroblast Growth Factor 23 With Ischemic Stroke and Its Subtypes: A Mendelian Randomization Study.

    abstract::Fibroblast growth factor 23 (FGF23), which is involved in the regulation of vitamin D, is an emerging independent risk factor for cardiovascular diseases. Previous studies have demonstrated a positive association between FGF23 and stroke. In this study, we aimed to assess the association of FGF23 with ischemic stroke ...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2020.608517

    authors: Zheng K,Lin L,Cui P,Liu T,Chen L,Yang C,Jiang W

    更新日期:2020-12-23 00:00:00

  • Integrated Analysis of Large-Scale Omics Data Revealed Relationship Between Tissue Specificity and Evolutionary Dynamics of Small RNAs in Maize (Zea mays).

    abstract::The evolutionary dynamics and tissue specificity of protein-coding genes are well documented in plants. However, the evolutionary consequences of small RNAs (sRNAs) on tissue-specific functions remain poorly understood. Here, we performed integrated analysis of 195 deeply sequenced sRNA libraries of maize B73, represe...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2020.00051

    authors: Xu Y,Zhang T,Li Y,Miao Z

    更新日期:2020-02-11 00:00:00

  • Using Landscape Genetics Simulations for Planting Blister Rust Resistant Whitebark Pine in the US Northern Rocky Mountains.

    abstract::Recent population declines to the high elevation western North America foundation species whitebark pine, have been driven by the synergistic effects of the invasive blister rust pathogen, mountain pine beetle (MPB), fire exclusion, and climate change. This has led to consideration for listing whitebark pine (WBP) as ...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2017.00009

    authors: Landguth EL,Holden ZA,Mahalovich MF,Cushman SA

    更新日期:2017-02-10 00:00:00

  • Effects of Genotype by Environment Interaction on Genetic Gain and Genetic Parameter Estimates in Red Tilapia (Oreochromis spp.).

    abstract::The extent to which genetic gain achieved from selection programs under strictly controlled environments in the nucleus that can be expressed in commercial production systems is not well-documented in aquaculture species. The main aim of this paper was to assess the effects of genotype by environment interaction on ge...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2017.00082

    authors: Nguyen NH,Hamzah A,Thoa NP

    更新日期:2017-06-13 00:00:00

  • Cancer as a Tissue Anomaly: Classifying Tumor Transcriptomes Based Only on Healthy Data.

    abstract::Since the turn of the century, researchers have sought to diagnose cancer based on gene expression signatures measured from the blood or biopsy as biomarkers. This task, known as classification, is typically solved using a suite of algorithms that learn a mathematical rule capable of discriminating one group ("cases")...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2019.00599

    authors: Quinn TP,Nguyen T,Lee SC,Venkatesh S

    更新日期:2019-07-02 00:00:00

  • Post-transcriptional Gene Regulation in Colitis Associated Cancer.

    abstract::Colitis-associated cancer (CAC) has been linked to microRNA (miRNA) aberrant expression elicited by inflammation. In this study, we used the AOM/DSS-induced CAC mice model to explore the ectopic expression of miRNAs in the precancerous stage of CAC. As a result, we found that miR-31-5p, miR-223-3p, and let-7f-5p were ...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2019.00585

    authors: Chen G,Feng Y,Li X,Jiang Z,Bei B,Zhang L,Han Y,Li Y,Li N

    更新日期:2019-06-19 00:00:00

  • Proband Whole-Exome Sequencing Identified Genes Responsible for Autosomal Recessive Non-Syndromic Hearing Loss in 33 Chinese Nuclear Families.

    abstract::Autosomal recessive non-syndromic hearing loss (ARNSHL) is a highly heterogeneous disease involving more than 70 pathogenic genes. However, most ARNSHL families have small-sized pedigrees with limited genetic information, rendering challenges for the molecular diagnosis of these patients. Therefore, we attempted to es...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2019.00639

    authors: Sang S,Ling J,Liu X,Mei L,Cai X,Li T,Li W,Li M,Wen J,Liu X,Liu J,Liu Y,Chen H,He C,Feng Y

    更新日期:2019-07-17 00:00:00

  • A Mechanism for Genome Size Reduction Following Genomic Rearrangements.

    abstract::The factors behind genome size evolution have been of great interest, considering that eukaryotic genomes vary in size by more than three orders of magnitude. Using a model of two wild peanut relatives, Arachis duranensis and Arachis ipaensis, in which one genome experienced large rearrangements, we find that the main...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2018.00454

    authors: Ren L,Huang W,Cannon EKS,Bertioli DJ,Cannon SB

    更新日期:2018-10-09 00:00:00

  • Lost in Translation: Ribosome-Associated mRNA and Protein Quality Controls.

    abstract::Aberrant, misfolded, and mislocalized proteins are often toxic to cells and result in many human diseases. All proteins and their mRNA templates are subject to quality control. There are several distinct mechanisms that control the quality of mRNAs and proteins during translation at the ribosome. mRNA quality control ...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章,评审

    doi:10.3389/fgene.2018.00431

    authors: Karamyshev AL,Karamysheva ZN

    更新日期:2018-10-04 00:00:00

  • Association of the Lactase Persistence Haplotype Block With Disease Risk in Populations of European Descent.

    abstract::Among people of European descent, the ability to digest lactose into adulthood arose via strong positive selection of a highly advantageous allele encompassing the lactase gene. Lactose-tolerant and intolerant individuals may have different disease risks due to the shared genetics of their haplotype block. Therefore, ...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2020.558762

    authors: Joslin SEK,Durbin-Johnson BP,Britton M,Settles ML,Korf I,Lemay DG

    更新日期:2020-10-29 00:00:00

  • Long noncoding RNAs: a potential novel class of cancer biomarkers.

    abstract::Long noncoding RNAs (lncRNAs) are a novel class of RNA molecules defined as transcripts longer than 200 nucleotides that lack protein coding potential. They constitute a major, but still poorly characterized part of human transcriptome, however, evidence is growing that they are important regulatory molecules involved...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章,评审

    doi:10.3389/fgene.2015.00145

    authors: Yarmishyn AA,Kurochkin IV

    更新日期:2015-04-23 00:00:00

  • A comparison of association methods for cytotoxicity mapping in pharmacogenomics.

    abstract::Cytotoxicity assays of immortalized lymphoblastoid cell lines (LCLs) represent a promising new in vitro approach in pharmacogenomics research. However, previous studies employing LCLs in gene mapping have used simple association methods, which may not adequately capture the true differences in non-linear response prof...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2011.00086

    authors: Brown C,Havener TM,Everitt L,McLeod H,Motsinger-Reif AA

    更新日期:2011-12-14 00:00:00

  • Indy mutants: live long and prosper.

    abstract::Indy encodes the fly homolog of a mammalian transporter of di and tricarboxylate components of the Krebs cycle. Reduced expression of fly Indy or two of the C. elegans Indy homologs leads to an increase in life span. Fly and worm tissues that play key roles in intermediary metabolism are also the places where Indy gen...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2012.00013

    authors: Frankel S,Rogina B

    更新日期:2012-02-17 00:00:00

  • A Comparative Genome-Wide Analysis of the R2R3-MYB Gene Family Among Four Gossypium Species and Their Sequence Variation and Association With Fiber Quality Traits in an Interspecific G. hirsutum × G. barbadense Population.

    abstract::Cotton (Gossypium spp.) is the most important natural fiber crop in the world. The R2R3-MYB gene family is a large gene family involved in many plant functions including cotton fiber development. Although previous studies have reported its phylogenetic relationships, gene structures, and expression patterns in tetrapl...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2019.00741

    authors: Wang N,Ma Q,Ma J,Pei W,Liu G,Cui Y,Wu M,Zang X,Zhang J,Yu S,Ma L,Yu J

    更新日期:2019-08-15 00:00:00

  • Validity and power of missing data imputation for extreme sampling and terminal measures designs in mediation analysis.

    abstract::Several authors have acknowledged that testing mediational hypotheses between treatments, genes, physiological measures, and behaviors may substantially advance our understanding of how these associations operate. In psychiatric research, the costs of measuring the putative mediator or the outcome can be prohibitive. ...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2011.00075

    authors: Makowsky R,Beasley TM,Gadbury GL,Albert JM,Kennedy RE,Allison DB

    更新日期:2011-10-31 00:00:00

  • Elimination of Reference Mapping Bias Reveals Robust Immune Related Allele-Specific Expression in Crossbred Sheep.

    abstract::Pervasive allelic variation at both gene and single nucleotide level (SNV) between individuals is commonly associated with complex traits in humans and animals. Allele-specific expression (ASE) analysis, using RNA-Seq, can provide a detailed annotation of allelic imbalance and infer the existence of cis-acting transcr...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2019.00863

    authors: Salavati M,Bush SJ,Palma-Vera S,McCulloch MEB,Hume DA,Clark EL

    更新日期:2019-09-19 00:00:00

  • Redundancy of the genetic code enables translational pausing.

    abstract::The codon redundancy ("degeneracy") found in protein-coding regions of mRNA also prescribes Translational Pausing (TP). When coupled with the appropriate interpreters, multiple meanings and functions are programmed into the same sequence of configurable switch-settings. This additional layer of Ontological Prescriptiv...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章,评审

    doi:10.3389/fgene.2014.00140

    authors: D'Onofrio DJ,Abel DL

    更新日期:2014-05-20 00:00:00

  • S100A6 Promotes B Lymphocyte Penetration Through the Blood-Brain Barrier in Autoimmune Encephalitis.

    abstract::Autoimmune encephalitis (AE) is a severe neurological disease. The brain of the AE patient is attacked by a dysregulated immune system, which is caused by the excessive production of autoantibodies against neuronal receptors and synaptic proteins. AE is also characterized by the uncontrolled B lymphocyte infiltration ...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2019.01188

    authors: Tsai MH,Lin CH,Tsai KW,Lin MH,Ho CJ,Lu YT,Weng KP,Lin Y,Lin PH,Li SC

    更新日期:2019-11-22 00:00:00

  • Comprehensive Analysis of Respiratory Burst Oxidase Homologs (Rboh) Gene Family and Function of GbRboh5/18 on Verticillium Wilt Resistance in Gossypium barbadense.

    abstract::Respiratory burst oxidase homologs (Rbohs) play a predominant role in reactive oxygen species (ROS) production, which is crucial in plant growth, differentiation, as well as their responses to biotic and abiotic stresses. To date, however, there is little knowledge about the function of cotton Rboh genes. Here, we ide...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2020.00788

    authors: Chang Y,Li B,Shi Q,Geng R,Geng S,Liu J,Zhang Y,Cai Y

    更新日期:2020-09-11 00:00:00

  • The molecular pathways underlying host resistance and tolerance to pathogens.

    abstract::Breeding livestock that are better able to withstand the onslaught of endemic- and exotic pathogens is high on the wish list of breeders and farmers world-wide. However, the defense systems in both pathogens and their hosts are complex and the degree of genetic variation in resistance and tolerance will depend on the ...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章

    doi:10.3389/fgene.2012.00263

    authors: Glass EJ

    更新日期:2012-12-14 00:00:00

  • The Trp73 Mutant Mice: A Ciliopathy Model That Uncouples Ciliogenesis From Planar Cell Polarity.

    abstract::p73 transcription factor belongs to one of the most important gene families in vertebrate biology, the p53-family. Trp73 gene, like the other family members, generates multiple isoforms named TA and DNp73, with different and, sometimes, antagonist functions. Although p73 shares many biological functions with p53, it a...

    journal_title:Frontiers in genetics

    pub_type: 杂志文章,评审

    doi:10.3389/fgene.2019.00154

    authors: Marques MM,Villoch-Fernandez J,Maeso-Alonso L,Fuertes-Alvarez S,Marin MC

    更新日期:2019-03-15 00:00:00