Differentially expressed genes from RNA-Seq and functional enrichment results are affected by the choice of single-end versus paired-end reads and stranded versus non-stranded protocols.

Abstract:

BACKGROUND: RNA-Seq is now widely used as a research tool. Researchers must choose between paired-end (PE) and single-end (SE) sequencing, and between strand-specific and non-stranded (NS) library preparation kits. To date there has been no analysis of the effect of these choices on identifying differentially expressed genes (DEGs) between control and treated samples and on downstream functional analysis.
RESULTS: We undertook four mammalian transcriptomics experiments to compare the effect of SE and PE protocols on read mapping, feature counting, identification of DEGs and functional analysis. For three of these experiments we also compared an NS and a strand-specific approach to mapping the paired-end data. In all four experiments, SE mapping resulted in fewer reads mapped to features and lower read counts per gene. Up to 4.3% of genes in the SE data and up to 12.3% of genes in the NS data had read counts that differed significantly from the PE data. Comparison of DEGs showed the presence of false positives and false negatives (each averaging 5%, using voom) using the SE reads. These increased further, by one or two percentage points, with the NS data. Gene ontology (GO) functional enrichment of the DEGs arising from SE or NS approaches revealed striking differences in the top 20 GO terms, with as little as 40% concordance with PE results. Caution is therefore advised in the interpretation of such results. By comparison, there was overall consistency in gene set enrichment analysis results.
CONCLUSIONS: A strand-specific protocol should be used in library preparation to generate the most reliable and accurate profile of expression. Ideally, PE reads are also recommended, particularly for transcriptome assembly. Whilst SE reads produce a DEG list with around 5% of false positives and false negatives, this method can substantially reduce sequencing cost, and this saving could be used to increase the number of biological replicates, thereby increasing the power of the experiment. As SE reads, when used in association with gene set enrichment, can generate accurate biological results, this may be a desirable trade-off.
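
The false-positive, false-negative and top-20 concordance figures quoted in the abstract are comparisons between gene or term lists. The abstract does not state the exact denominators used, so the following is only a minimal sketch under stated assumptions: the PE result is treated as the reference, false positives are expressed as a share of the test list, false negatives as a share of the reference list, and concordance as the overlap of the top-ranked terms. The function names and the toy identifiers are hypothetical and are not taken from the paper.

```python
def compare_deg_sets(reference, test):
    """Compare a test DEG list (e.g. from SE reads) with a reference list (e.g. from PE reads).

    Assumption: false positives are reported relative to the size of the test
    list, and false negatives relative to the size of the reference list.
    """
    reference, test = set(reference), set(test)
    false_pos = test - reference   # called in test only
    false_neg = reference - test   # missed by test
    return {
        "false_positive_pct": 100 * len(false_pos) / len(test) if test else 0.0,
        "false_negative_pct": 100 * len(false_neg) / len(reference) if reference else 0.0,
    }


def top_n_concordance(reference_ranked, test_ranked, n=20):
    """Percentage of the reference top-n terms that also appear in the test top-n."""
    ref_top = set(reference_ranked[:n])
    test_top = set(test_ranked[:n])
    return 100 * len(ref_top & test_top) / len(ref_top) if ref_top else 0.0


if __name__ == "__main__":
    # Toy identifiers, purely illustrative.
    pe_degs = ["geneA", "geneB", "geneC", "geneD"]
    se_degs = ["geneA", "geneB", "geneC", "geneE"]
    print(compare_deg_sets(pe_degs, se_degs))

    pe_terms = ["t1", "t2", "t3", "t4", "t5"]
    se_terms = ["t1", "t2", "x", "y", "z"]
    print(top_n_concordance(pe_terms, se_terms, n=5))
```

Set operations are used because each comparison only asks whether an identifier is present in both lists; ranking matters only when selecting the top n terms.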

journal_name

BMC Genomics

authors

Corley SM,MacKenzie KL,Beverdam A,Roddam LF,Wilkins MR

doi

10.1186/s12864-017-3797-0

pub_date

2017-05-23 00:00:00

pages

399

issue

1

issn

1471-2164

pii

10.1186/s12864-017-3797-0

journal_volume

18

pub_type

Journal Article
  • Estimating survival time of patients with glioblastoma multiforme and characterization of the identified microRNA signatures.

    abstract:BACKGROUND:Though glioblastoma multiforme (GBM) is the most frequently occurring brain malignancy in adults, clinical treatment still faces challenges due to poor prognoses and tumor relapses. Recently, microRNAs (miRNAs) have been extensively used with the aim of developing accurate molecular therapies, because of the...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-3321-y

    authors: Yerukala Sathipati S,Huang HL,Ho SY

    updated:2016-12-22 00:00:00

  • Discovery and profiling of small RNAs responsive to stress conditions in the plant pathogen Pectobacterium atrosepticum.

    abstract:BACKGROUND:Small RNAs (sRNAs) have emerged as important regulatory molecules and have been studied in several bacteria. However, to date, there have been no whole-transcriptome studies on sRNAs in any of the Soft Rot Enterobacteriaceae (SRE) group of pathogens. Although the main ecological niches for these pathogens ar...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2376-0

    authors: Kwenda S,Gorshkov V,Ramesh AM,Naidoo S,Rubagotti E,Birch PR,Moleleki LN

    updated:2016-01-12 00:00:00

  • Gene expression analyses in Atlantic salmon challenged with infectious salmon anemia virus reveal differences between individuals with early, intermediate and late mortality.

    abstract:BACKGROUND:Infectious salmon anemia virus (ISAV) causes a multisystemic disease responsible for severe losses in salmon aquaculture. Better understanding of factors that explain variations in resistance between individuals and families is essential for development of strategies for disease control. To approach this, we...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-179

    authors: Jørgensen SM,Afanasyev S,Krasnov A

    updated:2008-04-18 00:00:00

  • Construction of an integrated genetic linkage map for the A genome of Brassica napus using SSR markers derived from sequenced BACs in B. rapa.

    abstract:BACKGROUND:The Multinational Brassica rapa Genome Sequencing Project (BrGSP) has developed valuable genomic resources, including BAC libraries, BAC-end sequences, genetic and physical maps, and seed BAC sequences for Brassica rapa. An integrated linkage map between the amphidiploid B. napus and diploid B. rapa will fac...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-594

    authors: Xu J,Qian X,Wang X,Li R,Cheng X,Yang Y,Fu J,Zhang S,King GJ,Wu J,Liu K

    updated:2010-10-22 00:00:00

  • Genome-wide survey of two-component signal transduction systems in the plant growth-promoting bacterium Azospirillum.

    abstract:BACKGROUND:Two-component systems (TCS) play critical roles in sensing and responding to environmental cues. Azospirillum is a plant growth-promoting rhizobacterium living in the rhizosphere of many important crops. Despite numerous studies about its plant beneficial properties, little is known about how the bacterium s...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1962-x

    authors: Borland S,Oudart A,Prigent-Combaret C,Brochier-Armanet C,Wisniewski-Dyé F

    updated:2015-10-22 00:00:00

  • Comparative analysis of surface-exposed virulence factors of Acinetobacter baumannii.

    abstract:BACKGROUND:Acinetobacter baumannii is a significant hospital pathogen, particularly due to the dissemination of highly multidrug resistant isolates. Genome data have revealed that A. baumannii is highly genetically diverse, which correlates with major variations seen at the phenotypic level. Thus far, comparative genom...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-1020

    authors: Eijkelkamp BA,Stroeher UH,Hassan KA,Paulsen IT,Brown MH

    updated:2014-11-25 00:00:00

  • Functional genomic analysis of constitutive and inducible defense responses to Fusarium verticillioides infection in maize genotypes with contrasting ear rot resistance.

    abstract:BACKGROUND:Fusarium verticillioides causes ear rot in maize (Zea mays L.) and accumulation of mycotoxins, that affect human and animal health. Currently, chemical and agronomic measures to control Fusarium ear rot are not very effective and selection of more resistant genotypes is a desirable strategy to reduce contami...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-710

    authors: Lanubile A,Ferrarini A,Maschietto V,Delledonne M,Marocco A,Bellin D

    updated:2014-08-25 00:00:00

  • Determining multiallelic complex copy number and sequence variation from high coverage exome sequencing data.

    abstract:BACKGROUND:Copy number variation (CNV) is a major component of genomic variation, yet methods to accurately type genomic CNV lag behind methods that type single nucleotide variation. High-throughput sequencing can contribute to these methods by using sequence read depth, which takes the number of reads that map to a gi...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-2123-y

    authors: Forni D,Martin D,Abujaber R,Sharp AJ,Sironi M,Hollox EJ

    updated:2015-11-02 00:00:00

  • Bos taurus genome assembly.

    abstract:BACKGROUND:We present here the assembly of the bovine genome. The assembly method combines the BAC plus WGS local assembly used for the rat and sea urchin with the whole genome shotgun (WGS) only assembly used for many other animal genomes including the rhesus macaque. RESULTS:The assembly process consisted of multipl...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-180

    authors: Liu Y,Qin X,Song XZ,Jiang H,Shen Y,Durbin KJ,Lien S,Kent MP,Sodeland M,Ren Y,Zhang L,Sodergren E,Havlak P,Worley KC,Weinstock GM,Gibbs RA

    updated:2009-04-24 00:00:00

  • Transcriptome profiling of antiviral immune and dietary fatty acid dependent responses of Atlantic salmon macrophage-like cells.

    abstract:BACKGROUND:Due to the limited availability and high cost of fish oil in the face of increasing aquaculture production, there is a need to reduce usage of fish oil in aquafeeds without compromising farm fish health. Therefore, the present study was conducted to determine if different levels of vegetable and fish oils ca...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-4099-2

    authors: Eslamloo K,Xue X,Hall JR,Smith NC,Caballero-Solares A,Parrish CC,Taylor RG,Rise ML

    updated:2017-09-08 00:00:00

  • Antennal transcriptome analysis of the chemosensory gene families in the tree killing bark beetles, Ips typographus and Dendroctonus ponderosae (Coleoptera: Curculionidae: Scolytinae).

    abstract:BACKGROUND:The European spruce bark beetle, Ips typographus, and the North American mountain pine beetle, Dendroctonus ponderosae (Coleoptera: Curculionidae: Scolytinae), are severe pests of coniferous forests. Both bark beetle species utilize aggregation pheromones to coordinate mass-attacks on host trees, while odora...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-198

    authors: Andersson MN,Grosse-Wilde E,Keeling CI,Bengtsson JM,Yuen MM,Li M,Hillbur Y,Bohlmann J,Hansson BS,Schlyter F

    updated:2013-03-21 00:00:00

  • Comparative analysis of protein-protein interactions in the defense response of rice and wheat.

    abstract:BACKGROUND:Despite the importance of wheat as a major staple crop and the negative impact of diseases on its production worldwide, the genetic mechanisms and gene interactions involved in the resistance response in wheat are still poorly understood. The complete sequence of the rice genome has provided an extremely use...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-166

    authors: Cantu D,Yang B,Ruan R,Li K,Menzo V,Fu D,Chern M,Ronald PC,Dubcovsky J

    updated:2013-03-12 00:00:00

  • RNA profiles of rat olfactory epithelia: individual and age related variations.

    abstract:BACKGROUND:Mammalian genomes contain a large number (approximately 1000) of olfactory receptor (OR) genes, many of which (20 to 50%) are pseudogenes. OR gene transcription is not restricted to the olfactory epithelium, but is found in numerous tissues. Using microarray hybridization and RTqPCR, we analyzed the mRNA pro...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-572

    authors: Rimbault M,Robin S,Vaysse A,Galibert F

    updated:2009-12-02 00:00:00

  • ICPD-a new peak detection algorithm for LC/MS.

    abstract:BACKGROUND:The identification and quantification of proteins using label-free Liquid Chromatography/Mass Spectrometry (LC/MS) play crucial roles in biological and biomedical research. Increasing evidence has shown that biomarkers are often low abundance proteins. However, LC/MS systems are subject to considerable noise...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-S3-S8

    authors: Zhang J,Haskins W

    updated:2010-12-01 00:00:00

  • The venom composition of the parasitic wasp Chelonus inanitus resolved by combined expressed sequence tags analysis and proteomic approach.

    abstract:BACKGROUND:Parasitic wasps constitute one of the largest group of venomous animals. Although some physiological effects of their venoms are well documented, relatively little is known at the molecular level on the protein composition of these secretions. To identify the majority of the venom proteins of the endoparasit...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-693

    authors: Vincent B,Kaeslin M,Roth T,Heller M,Poulain J,Cousserans F,Schaller J,Poirié M,Lanzrein B,Drezen JM,Moreau SJ

    updated:2010-12-07 00:00:00

  • Analysis of splice variants of the human protein disulfide isomerase (P4HB) gene.

    abstract:BACKGROUND:Protein Disulfide Isomerases are thiol oxidoreductase chaperones from thioredoxin superfamily with crucial roles in endoplasmic reticulum proteostasis, implicated in many diseases. The family prototype PDIA1 is also involved in vascular redox cell signaling. PDIA1 is coded by the P4HB gene. While forced chan...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-07164-y

    authors: Kajihara D,Hon CC,Abdullah AN,Wosniak J Jr,Moretti AIS,Poloni JF,Bonatto D,Hashimoto K,Carninci P,Laurindo FRM

    updated:2020-11-04 00:00:00

  • In silico analysis of the core signaling proteome from the barley powdery mildew pathogen (Blumeria graminis f.sp. hordei).

    abstract:BACKGROUND:Compared to other ascomycetes, the barley powdery mildew pathogen Blumeria graminis f.sp. hordei (Bgh) has a large genome (ca. 120 Mbp) that harbors a relatively small number of protein-coding genes (ca. 6500). This genomic assemblage is thought to be the result of numerous gene losses, which likely represen...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-843

    authors: Kusch S,Ahmadinejad N,Panstruga R,Kuhn H

    updated:2014-10-02 00:00:00

  • Multi-species data integration and gene ranking enrich significant results in an alcoholism genome-wide association study.

    abstract:BACKGROUND:A variety of species and experimental designs have been used to study genetic influences on alcohol dependence, ethanol response, and related traits. Integration of these heterogeneous data can be used to produce a ranked target gene list for additional investigation. RESULTS:In this study, we performed a u...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-S8-S16

    authors: Zhao Z,Guo AY,van den Oord EJ,Aliev F,Jia P,Edenberg HJ,Riley BP,Dick DM,Bettinger JC,Davies AG,Grotewiel MS,Schuckit MA,Agrawal A,Kramer J,Nurnberger JI Jr,Kendler KS,Webb BT,Miles MF

    updated:2012-01-01 00:00:00

  • In silico discovery of transcription regulatory elements in Plasmodium falciparum.

    abstract:BACKGROUND:With the sequence of the Plasmodium falciparum genome and several global mRNA and protein life cycle expression profiling projects now completed, elucidating the underlying networks of transcriptional control important for the progression of the parasite life cycle is highly pertinent to the development of n...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-70

    authors: Young JA,Johnson JR,Benner C,Yan SF,Chen K,Le Roch KG,Zhou Y,Winzeler EA

    updated:2008-02-07 00:00:00

  • Multivariate genome wide association and network analysis of subcortical imaging phenotypes in Alzheimer's disease.

    abstract:BACKGROUND:Genome-wide association studies (GWAS) have identified many individual genes associated with brain imaging quantitative traits (QTs) in Alzheimer's disease (AD). However single marker level association discovery may not be able to address the underlying biological interactions with disease mechanism. RESULT...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-07282-7

    authors: Meng X,Li J,Zhang Q,Chen F,Bian C,Yao X,Yan J,Xu Z,Risacher SL,Saykin AJ,Liang H,Shen L,Alzheimer’s Disease Neuroimaging Initiative.

    updated:2020-12-29 00:00:00

  • Information-theoretic gene-gene and gene-environment interaction analysis of quantitative traits.

    abstract:BACKGROUND:The purpose of this research was to develop a novel information theoretic method and an efficient algorithm for analyzing the gene-gene (GGI) and gene-environmental interactions (GEI) associated with quantitative traits (QT). The method is built on two information-theoretic metrics, the k-way interaction inf...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-509

    authors: Chanda P,Sucheston L,Liu S,Zhang A,Ramanathan M

    updated:2009-11-04 00:00:00

  • Comparative transcriptomics between high and low rubber producing Taraxacum kok-saghyz R. plants.

    abstract:BACKGROUND:Taraxacum kok-saghyz R. (Tks) is a promising alternative species to Hevea brasiliensis for production of high quality natural rubber (NR). A comparative transcriptome analysis of plants with differential production of NR will contribute to elucidate which genes are involved in the synthesis, regulation and a...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-5287-4

    authors: Panara F,Lopez L,Daddiego L,Fantini E,Facella P,Perrotta G

    updated:2018-12-04 00:00:00

  • H2B ubiquitylation is part of chromatin architecture that marks exon-intron structure in budding yeast.

    abstract:BACKGROUND:The packaging of DNA into chromatin regulates transcription from initiation through 3' end processing. One aspect of transcription in which chromatin plays a poorly understood role is the co-transcriptional splicing of pre-mRNA. RESULTS:Here we provide evidence that H2B monoubiquitylation (H2BK123ub1) marks...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-627

    authors: Shieh GS,Pan CH,Wu JH,Sun YJ,Wang CC,Hsiao WC,Lin CY,Tung L,Chang TH,Fleming AB,Hillyer C,Lo YC,Berger SL,Osley MA,Kao CF

    updated:2011-12-22 00:00:00

  • Population and sex differences in Drosophila melanogaster brain gene expression.

    abstract:BACKGROUND:Changes in gene regulation are thought to be crucial for the adaptation of organisms to their environment. Transcriptome analyses can be used to identify candidate genes for ecological adaptation, but can be complicated by variation in gene expression between tissues, sexes, or individuals. Here we use high-...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-654

    authors: Catalán A,Hutter S,Parsch J

    updated:2012-11-21 00:00:00

  • Comparing copy-number profiles under multi-copy amplifications and deletions.

    abstract:BACKGROUND:During cancer progression, malignant cells accumulate somatic mutations that can lead to genetic aberrations. In particular, evolutionary events akin to segmental duplications or deletions can alter the copy-number profile (CNP) of a set of genes in a genome. Our aim is to compute the evolutionary distance b...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-6611-3

    authors: Cordonnier G,Lafond M

    updated:2020-04-16 00:00:00

  • Comparative analysis of the silk gland transcriptomes between the domestic and wild silkworms.

    abstract:BACKGROUND:Bombyx mori was domesticated from the Chinese wild silkworm, Bombyx mandarina. Wild and domestic silkworms are good models in which to investigate genes related to silk protein synthesis that may be differentially expressed in silk glands, because their silk productions are very different. Here we used the m...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1287-9

    authors: Fang SM,Hu BL,Zhou QZ,Yu QY,Zhang Z

    updated:2015-02-06 00:00:00

  • Genome-wide profiling of G protein-coupled receptors in cerebellar granule neurons using high-throughput, real-time PCR.

    abstract:BACKGROUND:G protein-coupled receptors (GPCRs) are major players in cell communication, regulate a whole range of physiological functions during development and throughout adult life, are affected in numerous pathological situations, and constitute so far the largest class of drugable targets for human diseases. The co...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-241

    authors: Maurel B,Le Digarcher A,Dantec C,Journot L

    updated:2011-05-16 00:00:00

  • Transcriptomic study of Salmonella enterica subspecies enterica serovar Typhi biofilm.

    abstract:BACKGROUND:Typhoid fever is an acute systemic infection of humans caused by Salmonella enterica subspecies enterica serovar Typhi (S. Typhi). In chronic carriers, the bacteria survive the harsh environment of the gallbladder by producing biofilm. The phenotype of S. Typhi biofilm cells is significantly different from t...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-4212-6

    authors: Chin KCJ,Taylor TD,Hebrard M,Anbalagan K,Dashti MG,Phua KK

    updated:2017-10-31 00:00:00

  • Xylem transcription profiles indicate potential metabolic responses for economically relevant characteristics of Eucalyptus species.

    abstract:BACKGROUND:Eucalyptus is one of the most important sources of industrial cellulose. Three species of this botanical group are intensively used in breeding programs: E. globulus, E. grandis and E. urophylla. E. globulus is adapted to subtropical/temperate areas and is considered a source of high-quality cellulose; E. gr...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-201

    authors: Salazar MM,Nascimento LC,Camargo EL,Gonçalves DC,Lepikson Neto J,Marques WL,Teixeira PJ,Mieczkowski P,Mondego JM,Carazzolle MF,Deckmann AC,Pereira GA

    updated:2013-03-22 00:00:00

  • Computational prediction and experimental validation of evolutionarily conserved microRNA target genes in bilaterian animals.

    abstract:BACKGROUND:In many eukaryotes, microRNAs (miRNAs) bind to complementary sites in the 3'-untranslated regions (3'-UTRs) of target messenger RNAs (mRNAs) and regulate their expression at the stage of translation. Recent studies have revealed that many miRNAs are evolutionarily conserved; however, the evolution of their t...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-101

    authors: Takane K,Fujishima K,Watanabe Y,Sato A,Saito N,Tomita M,Kanai A

    updated:2010-02-09 00:00:00