Abstract:
BACKGROUND:RNA-Seq is now widely used as a research tool. Choices must be made whether to use paired-end (PE) or single-end (SE) sequencing, and whether to use strand-specific or non-specific (NS) library preparation kits. To date there has been no analysis of the effect of these choices on identifying differentially expressed genes (DEGs) between controls and treated samples and on downstream functional analysis. RESULTS:We undertook four mammalian transcriptomics experiments to compare the effect of SE and PE protocols on read mapping, feature counting, identification of DEGs and functional analysis. For three of these experiments we also compared a non-stranded (NS) and a strand-specific approach to mapping the paired-end data. SE mapping resulted in a reduced number of reads mapped to features, in all four experiments, and lower read count per gene. Up to 4.3% of genes in the SE data and up to 12.3% of genes in the NS data had read counts which were significantly different compared to the PE data. Comparison of DEGs showed the presence of false positives (average 5%, using voom) and false negatives (average 5%, using voom) using the SE reads. These increased further, by one or two percentage points, with the NS data. Gene ontology functional enrichment (GO) of the DEGs arising from SE or NS approaches, revealed striking differences in the top 20 GO terms, with as little as 40% concordance with PE results. Caution is therefore advised in the interpretation of such results. By comparison, there was overall consistency in gene set enrichment analysis results. CONCLUSIONS:A strand-specific protocol should be used in library preparation to generate the most reliable and accurate profile of expression. Ideally PE reads are also recommended particularly for transcriptome assembly. Whilst SE reads produce a DEG list with around 5% of false positives and false negatives, this method can substantially reduce sequencing cost and this saving could be used to increase the number of biological replicates thereby increasing the power of the experiment. As SE reads, when used in association with gene set enrichment, can generate accurate biological results, this may be a desirable trade-off.
journal_name
BMC Genomicsjournal_title
BMC genomicsauthors
Corley SM,MacKenzie KL,Beverdam A,Roddam LF,Wilkins MRdoi
10.1186/s12864-017-3797-0subject
Has Abstractpub_date
2017-05-23 00:00:00pages
399issue
1issn
1471-2164pii
10.1186/s12864-017-3797-0journal_volume
18pub_type
杂志文章相关文献
BMC GENOMICS文献大全abstract:BACKGROUND:Though glioblastoma multiforme (GBM) is the most frequently occurring brain malignancy in adults, clinical treatment still faces challenges due to poor prognoses and tumor relapses. Recently, microRNAs (miRNAs) have been extensively used with the aim of developing accurate molecular therapies, because of the...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-3321-y
更新日期:2016-12-22 00:00:00
abstract:BACKGROUND:Small RNAs (sRNAs) have emerged as important regulatory molecules and have been studied in several bacteria. However, to date, there have been no whole-transcriptome studies on sRNAs in any of the Soft Rot Enterobacteriaceae (SRE) group of pathogens. Although the main ecological niches for these pathogens ar...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-2376-0
更新日期:2016-01-12 00:00:00
abstract:BACKGROUND:Infectious salmon anemia virus (ISAV) causes a multisystemic disease responsible for severe losses in salmon aquaculture. Better understanding of factors that explain variations in resistance between individuals and families is essential for development of strategies for disease control. To approach this, we...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-9-179
更新日期:2008-04-18 00:00:00
abstract:BACKGROUND:The Multinational Brassica rapa Genome Sequencing Project (BrGSP) has developed valuable genomic resources, including BAC libraries, BAC-end sequences, genetic and physical maps, and seed BAC sequences for Brassica rapa. An integrated linkage map between the amphidiploid B. napus and diploid B. rapa will fac...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-594
更新日期:2010-10-22 00:00:00
abstract:BACKGROUND:Two-component systems (TCS) play critical roles in sensing and responding to environmental cues. Azospirillum is a plant growth-promoting rhizobacterium living in the rhizosphere of many important crops. Despite numerous studies about its plant beneficial properties, little is known about how the bacterium s...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-1962-x
更新日期:2015-10-22 00:00:00
abstract:BACKGROUND:Acinetobacter baumannii is a significant hospital pathogen, particularly due to the dissemination of highly multidrug resistant isolates. Genome data have revealed that A. baumannii is highly genetically diverse, which correlates with major variations seen at the phenotypic level. Thus far, comparative genom...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-1020
更新日期:2014-11-25 00:00:00
abstract:BACKGROUND:Fusarium verticillioides causes ear rot in maize (Zea mays L.) and accumulation of mycotoxins, that affect human and animal health. Currently, chemical and agronomic measures to control Fusarium ear rot are not very effective and selection of more resistant genotypes is a desirable strategy to reduce contami...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-710
更新日期:2014-08-25 00:00:00
abstract:BACKGROUND:Copy number variation (CNV) is a major component of genomic variation, yet methods to accurately type genomic CNV lag behind methods that type single nucleotide variation. High-throughput sequencing can contribute to these methods by using sequence read depth, which takes the number of reads that map to a gi...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-2123-y
更新日期:2015-11-02 00:00:00
abstract:BACKGROUND:We present here the assembly of the bovine genome. The assembly method combines the BAC plus WGS local assembly used for the rat and sea urchin with the whole genome shotgun (WGS) only assembly used for many other animal genomes including the rhesus macaque. RESULTS:The assembly process consisted of multipl...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-10-180
更新日期:2009-04-24 00:00:00
abstract:BACKGROUND:Due to the limited availability and high cost of fish oil in the face of increasing aquaculture production, there is a need to reduce usage of fish oil in aquafeeds without compromising farm fish health. Therefore, the present study was conducted to determine if different levels of vegetable and fish oils ca...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-017-4099-2
更新日期:2017-09-08 00:00:00
abstract:BACKGROUND:The European spruce bark beetle, Ips typographus, and the North American mountain pine beetle, Dendroctonus ponderosae (Coleoptera: Curculionidae: Scolytinae), are severe pests of coniferous forests. Both bark beetle species utilize aggregation pheromones to coordinate mass-attacks on host trees, while odora...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-14-198
更新日期:2013-03-21 00:00:00
abstract:BACKGROUND:Despite the importance of wheat as a major staple crop and the negative impact of diseases on its production worldwide, the genetic mechanisms and gene interactions involved in the resistance response in wheat are still poorly understood. The complete sequence of the rice genome has provided an extremely use...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-14-166
更新日期:2013-03-12 00:00:00
abstract:BACKGROUND:Mammalian genomes contain a large number (approximately 1000) of olfactory receptor (OR) genes, many of which (20 to 50%) are pseudogenes. OR gene transcription is not restricted to the olfactory epithelium, but is found in numerous tissues. Using microarray hybridization and RTqPCR, we analyzed the mRNA pro...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-10-572
更新日期:2009-12-02 00:00:00
abstract:BACKGROUND:The identification and quantification of proteins using label-free Liquid Chromatography/Mass Spectrometry (LC/MS) play crucial roles in biological and biomedical research. Increasing evidence has shown that biomarkers are often low abundance proteins. However, LC/MS systems are subject to considerable noise...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-S3-S8
更新日期:2010-12-01 00:00:00
abstract:BACKGROUND:Parasitic wasps constitute one of the largest group of venomous animals. Although some physiological effects of their venoms are well documented, relatively little is known at the molecular level on the protein composition of these secretions. To identify the majority of the venom proteins of the endoparasit...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-693
更新日期:2010-12-07 00:00:00
abstract:BACKGROUND:Protein Disulfide Isomerases are thiol oxidoreductase chaperones from thioredoxin superfamily with crucial roles in endoplasmic reticulum proteostasis, implicated in many diseases. The family prototype PDIA1 is also involved in vascular redox cell signaling. PDIA1 is coded by the P4HB gene. While forced chan...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-020-07164-y
更新日期:2020-11-04 00:00:00
abstract:BACKGROUND:Compared to other ascomycetes, the barley powdery mildew pathogen Blumeria graminis f.sp. hordei (Bgh) has a large genome (ca. 120 Mbp) that harbors a relatively small number of protein-coding genes (ca. 6500). This genomic assemblage is thought to be the result of numerous gene losses, which likely represen...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-843
更新日期:2014-10-02 00:00:00
abstract:BACKGROUND:A variety of species and experimental designs have been used to study genetic influences on alcohol dependence, ethanol response, and related traits. Integration of these heterogeneous data can be used to produce a ranked target gene list for additional investigation. RESULTS:In this study, we performed a u...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-S8-S16
更新日期:2012-01-01 00:00:00
abstract:BACKGROUND:With the sequence of the Plasmodium falciparum genome and several global mRNA and protein life cycle expression profiling projects now completed, elucidating the underlying networks of transcriptional control important for the progression of the parasite life cycle is highly pertinent to the development of n...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-9-70
更新日期:2008-02-07 00:00:00
abstract:BACKGROUND:Genome-wide association studies (GWAS) have identified many individual genes associated with brain imaging quantitative traits (QTs) in Alzheimer's disease (AD). However single marker level association discovery may not be able to address the underlying biological interactions with disease mechanism. RESULT...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-020-07282-7
更新日期:2020-12-29 00:00:00
abstract:BACKGROUND:The purpose of this research was to develop a novel information theoretic method and an efficient algorithm for analyzing the gene-gene (GGI) and gene-environmental interactions (GEI) associated with quantitative traits (QT). The method is built on two information-theoretic metrics, the k-way interaction inf...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-10-509
更新日期:2009-11-04 00:00:00
abstract:BACKGROUND:Taraxacum kok-saghyz R. (Tks) is a promising alternative species to Hevea brasiliensis for production of high quality natural rubber (NR). A comparative transcriptome analysis of plants with differential production of NR will contribute to elucidate which genes are involved in the synthesis, regulation and a...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-018-5287-4
更新日期:2018-12-04 00:00:00
abstract:BACKGROUND:The packaging of DNA into chromatin regulates transcription from initiation through 3' end processing. One aspect of transcription in which chromatin plays a poorly understood role is the co-transcriptional splicing of pre-mRNA. RESULTS:Here we provide evidence that H2B monoubiquitylation (H2BK123ub1) marks...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-12-627
更新日期:2011-12-22 00:00:00
abstract:BACKGROUND:Changes in gene regulation are thought to be crucial for the adaptation of organisms to their environment. Transcriptome analyses can be used to identify candidate genes for ecological adaptation, but can be complicated by variation in gene expression between tissues, sexes, or individuals. Here we use high-...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-654
更新日期:2012-11-21 00:00:00
abstract:BACKGROUND:During cancer progression, malignant cells accumulate somatic mutations that can lead to genetic aberrations. In particular, evolutionary events akin to segmental duplications or deletions can alter the copy-number profile (CNP) of a set of genes in a genome. Our aim is to compute the evolutionary distance b...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-020-6611-3
更新日期:2020-04-16 00:00:00
abstract:BACKGROUND:Bombyx mori was domesticated from the Chinese wild silkworm, Bombyx mandarina. Wild and domestic silkworms are good models in which to investigate genes related to silk protein synthesis that may be differentially expressed in silk glands, because their silk productions are very different. Here we used the m...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-1287-9
更新日期:2015-02-06 00:00:00
abstract:BACKGROUND:G protein-coupled receptors (GPCRs) are major players in cell communication, regulate a whole range of physiological functions during development and throughout adult life, are affected in numerous pathological situations, and constitute so far the largest class of drugable targets for human diseases. The co...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-12-241
更新日期:2011-05-16 00:00:00
abstract:BACKGROUND:Typhoid fever is an acute systemic infection of humans caused by Salmonella enterica subspecies enterica serovar Typhi (S. Typhi). In chronic carriers, the bacteria survive the harsh environment of the gallbladder by producing biofilm. The phenotype of S. Typhi biofilm cells is significantly different from t...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-017-4212-6
更新日期:2017-10-31 00:00:00
abstract:BACKGROUND:Eucalyptus is one of the most important sources of industrial cellulose. Three species of this botanical group are intensively used in breeding programs: E. globulus, E. grandis and E. urophylla. E. globulus is adapted to subtropical/temperate areas and is considered a source of high-quality cellulose; E. gr...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-14-201
更新日期:2013-03-22 00:00:00
abstract:BACKGROUND:In many eukaryotes, microRNAs (miRNAs) bind to complementary sites in the 3'-untranslated regions (3'-UTRs) of target messenger RNAs (mRNAs) and regulate their expression at the stage of translation. Recent studies have revealed that many miRNAs are evolutionarily conserved; however, the evolution of their t...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-101
更新日期:2010-02-09 00:00:00