A memory-efficient algorithm to obtain splicing graphs and de novo expression estimates from de Bruijn graphs of RNA-Seq data.

Abstract:

BACKGROUND: The recent advance of high-throughput sequencing makes it feasible to study entire transcriptomes through the application of de novo sequence assembly algorithms. While a popular strategy is to first construct an intermediate de Bruijn graph structure to represent the transcriptome, an additional step is needed to construct predicted transcripts from the graph.

RESULTS: Since the de Bruijn graph contains all branching possibilities, we develop a memory-efficient algorithm to recover alternative splicing information and library-specific expression information directly from the graph without prior genomic knowledge. We implement the algorithm as a postprocessing module of the Velvet assembler. We validate our algorithm by simulating the transcriptome assembly of Drosophila using its known genome, and by performing Drosophila transcriptome assembly using publicly available RNA-Seq libraries. Under a range of conditions, our algorithm recovers sequences and alternative splicing junctions with higher specificity than Oases or Trans-ABySS.

CONCLUSIONS: Since our postprocessing algorithm does not consume as much memory as Velvet and is less memory-intensive than Oases, it allows biologists to assemble large libraries with limited computational resources. Our algorithm has been applied to perform transcriptome assembly of the non-model blow fly Lucilia sericata that was reported in a previous article, which shows that the assembly is of high quality and it facilitates comparison of the Lucilia sericata transcriptome to Drosophila and two mosquitoes, prediction and experimental validation of alternative splicing, investigation of differential expression among various developmental stages, and identification of transposable elements.
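As a rough illustration of why a de Bruijn graph "contains all branching possibilities", the toy sketch below builds a graph from two short sequences that share a prefix and then diverge, and lists the nodes with more than one successor. This is not the authors' algorithm or the Velvet data structure: the function names `build_de_bruijn` and `branching_nodes`, the toy reads, and the choice of k = 4 are all assumptions made for the example, and real assemblers also handle reverse complements, coverage, and sequencing errors.

```python
from collections import defaultdict


def build_de_bruijn(reads, k):
    """Map each (k-1)-mer to the set of (k-1)-mers that follow it in some read.

    Hypothetical helper for illustration only; it ignores reverse
    complements, coverage counts, and error correction.
    """
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            # Each k-mer is an edge from its prefix to its suffix.
            graph[kmer[:-1]].add(kmer[1:])
    return graph


def branching_nodes(graph):
    """Return, in sorted order, the nodes that have more than one successor."""
    return sorted(node for node, successors in graph.items() if len(successors) > 1)


# Two toy sequences that agree on "ACGT" and then diverge,
# loosely mimicking two isoforms that share an exon.
reads = ["ACGTTGCA", "ACGTAGGC"]
graph = build_de_bruijn(reads, k=4)
branches = branching_nodes(graph)
print(branches)
```

In this toy case the only branching node is the last shared (k-1)-mer before the two sequences diverge; a splicing-graph method would need to interpret such branch points, which the sketch does not attempt.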

journal_name

BMC Genomics

authors

Sze SH,Tarone AM

doi

10.1186/1471-2164-15-S5-S6

pub_date

2014-01-01 00:00:00

pages

S6

issn

1471-2164

pii

1471-2164-15-S5-S6

journal_volume

15 Suppl 5

pub_type

Journal Article
  • A phenomics-based approach for the detection and interpretation of shared genetic influences on 29 biochemical indices in southern Chinese men.

    abstract:BACKGROUND:Phenomics provides new technologies and platforms as a systematic phenome-genome approach. However, few studies have reported on the systematic mining of shared genetics among clinical biochemical indices based on phenomics methods, especially in China. This study aimed to apply phenomics to systematically e...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-6363-0

    authors: Hu Y,Tan A,Yu L,Hou C,Kuang H,Wu Q,Su J,Zhou Q,Zhu Y,Zhang C,Wei W,Li L,Li W,Huang Y,Huang H,Xie X,Lu T,Zhang H,Yang X,Gao Y,Li T,Jiang Y,Mo Z

    updated:2019-12-16 00:00:00

  • Digital gene expression approach over multiple RNA-Seq data sets to detect neoblast transcriptional changes in Schmidtea mediterranea.

    abstract:BACKGROUND:The freshwater planarian Schmidtea mediterranea is recognised as a valuable model for research into adult stem cells and regeneration. With the advent of the high-throughput sequencing technologies, it has become feasible to undertake detailed transcriptional analysis of its unique stem cell population, the ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1533-1

    authors: Rodríguez-Esteban G,González-Sastre A,Rojo-Laguna JI,Saló E,Abril JF

    updated:2015-05-08 00:00:00

  • Contrasting gene expression patterns in grain of high and low asparagine wheat genotypes in response to sulphur supply.

    abstract:BACKGROUND:Free asparagine is the precursor for acrylamide formation during cooking and processing of grains, tubers, beans and other crop products. In wheat grain, free asparagine, free glutamine and total free amino acids accumulate to high levels in response to sulphur deficiency. In this study, RNA-seq data were ac...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5991-8

    authors: Curtis TY,Raffan S,Wan Y,King R,Gonzalez-Uriarte A,Halford NG

    updated:2019-08-01 00:00:00

  • Widespread promoter methylation of synaptic plasticity genes in long-term potentiation in the adult brain in vivo.

    abstract:BACKGROUND:DNA methylation is a key modulator of gene expression in mammalian development and cellular differentiation, including neurons. To date, the role of DNA modifications in long-term potentiation (LTP) has not been explored. RESULTS:To investigate the occurrence of DNA methylation changes in LTP, we undertook ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-3621-x

    authors: Maag JL,Kaczorowski DC,Panja D,Peters TJ,Bramham CR,Wibrand K,Dinger ME

    updated:2017-03-23 00:00:00

  • Comparative analysis of the predicted secretomes of Rosaceae scab pathogens Venturia inaequalis and V. pirina reveals expanded effector families and putative determinants of host range.

    abstract:BACKGROUND:Fungal plant pathogens belonging to the genus Venturia cause damaging scab diseases of members of the Rosaceae. In terms of economic impact, the most important of these are V. inaequalis, which infects apple, and V. pirina, which is a pathogen of European pear. Given that Venturia fungi colonise the sub-cuti...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-3699-1

    authors: Deng CH,Plummer KM,Jones DAB,Mesarich CH,Shiller J,Taranto AP,Robinson AJ,Kastner P,Hall NE,Templeton MD,Bowen JK

    updated:2017-05-02 00:00:00

  • Pea genomic selection for Italian environments.

    abstract:BACKGROUND:A thorough verification of the ability of genomic selection (GS) to predict estimated breeding values for pea (Pisum sativum L.) grain yield is pending. Prediction for different environments (inter-environment prediction) has key importance when breeding for target environments featuring high genotype × envi...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5920-x

    authors: Annicchiarico P,Nazzicari N,Pecetti L,Romani M,Russi L

    updated:2019-07-22 00:00:00

  • Origin of a novel protein-coding gene family with similar signal sequence in Schistosoma japonicum.

    abstract:BACKGROUND:Evolution of novel protein-coding genes is the bedrock of adaptive evolution. Recently, we identified six protein-coding genes with similar signal sequence from Schistosoma japonicum egg stage mRNA using signal sequence trap (SST). To find the mechanism underlying the origination of these genes with similar ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-260

    authors: Mbanefo EC,Chuanxin Y,Kikuchi M,Shuaibu MN,Boamah D,Kirinoki M,Hayashi N,Chigusa Y,Osada Y,Hamano S,Hirayama K

    updated:2012-06-20 00:00:00

  • The odds of duplicate gene persistence after polyploidization.

    abstract:BACKGROUND:Gene duplication is an important biological phenomenon associated with genomic redundancy, degeneration, specialization, innovation, and speciation. After duplication, both copies continue functioning when natural selection favors duplicated protein function or expression, or when mutations make them functio...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-599

    authors: Chain FJ,Dushoff J,Evans BJ

    updated:2011-12-12 00:00:00

  • Structure, phylogeny, allelic haplotypes and expression of sucrose transporter gene families in Saccharum.

    abstract:BACKGROUND:Sugarcane is an economically important crop contributing to about 80% of the world sugar production. Increasing efforts in molecular biological studies have been performed for improving the sugar yield and other relevant important agronomic traits. However, due to sugarcane's complicated genomes, it is still...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2419-6

    authors: Zhang Q,Hu W,Zhu F,Wang L,Yu Q,Ming R,Zhang J

    updated:2016-02-01 00:00:00

  • Transcriptome deep-sequencing and clustering of expressed isoforms from Favia corals.

    abstract:BACKGROUND:Genomic and transcriptomic sequence data are essential tools for tackling ecological problems. Using an approach that combines next-generation sequencing, de novo transcriptome assembly, gene annotation and synthetic gene construction, we identify and cluster the protein families from Favia corals from the n...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-546

    authors: Pooyaei Mehr SF,DeSalle R,Kao HT,Narechania A,Han Z,Tchernov D,Pieribone V,Gruber DF

    updated:2013-08-12 00:00:00

  • Transcript profiling of Populus tomentosa genes in normal, tension, and opposite wood by RNA-seq.

    abstract:BACKGROUND:Wood formation affects the chemical and physical properties of wood, and thus affects its utility as a building material or a feedstock for biofuels, pulp and paper. To obtain genome-wide insights on the transcriptome changes and regulatory networks in wood formation, we used high-throughput RNA sequencing t...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1390-y

    authors: Chen J,Chen B,Zhang D

    updated:2015-03-10 00:00:00

  • ESAP plus: a web-based server for EST-SSR marker development.

    abstract:BACKGROUND:Simple sequence repeats (SSRs) have become widely used as molecular markers in plant genetic studies due to their abundance, high allelic variation at each locus and simplicity to analyze using conventional PCR amplification. To study plants with unknown genome sequence, SSR markers from Expressed Sequence T...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-3328-4

    authors: Ponyared P,Ponsawat J,Tongsima S,Seresangtakul P,Akkasaeng C,Tantisuwichwong N

    updated:2016-12-22 00:00:00

  • GiSAO.db: a database for ageing research.

    abstract:BACKGROUND:Age-related gene expression patterns of Homo sapiens as well as of model organisms such as Mus musculus, Saccharomyces cerevisiae, Caenorhabditis elegans and Drosophila melanogaster are a basis for understanding the genetic mechanisms of ageing. For an effective analysis and interpretation of expression prof...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-262

    authors: Hofer E,Laschober GT,Hackl M,Thallinger GG,Lepperdinger G,Grillari J,Jansen-Dürr P,Trajanoski Z

    updated:2011-05-24 00:00:00

  • Predicting tissue specific transcription factor binding sites.

    abstract:BACKGROUND:Studies of gene regulation often utilize genome-wide predictions of transcription factor (TF) binding sites. Most existing prediction methods are based on sequence information alone, ignoring biological contexts such as developmental stages and tissue types. Experimental methods to study in vivo binding, inc...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-796

    authors: Zhong S,He X,Bar-Joseph Z

    updated:2013-11-15 00:00:00

  • Genome-wide survey and expression profiles of the AP2/ERF family in castor bean (Ricinus communis L.).

    abstract:BACKGROUND:The AP2/ERF transcription factor, one of the largest gene families in plants, plays a crucial role in the regulation of growth and development, metabolism, and responses to biotic and abiotic stresses. Castor bean (Ricinus communis L., Euphorbiaceae) is one of the most important non-edible oilseed crops and its s...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-785

    authors: Xu W,Li F,Ling L,Liu A

    updated:2013-11-13 00:00:00

  • Profile and functional analysis of small RNAs derived from Aspergillus fumigatus infected with double-stranded RNA mycoviruses.

    abstract:BACKGROUND:Mycoviruses are viruses that naturally infect and replicate in fungi. Aspergillus fumigatus, an opportunistic pathogen causing fungal lung diseases in humans and animals, was recently shown to harbour several different types of mycoviruses. A well-characterised defence against virus infection is RNA silencin...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-3773-8

    authors: Özkan S,Mohorianu I,Xu P,Dalmay T,Coutts RHA

    updated:2017-05-30 00:00:00

  • The repetitive component of the sunflower genome as shown by different procedures for assembling next generation sequencing reads.

    abstract:BACKGROUND:Next generation sequencing provides a powerful tool to study genome structure in species whose genomes are far from being completely sequenced. In this work we describe and compare different computational approaches to evaluate the repetitive component of the genome of sunflower, by using medium/low coverage...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-686

    authors: Natali L,Cossu RM,Barghini E,Giordani T,Buti M,Mascagni F,Morgante M,Gill N,Kane NC,Rieseberg L,Cavallini A

    updated:2013-10-06 00:00:00

  • Exploring the genetics of trotting racing ability in horses using a unique Nordic horse model.

    abstract:BACKGROUND:Horses have been strongly selected for speed, strength, and endurance-exercise traits since the onset of domestication. As a result, highly specialized horse breeds have developed with many modern horse breeds often representing closed populations with high phenotypic and genetic uniformity. However, a great...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5484-9

    authors: Velie BD,Lillie M,Fegraeus KJ,Rosengren MK,Solé M,Wiklund M,Ihler CF,Strand E,Lindgren G

    updated:2019-02-04 00:00:00

  • Motif depletion in bacteriophages infecting hosts with CRISPR systems.

    abstract:BACKGROUND:CRISPR is a microbial immune system likely to be involved in host-parasite coevolution. It functions using target sequences encoded by the bacterial genome, which interfere with invading nucleic acids using a homology-dependent system. The system also requires protospacer associated motifs (PAMs), short moti...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-663

    authors: Kupczok A,Bollback JP

    updated:2014-08-08 00:00:00

  • Analysis of functional variants in mitochondrial DNA of Finnish athletes.

    abstract:BACKGROUND:We have previously reported on paucity of mitochondrial DNA (mtDNA) haplogroups J and K among Finnish endurance athletes. Here we aimed to further explore differences in mtDNA variants between elite endurance and sprint athletes. For this purpose, we determined the rate of functional variants and the mutatio...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-6171-6

    authors: Kiiskilä J,Moilanen JS,Kytövuori L,Niemi AK,Majamaa K

    updated:2019-10-29 00:00:00

  • Dual RNA-seq transcriptional analysis of wheat roots colonized by Azospirillum brasilense reveals up-regulation of nutrient acquisition and cell cycle genes.

    abstract:BACKGROUND:The rapid growth of the world's population demands an increase in food production that can no longer be reached by increasing amounts of nitrogenous fertilizers. Plant growth promoting bacteria (PGPB) might be an alternative to increase nitrogen use efficiency (NUE) in important crops such as wheat. Azospiri...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-378

    authors: Camilios-Neto D,Bonato P,Wassem R,Tadra-Sfeir MZ,Brusamarello-Santos LC,Valdameri G,Donatti L,Faoro H,Weiss VA,Chubatsu LS,Pedrosa FO,Souza EM

    updated:2014-05-16 00:00:00

  • cnvScan: a CNV screening and annotation tool to improve the clinical utility of computational CNV prediction from exome sequencing data.

    abstract:BACKGROUND:With advances in next generation sequencing technology and analysis methods, single nucleotide variants (SNVs) and indels can be detected with high sensitivity and specificity in exome sequencing data. Recent studies have demonstrated the ability to detect disease-causing copy number variants (CNVs) in exome...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2374-2

    authors: Samarakoon PS,Sorte HS,Stray-Pedersen A,Rødningen OK,Rognes T,Lyle R

    updated:2016-01-14 00:00:00

  • Comprehensive analysis of CCCH-type zinc finger family genes facilitates functional gene discovery and reflects recent allopolyploidization event in tetraploid switchgrass.

    abstract:BACKGROUND:In recent years, dozens of Arabidopsis and rice CCCH-type zinc finger genes have been functionally studied, many of which confer important traits, such as abiotic and biotic stress tolerance, delayed leaf senescence and improved plant architecture. Switchgrass (Panicum virgatum) is an important bioenergy cro...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1328-4

    authors: Yuan S,Xu B,Zhang J,Xie Z,Cheng Q,Yang Z,Cai Q,Huang B

    updated:2015-02-25 00:00:00

  • Loss of WSTF results in spontaneous fluctuations of heterochromatin formation and resolution, combined with substantial changes to gene expression.

    abstract:BACKGROUND:Williams syndrome transcription factor (WSTF) is a multifaceted protein that is involved in several nuclear processes, including replication, transcription, and the DNA damage response. WSTF participates in a chromatin-remodeling complex with the ISWI ATPase, SNF2H, and is thought to contribute to the mainte...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-740

    authors: Culver-Cochran AE,Chadwick BP

    updated:2013-10-29 00:00:00

  • Phylogenetic patterns of emergence of new genes support a model of frequent de novo evolution.

    abstract:BACKGROUND:New gene emergence is so far assumed to be mostly driven by duplication and divergence of existing genes. The possibility that entirely new genes could emerge out of the non-coding genomic background was long thought to be almost negligible. With the increasing availability of fully sequenced genomes across ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-117

    authors: Neme R,Tautz D

    updated:2013-02-21 00:00:00

  • Whole-genome phylogenies of the family Bacillaceae and expansion of the sigma factor gene family in the Bacillus cereus species-group.

    abstract:BACKGROUND:The Bacillus cereus sensu lato group consists of six species (B. anthracis, B. cereus, B. mycoides, B. pseudomycoides, B. thuringiensis, and B. weihenstephanensis). While classical microbial taxonomy proposed these organisms as distinct species, newer molecular phylogenies and comparative genome sequencing s...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-430

    authors: Schmidt TR,Scott EJ 2nd,Dyer DW

    updated:2011-08-24 00:00:00

  • Transcriptomic analysis reveals the gene expression profile that specifically responds to IBA during adventitious rooting in mung bean seedlings.

    abstract:BACKGROUND:Auxin plays a critical role in inducing adventitious rooting in many plants. Indole-3-butyric acid (IBA) is the most widely employed auxin for adventitious rooting. However, the molecular mechanisms by which auxin regulates the process of adventitious rooting are less well known. RESULTS:The RNA-Seq data ana...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2372-4

    authors: Li SW,Shi RF,Leng Y,Zhou Y

    updated:2016-01-12 00:00:00

  • Genomic and systems evolution in Vibrionaceae species.

    abstract:BACKGROUND:The steadily increasing number of prokaryotic genomes has accelerated the study of genome evolution; in particular, the availability of sets of genomes from closely related bacteria has facilitated the exploration of the mechanisms underlying genome plasticity. The family Vibrionaceae is found in the Gammapr...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-S1-S11

    authors: Gu J,Neary J,Cai H,Moshfeghian A,Rodriguez SA,Lilburn TG,Wang Y

    updated:2009-07-07 00:00:00

  • An advanced bioinformatics approach for analyzing RNA-seq data reveals sigma H-dependent regulation of competence genes in Listeria monocytogenes.

    abstract:BACKGROUND:Alternative σ factors are important transcriptional regulators in bacteria. While σ(B) has been shown to control a large regulon and play important roles in stress response and virulence in the pathogen Listeria monocytogenes, the function of σ(H) has not yet been well defined in Listeria, even though σ(H) c...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2432-9

    authors: Liu Y,Orsi RH,Boor KJ,Wiedmann M,Guariglia-Oropeza V

    updated:2016-02-16 00:00:00

  • Genome-wide classification and expression analysis of MYB transcription factor families in rice and Arabidopsis.

    abstract:BACKGROUND:The MYB gene family comprises one of the richest groups of transcription factors in plants. Plant MYB proteins are characterized by a highly conserved MYB DNA-binding domain. MYB proteins are classified into four major groups namely, 1R-MYB, 2R-MYB, 3R-MYB and 4R-MYB based on the number and position of MYB r...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-544

    authors: Katiyar A,Smita S,Lenka SK,Rajwanshi R,Chinnusamy V,Bansal KC

    updated:2012-10-10 00:00:00