Comparison of different assembly and annotation tools on analysis of simulated viral metagenomic communities in the gut.

Abstract:

BACKGROUND:The main limitations in the analysis of viral metagenomes are perhaps the high genetic variability and the lack of information in extant databases. To address these issues, several bioinformatic tools have been specifically designed or adapted for metagenomics by improving read assembly and creating more sensitive methods for homology detection. This study compares the performance of different available assemblers and taxonomic annotation software using simulated viral-metagenomic data. RESULTS:We simulated two 454 viral metagenomes using genomes from NCBI's RefSeq database based on the list of actual viruses found in previously published metagenomes. Three different assembly strategies, spanning six assemblers, were tested for performance: overlap-layout-consensus algorithms Newbler, Celera and Minimo; de Bruijn graphs algorithms Velvet and MetaVelvet; and read probabilistic model Genovo. The performance of the assemblies was measured by the length of resulting contigs (using N50), the percentage of reads assembled and the overall accuracy when comparing against corresponding reference genomes. Additionally, the number of chimeras per contig and the lowest common ancestor were estimated in order to assess the effect of assembling on taxonomic and functional annotation. The functional classification of the reads was evaluated by counting the reads that correctly matched the functional data previously reported for the original genomes and calculating the number of over-represented functional categories in chimeric contigs. The sensitivity and specificity of tBLASTx, PhymmBL and the k-mer frequencies were measured by accurate predictions when comparing simulated reads against the NCBI Virus genomes RefSeq database. CONCLUSIONS:Assembling improves functional annotation by increasing accurate assignations and decreasing ambiguous hits between viruses and bacteria. However, the success is limited by the chimeric contigs occurring at all taxonomic levels. The assembler and its parameters should be selected based on the focus of each study. Minimo's non-chimeric contigs and Genovo's long contigs excelled in taxonomy assignation and functional annotation, respectively.tBLASTx stood out as the best approach for taxonomic annotation for virus identification. PhymmBL proved useful in datasets in which no related sequences are present as it uses genomic features that may help identify distant taxa. The k-frequencies underperformed in all viral datasets.

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Vázquez-Castellanos JF,García-López R,Pérez-Brocal V,Pignatelli M,Moya A

doi

10.1186/1471-2164-15-37

subject

Has Abstract

pub_date

2014-01-18 00:00:00

pages

37

issn

1471-2164

pii

1471-2164-15-37

journal_volume

15

pub_type

杂志文章
  • Whole genome and transcriptome analysis reveal adaptive strategies and pathogenesis of Calonectria pseudoreteaudii to Eucalyptus.

    abstract:BACKGROUND:Leaf blight caused by Calonectria spp. is one of the most destructive diseases to affect Eucalyptus nurseries and plantations. These pathogens mainly attack Eucalyptus, a tree with a diversity of secondary metabolites employed as defense-related phytoalexins. To unravel the fungal adaptive mechanisms to vari...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-4739-1

    authors: Ye X,Zhong Z,Liu H,Lin L,Guo M,Guo W,Wang Z,Zhang Q,Feng L,Lu G,Zhang F,Chen Q

    更新日期:2018-05-10 00:00:00

  • New enumeration algorithm for protein structure comparison and classification.

    abstract:BACKGROUND:Protein structure comparison and classification is an effective method for exploring protein structure-function relations. This problem is computationally challenging. Many different computational approaches for protein structure comparison apply the secondary structure elements (SSEs) representation of prot...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-S2-S1

    authors: Ashby C,Johnson D,Walker K,Kanj IA,Xia G,Huang X

    更新日期:2013-01-01 00:00:00

  • MixClone: a mixture model for inferring tumor subclonal populations.

    abstract:BACKGROUND:Tumor genomes are often highly heterogeneous, consisting of genomes from multiple subclonal types. Complete characterization of all subclonal types is a fundamental need in tumor genome analysis. With the advancement of next-generation sequencing, computational methods have recently been developed to infer t...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-16-S2-S1

    authors: Li Y,Xie X

    更新日期:2015-01-01 00:00:00

  • Assessing genomic diversity and signatures of selection in Jiaxian Red cattle using whole-genome sequencing data.

    abstract:BACKGROUND:Native cattle breeds are an important source of genetic variation because they might carry alleles that enable them to adapt to local environment and tough feeding conditions. Jiaxian Red, a Chinese native cattle breed, is reported to have originated from crossbreeding between taurine and indicine cattle; th...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-07340-0

    authors: Xia X,Zhang S,Zhang H,Zhang Z,Chen N,Li Z,Sun H,Liu X,Lyu S,Wang X,Li Z,Yang P,Xu J,Ding X,Shi Q,Wang E,Ru B,Xu Z,Lei C,Chen H,Huang Y

    更新日期:2021-01-09 00:00:00

  • High burden of private mutations due to explosive human population growth and purifying selection.

    abstract:BACKGROUND:Recent studies have shown that human populations have experienced a complex demographic history, including a recent epoch of rapid population growth that led to an excess in the proportion of rare genetic variants in humans today. This excess can impact the burden of private mutations for each individual, de...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-S4-S3

    authors: Gao F,Keinan A

    更新日期:2014-01-01 00:00:00

  • Evidence for a non-canonical JAK/STAT signaling pathway in the synthesis of the brain's major ion channels and neurotransmitter receptors.

    abstract:BACKGROUND:Brain-derived neurotrophic factor (BDNF) is a major signaling molecule that the brain uses to control a vast network of intracellular cascades fundamental to properties of learning and memory, and cognition. While much is known about BDNF signaling in the healthy nervous system where it controls the mitogen ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-6033-2

    authors: Hixson KM,Cogswell M,Brooks-Kayal AR,Russek SJ

    更新日期:2019-08-28 00:00:00

  • Single nucleotide polymorphism discovery from expressed sequence tags in the waterflea Daphnia magna.

    abstract:BACKGROUND:Daphnia (Crustacea: Cladocera) plays a central role in standing aquatic ecosystems, has a well known ecology and is widely used in population studies and environmental risk assessments. Daphnia magna is, especially in Europe, intensively used to study stress responses of natural populations to pollutants, cl...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-309

    authors: Orsini L,Jansen M,Souche EL,Geldof S,De Meester L

    更新日期:2011-06-13 00:00:00

  • Integrative analysis of multi-omics data for identifying multi-markers for diagnosing pancreatic cancer.

    abstract:BACKGROUND:microRNA (miRNA) expression plays an influential role in cancer classification and malignancy, and miRNAs are feasible as alternative diagnostic markers for pancreatic cancer, a highly aggressive neoplasm with silent early symptoms, high metastatic potential, and resistance to conventional therapies. METHOD...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-16-S9-S4

    authors: Kwon MS,Kim Y,Lee S,Namkung J,Yun T,Yi SG,Han S,Kang M,Kim SW,Jang JY,Park T

    更新日期:2015-01-01 00:00:00

  • In silico miRNA prediction in metazoan genomes: balancing between sensitivity and specificity.

    abstract:BACKGROUND:MicroRNAs (miRNAs), short approximately 21-nucleotide RNA molecules, play an important role in post-transcriptional regulation of gene expression. The number of known miRNA hairpins registered in the miRBase database is rapidly increasing, but recent reports suggest that many miRNAs with restricted temporal ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-10-204

    authors: van der Burgt A,Fiers MW,Nap JP,van Ham RC

    更新日期:2009-04-30 00:00:00

  • Heterologous oligonucleotide microarrays for transcriptomics in a non-model species; a proof-of-concept study of drought stress in Musa.

    abstract:BACKGROUND:'Systems-wide' approaches such as microarray RNA-profiling are ideally suited to the study of the complex overlapping responses of plants to biotic and abiotic stresses. However, commercial microarrays are only available for a limited number of plant species and development costs are so substantial as to be ...

    journal_title:BMC genomics

    pub_type: 杂志文章,meta分析

    doi:10.1186/1471-2164-10-436

    authors: Davey MW,Graham NS,Vanholme B,Swennen R,May ST,Keulemans J

    更新日期:2009-09-16 00:00:00

  • Effects of the total replacement of fish-based diet with plant-based diet on the hepatic transcriptome of two European sea bass (Dicentrarchus labrax) half-sibfamilies showing different growth rates with the plant-based diet.

    abstract:BACKGROUND:Efforts towards utilisation of diets without fish meal (FM) or fish oil (FO) in finfish aquaculture have been being made for more than two decades. Metabolic responses to substitution of fishery products have been shown to impact growth performance and immune system of fish as well as their subsequent nutrit...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-522

    authors: Geay F,Ferraresso S,Zambonino-Infante JL,Bargelloni L,Quentel C,Vandeputte M,Kaushik S,Cahu CL,Mazurais D

    更新日期:2011-10-23 00:00:00

  • Gene expression changes during caste-specific neuronal development in the damp-wood termite Hodotermopsis sjostedti.

    abstract:BACKGROUND:One of the key characters of social insects is the division of labor, in which different tasks are allocated to various castes. In termites, one of the representative groups of social insects, morphological differences as well as behavioral differences can be recognized among castes. However, very little is ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-314

    authors: Ishikawa Y,Okada Y,Ishikawa A,Miyakawa H,Koshikawa S,Miura T

    更新日期:2010-05-20 00:00:00

  • EPAS1 gene variants are associated with sprint/power athletic performance in two cohorts of European athletes.

    abstract:BACKGROUND:The endothelial PAS domain protein 1 (EPAS1) activates genes that are involved in erythropoiesis and angiogenesis, thus favoring a better delivery of oxygen to the tissues and is a plausible candidate to influence athletic performance. Using innovative statistical methods we compared genotype distributions a...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-382

    authors: Voisin S,Cieszczyk P,Pushkarev VP,Dyatlov DA,Vashlyayev BF,Shumaylov VA,Maciejewska-Karlowska A,Sawczuk M,Skuza L,Jastrzebski Z,Bishop DJ,Eynon N

    更新日期:2014-05-18 00:00:00

  • Meta-analysis of expression signatures of muscle atrophy: gene interaction networks in early and late stages.

    abstract:BACKGROUND:Skeletal muscle mass can be markedly reduced through a process called atrophy, as a consequence of many diseases or critical physiological and environmental situations. Atrophy is characterised by loss of contractile proteins and reduction of fiber volume. Although in the last decade the molecular aspects un...

    journal_title:BMC genomics

    pub_type: 杂志文章,meta分析

    doi:10.1186/1471-2164-9-630

    authors: Calura E,Cagnin S,Raffaello A,Laveder P,Lanfranchi G,Romualdi C

    更新日期:2008-12-23 00:00:00

  • Antagonism between Staphylococcus epidermidis and Propionibacterium acnes and its genomic basis.

    abstract:BACKGROUND:Propionibacterium acnes and Staphylococcus epidermidis live in close proximity on human skin, and both bacterial species can be isolated from normal and acne vulgaris-affected skin sites. The antagonistic interactions between the two species are poorly understood, as well as the potential significance of bac...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2489-5

    authors: Christensen GJ,Scholz CF,Enghild J,Rohde H,Kilian M,Thürmer A,Brzuszkiewicz E,Lomholt HB,Brüggemann H

    更新日期:2016-02-29 00:00:00

  • Parallel progressive multiple sequence alignment on reconfigurable meshes.

    abstract:BACKGROUND:One of the most fundamental and challenging tasks in bio-informatics is to identify related sequences and their hidden biological significance. The most popular and proven best practice method to accomplish this task is aligning multiple sequences together. However, multiple sequence alignment is a computing...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-S5-S4

    authors: Nguyen KD,Pan Y,Nong G

    更新日期:2011-12-23 00:00:00

  • Systems perspectives on erythromycin biosynthesis by comparative genomic and transcriptomic analyses of S. erythraea E3 and NRRL23338 strains.

    abstract:BACKGROUND:S. erythraea is a Gram-positive filamentous bacterium used for the industrial-scale production of erythromycin A which is of high clinical importance. In this work, we sequenced the whole genome of a high-producing strain (E3) obtained by random mutagenesis and screening from the wild-type strain NRRL23338, ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-523

    authors: Li YY,Chang X,Yu WB,Li H,Ye ZQ,Yu H,Liu BH,Zhang Y,Zhang SL,Ye BC,Li YX

    更新日期:2013-07-31 00:00:00

  • Histological and global gene expression analysis of the 'lactating' pigeon crop.

    abstract:BACKGROUND:Both male and female pigeons have the ability to produce a nutrient solution in their crop for the nourishment of their young. The production of the nutrient solution has been likened to lactation in mammals, and hence the product has been called pigeon 'milk'. It has been shown that pigeon 'milk' is essenti...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-452

    authors: Gillespie MJ,Haring VR,McColl KA,Monaghan P,Donald JA,Nicholas KR,Moore RJ,Crowley TM

    更新日期:2011-09-19 00:00:00

  • Genome and Transcriptome sequence of Finger millet (Eleusine coracana (L.) Gaertn.) provides insights into drought tolerance and nutraceutical properties.

    abstract:BACKGROUND:Finger millet (Eleusine coracana (L.) Gaertn.) is an important staple food crop widely grown in Africa and South Asia. Among the millets, finger millet has high amount of calcium, methionine, tryptophan, fiber, and sulphur containing amino acids. In addition, it has C4 photosynthetic carbon assimilation mech...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-3850-z

    authors: Hittalmani S,Mahesh HB,Shirke MD,Biradar H,Uday G,Aruna YR,Lohithaswa HC,Mohanrao A

    更新日期:2017-06-15 00:00:00

  • Networking in microbes: conjugative elements and plasmids in the genus Alteromonas.

    abstract:BACKGROUND:To develop evolutionary models for the free living bacterium Alteromonas the genome sequences of isolates of the genus have been extensively analyzed. However, the main genetic exchange drivers in these microbes, conjugative elements (CEs), have not been considered in detail thus far. In this work, CEs have ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3461-0

    authors: López-Pérez M,Ramon-Marco N,Rodriguez-Valera F

    更新日期:2017-01-05 00:00:00

  • Genome-wide analysis of the effect of histone modifications on the coexpression of neighboring genes in Saccharomyces cerevisiae.

    abstract:BACKGROUND:Neighboring gene pairs in the genome of Saccharomyces cerevisiae have a tendency to be expressed at the same time. The distribution of histone modifications along chromatin fibers is suggested to be an important mechanism responsible for such coexpression. However, the extent of the contribution of histone m...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-550

    authors: Deng Y,Dai X,Xiang Q,Dai Z,He C,Wang J,Feng J

    更新日期:2010-10-09 00:00:00

  • Chronic wounds alter the proteome profile in skin mucus of farmed gilthead seabream.

    abstract:BACKGROUND:Skin and its mucus are known to be the first barrier of defence against any external stressors. In fish, skin wounds frequently appear as a result of intensive culture and also some diseases have skin ulcers as external clinical signs. However, there is no information about the changes produced by the wounds...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-4349-3

    authors: Cordero H,Brinchmann MF,Cuesta A,Esteban MA

    更新日期:2017-12-02 00:00:00

  • Cross-species transcriptomic analysis elucidates constitutive aryl hydrocarbon receptor activity.

    abstract:BACKGROUND:Research on the aryl hydrocarbon receptor (AHR) has largely focused on variations in toxic outcomes resulting from its activation by halogenated aromatic hydrocarbons. But the AHR also plays key roles in regulating pathways critical for development, and after decades of research the mechanisms underlying phy...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-1053

    authors: Sun RX,Chong LC,Simmons TT,Houlahan KE,Prokopec SD,Watson JD,Moffat ID,Lensu S,Lindén J,P'ng C,Okey AB,Pohjanvirta R,Boutros PC

    更新日期:2014-12-03 00:00:00

  • Variant detection and runs of homozygosity in next generation sequencing data elucidate the genetic background of Lundehund syndrome.

    abstract:BACKGROUND:The Lundehund is a highly specialized breed characterized by a unique flexibility of the joints and polydactyly in all four limbs. The extremely small population size and high inbreeding has promoted a high frequency of diseased dogs affected by the Lundehund syndrome (LS), a severe gastro-enteropathic disea...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2844-6

    authors: Metzger J,Pfahler S,Distl O

    更新日期:2016-08-02 00:00:00

  • Toxicity evaluation of manufactured CeO2 nanoparticles before and after alteration: combined physicochemical and whole-genome expression analysis in Caco-2 cells.

    abstract:BACKGROUND:Engineered nanomaterials may release nanosized residues, by degradation, throughout their life cycle. These residues may be a threat for living organisms. They may be ingested by humans through food and water. Although the toxicity of pristine CeO2 nanoparticles (NPs) has been documented, there is a lack of ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-700

    authors: Fisichella M,Berenguer F,Steinmetz G,Auffan M,Rose J,Prat O

    更新日期:2014-08-21 00:00:00

  • Effects of pathogenic CNVs on physical traits in participants of the UK Biobank.

    abstract:BACKGROUND:Copy number variants (CNVs) have been shown to increase risk for physical anomalies, developmental, psychiatric and medical disorders. Some of them have been associated with changes in weight, height, and other physical traits. As most studies have been performed on children and young people, these effects o...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-5292-7

    authors: Owen D,Bracher-Smith M,Kendall KM,Rees E,Einon M,Escott-Price V,Owen MJ,O'Donovan MC,Kirov G

    更新日期:2018-12-04 00:00:00

  • Assessing structural variation in a personal genome-towards a human reference diploid genome.

    abstract:BACKGROUND:Characterizing large genomic variants is essential to expanding the research and clinical applications of genome sequencing. While multiple data types and methods are available to detect these structural variants (SVs), they remain less characterized than smaller variants because of SV diversity, complexity,...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1479-3

    authors: English AC,Salerno WJ,Hampton OA,Gonzaga-Jauregui C,Ambreth S,Ritter DI,Beck CR,Davis CF,Dahdouli M,Ma S,Carroll A,Veeraraghavan N,Bruestle J,Drees B,Hastie A,Lam ET,White S,Mishra P,Wang M,Han Y,Zhang F,Stankie

    更新日期:2015-04-11 00:00:00

  • Investigation of transmembrane proteins using a computational approach.

    abstract:BACKGROUND:An important subfamily of membrane proteins are the transmembrane alpha-helical proteins, in which the membrane-spanning regions are made up of alpha-helices. Given the obvious biological and medical significance of these proteins, it is of tremendous practical importance to identify the location of transmem...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-S1-S7

    authors: Yang JY,Yang MQ,Dunker AK,Deng Y,Huang X

    更新日期:2008-01-01 00:00:00

  • Stringent comparative sequence analysis reveals SOX10 as a putative inhibitor of glial cell differentiation.

    abstract:BACKGROUND:The transcription factor SOX10 is essential for all stages of Schwann cell development including myelination. SOX10 cooperates with other transcription factors to activate the expression of key myelin genes in Schwann cells and is therefore a context-dependent, pro-myelination transcription factor. As such, ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3167-3

    authors: Gopinath C,Law WD,Rodríguez-Molina JF,Prasad AB,Song L,Crawford GE,Mullikin JC,Svaren J,Antonellis A

    更新日期:2016-11-07 00:00:00

  • Rapid identification of causative insertions underlying Medicago truncatula Tnt1 mutants defective in symbiotic nitrogen fixation from a forward genetic screen by whole genome sequencing.

    abstract:BACKGROUND:In the model legume Medicago truncatula, the near saturation genome-wide Tnt1 insertion mutant population in ecotype R108 is a valuable tool in functional genomics studies. Forward genetic screens have identified many Tnt1 mutants defective in nodule development and symbiotic nitrogen fixation (SNF). However...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2452-5

    authors: Veerappan V,Jani M,Kadel K,Troiani T,Gale R,Mayes T,Shulaev E,Wen J,Mysore KS,Azad RK,Dickstein R

    更新日期:2016-02-27 00:00:00