Abstract:
BACKGROUND:The main limitations in the analysis of viral metagenomes are perhaps the high genetic variability and the lack of information in extant databases. To address these issues, several bioinformatic tools have been specifically designed or adapted for metagenomics by improving read assembly and creating more sensitive methods for homology detection. This study compares the performance of different available assemblers and taxonomic annotation software using simulated viral-metagenomic data. RESULTS:We simulated two 454 viral metagenomes using genomes from NCBI's RefSeq database based on the list of actual viruses found in previously published metagenomes. Three different assembly strategies, spanning six assemblers, were tested for performance: overlap-layout-consensus algorithms Newbler, Celera and Minimo; de Bruijn graphs algorithms Velvet and MetaVelvet; and read probabilistic model Genovo. The performance of the assemblies was measured by the length of resulting contigs (using N50), the percentage of reads assembled and the overall accuracy when comparing against corresponding reference genomes. Additionally, the number of chimeras per contig and the lowest common ancestor were estimated in order to assess the effect of assembling on taxonomic and functional annotation. The functional classification of the reads was evaluated by counting the reads that correctly matched the functional data previously reported for the original genomes and calculating the number of over-represented functional categories in chimeric contigs. The sensitivity and specificity of tBLASTx, PhymmBL and the k-mer frequencies were measured by accurate predictions when comparing simulated reads against the NCBI Virus genomes RefSeq database. CONCLUSIONS:Assembling improves functional annotation by increasing accurate assignations and decreasing ambiguous hits between viruses and bacteria. However, the success is limited by the chimeric contigs occurring at all taxonomic levels. The assembler and its parameters should be selected based on the focus of each study. Minimo's non-chimeric contigs and Genovo's long contigs excelled in taxonomy assignation and functional annotation, respectively.tBLASTx stood out as the best approach for taxonomic annotation for virus identification. PhymmBL proved useful in datasets in which no related sequences are present as it uses genomic features that may help identify distant taxa. The k-frequencies underperformed in all viral datasets.
journal_name
BMC Genomicsjournal_title
BMC genomicsauthors
Vázquez-Castellanos JF,García-López R,Pérez-Brocal V,Pignatelli M,Moya Adoi
10.1186/1471-2164-15-37subject
Has Abstractpub_date
2014-01-18 00:00:00pages
37issn
1471-2164pii
1471-2164-15-37journal_volume
15pub_type
杂志文章相关文献
BMC GENOMICS文献大全abstract:BACKGROUND:Leaf blight caused by Calonectria spp. is one of the most destructive diseases to affect Eucalyptus nurseries and plantations. These pathogens mainly attack Eucalyptus, a tree with a diversity of secondary metabolites employed as defense-related phytoalexins. To unravel the fungal adaptive mechanisms to vari...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-018-4739-1
更新日期:2018-05-10 00:00:00
abstract:BACKGROUND:Protein structure comparison and classification is an effective method for exploring protein structure-function relations. This problem is computationally challenging. Many different computational approaches for protein structure comparison apply the secondary structure elements (SSEs) representation of prot...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-14-S2-S1
更新日期:2013-01-01 00:00:00
abstract:BACKGROUND:Tumor genomes are often highly heterogeneous, consisting of genomes from multiple subclonal types. Complete characterization of all subclonal types is a fundamental need in tumor genome analysis. With the advancement of next-generation sequencing, computational methods have recently been developed to infer t...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-16-S2-S1
更新日期:2015-01-01 00:00:00
abstract:BACKGROUND:Native cattle breeds are an important source of genetic variation because they might carry alleles that enable them to adapt to local environment and tough feeding conditions. Jiaxian Red, a Chinese native cattle breed, is reported to have originated from crossbreeding between taurine and indicine cattle; th...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-020-07340-0
更新日期:2021-01-09 00:00:00
abstract:BACKGROUND:Recent studies have shown that human populations have experienced a complex demographic history, including a recent epoch of rapid population growth that led to an excess in the proportion of rare genetic variants in humans today. This excess can impact the burden of private mutations for each individual, de...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-S4-S3
更新日期:2014-01-01 00:00:00
abstract:BACKGROUND:Brain-derived neurotrophic factor (BDNF) is a major signaling molecule that the brain uses to control a vast network of intracellular cascades fundamental to properties of learning and memory, and cognition. While much is known about BDNF signaling in the healthy nervous system where it controls the mitogen ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-019-6033-2
更新日期:2019-08-28 00:00:00
abstract:BACKGROUND:Daphnia (Crustacea: Cladocera) plays a central role in standing aquatic ecosystems, has a well known ecology and is widely used in population studies and environmental risk assessments. Daphnia magna is, especially in Europe, intensively used to study stress responses of natural populations to pollutants, cl...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-12-309
更新日期:2011-06-13 00:00:00
abstract:BACKGROUND:microRNA (miRNA) expression plays an influential role in cancer classification and malignancy, and miRNAs are feasible as alternative diagnostic markers for pancreatic cancer, a highly aggressive neoplasm with silent early symptoms, high metastatic potential, and resistance to conventional therapies. METHOD...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-16-S9-S4
更新日期:2015-01-01 00:00:00
abstract:BACKGROUND:MicroRNAs (miRNAs), short approximately 21-nucleotide RNA molecules, play an important role in post-transcriptional regulation of gene expression. The number of known miRNA hairpins registered in the miRBase database is rapidly increasing, but recent reports suggest that many miRNAs with restricted temporal ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-10-204
更新日期:2009-04-30 00:00:00
abstract:BACKGROUND:'Systems-wide' approaches such as microarray RNA-profiling are ideally suited to the study of the complex overlapping responses of plants to biotic and abiotic stresses. However, commercial microarrays are only available for a limited number of plant species and development costs are so substantial as to be ...
journal_title:BMC genomics
pub_type: 杂志文章,meta分析
doi:10.1186/1471-2164-10-436
更新日期:2009-09-16 00:00:00
abstract:BACKGROUND:Efforts towards utilisation of diets without fish meal (FM) or fish oil (FO) in finfish aquaculture have been being made for more than two decades. Metabolic responses to substitution of fishery products have been shown to impact growth performance and immune system of fish as well as their subsequent nutrit...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-12-522
更新日期:2011-10-23 00:00:00
abstract:BACKGROUND:One of the key characters of social insects is the division of labor, in which different tasks are allocated to various castes. In termites, one of the representative groups of social insects, morphological differences as well as behavioral differences can be recognized among castes. However, very little is ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-314
更新日期:2010-05-20 00:00:00
abstract:BACKGROUND:The endothelial PAS domain protein 1 (EPAS1) activates genes that are involved in erythropoiesis and angiogenesis, thus favoring a better delivery of oxygen to the tissues and is a plausible candidate to influence athletic performance. Using innovative statistical methods we compared genotype distributions a...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-382
更新日期:2014-05-18 00:00:00
abstract:BACKGROUND:Skeletal muscle mass can be markedly reduced through a process called atrophy, as a consequence of many diseases or critical physiological and environmental situations. Atrophy is characterised by loss of contractile proteins and reduction of fiber volume. Although in the last decade the molecular aspects un...
journal_title:BMC genomics
pub_type: 杂志文章,meta分析
doi:10.1186/1471-2164-9-630
更新日期:2008-12-23 00:00:00
abstract:BACKGROUND:Propionibacterium acnes and Staphylococcus epidermidis live in close proximity on human skin, and both bacterial species can be isolated from normal and acne vulgaris-affected skin sites. The antagonistic interactions between the two species are poorly understood, as well as the potential significance of bac...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-2489-5
更新日期:2016-02-29 00:00:00
abstract:BACKGROUND:One of the most fundamental and challenging tasks in bio-informatics is to identify related sequences and their hidden biological significance. The most popular and proven best practice method to accomplish this task is aligning multiple sequences together. However, multiple sequence alignment is a computing...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-12-S5-S4
更新日期:2011-12-23 00:00:00
abstract:BACKGROUND:S. erythraea is a Gram-positive filamentous bacterium used for the industrial-scale production of erythromycin A which is of high clinical importance. In this work, we sequenced the whole genome of a high-producing strain (E3) obtained by random mutagenesis and screening from the wild-type strain NRRL23338, ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-14-523
更新日期:2013-07-31 00:00:00
abstract:BACKGROUND:Both male and female pigeons have the ability to produce a nutrient solution in their crop for the nourishment of their young. The production of the nutrient solution has been likened to lactation in mammals, and hence the product has been called pigeon 'milk'. It has been shown that pigeon 'milk' is essenti...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-12-452
更新日期:2011-09-19 00:00:00
abstract:BACKGROUND:Finger millet (Eleusine coracana (L.) Gaertn.) is an important staple food crop widely grown in Africa and South Asia. Among the millets, finger millet has high amount of calcium, methionine, tryptophan, fiber, and sulphur containing amino acids. In addition, it has C4 photosynthetic carbon assimilation mech...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-017-3850-z
更新日期:2017-06-15 00:00:00
abstract:BACKGROUND:To develop evolutionary models for the free living bacterium Alteromonas the genome sequences of isolates of the genus have been extensively analyzed. However, the main genetic exchange drivers in these microbes, conjugative elements (CEs), have not been considered in detail thus far. In this work, CEs have ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-3461-0
更新日期:2017-01-05 00:00:00
abstract:BACKGROUND:Neighboring gene pairs in the genome of Saccharomyces cerevisiae have a tendency to be expressed at the same time. The distribution of histone modifications along chromatin fibers is suggested to be an important mechanism responsible for such coexpression. However, the extent of the contribution of histone m...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-550
更新日期:2010-10-09 00:00:00
abstract:BACKGROUND:Skin and its mucus are known to be the first barrier of defence against any external stressors. In fish, skin wounds frequently appear as a result of intensive culture and also some diseases have skin ulcers as external clinical signs. However, there is no information about the changes produced by the wounds...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-017-4349-3
更新日期:2017-12-02 00:00:00
abstract:BACKGROUND:Research on the aryl hydrocarbon receptor (AHR) has largely focused on variations in toxic outcomes resulting from its activation by halogenated aromatic hydrocarbons. But the AHR also plays key roles in regulating pathways critical for development, and after decades of research the mechanisms underlying phy...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-1053
更新日期:2014-12-03 00:00:00
abstract:BACKGROUND:The Lundehund is a highly specialized breed characterized by a unique flexibility of the joints and polydactyly in all four limbs. The extremely small population size and high inbreeding has promoted a high frequency of diseased dogs affected by the Lundehund syndrome (LS), a severe gastro-enteropathic disea...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-2844-6
更新日期:2016-08-02 00:00:00
abstract:BACKGROUND:Engineered nanomaterials may release nanosized residues, by degradation, throughout their life cycle. These residues may be a threat for living organisms. They may be ingested by humans through food and water. Although the toxicity of pristine CeO2 nanoparticles (NPs) has been documented, there is a lack of ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-700
更新日期:2014-08-21 00:00:00
abstract:BACKGROUND:Copy number variants (CNVs) have been shown to increase risk for physical anomalies, developmental, psychiatric and medical disorders. Some of them have been associated with changes in weight, height, and other physical traits. As most studies have been performed on children and young people, these effects o...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-018-5292-7
更新日期:2018-12-04 00:00:00
abstract:BACKGROUND:Characterizing large genomic variants is essential to expanding the research and clinical applications of genome sequencing. While multiple data types and methods are available to detect these structural variants (SVs), they remain less characterized than smaller variants because of SV diversity, complexity,...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-1479-3
更新日期:2015-04-11 00:00:00
abstract:BACKGROUND:An important subfamily of membrane proteins are the transmembrane alpha-helical proteins, in which the membrane-spanning regions are made up of alpha-helices. Given the obvious biological and medical significance of these proteins, it is of tremendous practical importance to identify the location of transmem...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-9-S1-S7
更新日期:2008-01-01 00:00:00
abstract:BACKGROUND:The transcription factor SOX10 is essential for all stages of Schwann cell development including myelination. SOX10 cooperates with other transcription factors to activate the expression of key myelin genes in Schwann cells and is therefore a context-dependent, pro-myelination transcription factor. As such, ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-3167-3
更新日期:2016-11-07 00:00:00
abstract:BACKGROUND:In the model legume Medicago truncatula, the near saturation genome-wide Tnt1 insertion mutant population in ecotype R108 is a valuable tool in functional genomics studies. Forward genetic screens have identified many Tnt1 mutants defective in nodule development and symbiotic nitrogen fixation (SNF). However...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-2452-5
更新日期:2016-02-27 00:00:00