Abstract:
BACKGROUND: Microbial communities play a crucial role in our environment and may influence human health tremendously. Although urban environments are where human interaction is most abundant, we still know little about the urban microbiome. This is highlighted by the large number of unclassified DNA reads found in urban metagenome samples. The only in silico approach that allows us to find unknown species is the assembly and classification of draft genomes from a metagenomic dataset. In this study we (1) investigate the applicability of an assembly and binning approach for urban metagenome datasets, and (2) develop a new method for the generation of in silico gold standards to better understand the specific challenges of such datasets and to provide a guide for the selection of available software.
RESULTS: We applied combinations of three assembly tools (Megahit, SPAdes and MetaSPAdes) and three binning tools (MaxBin, MetaBAT and CONCOCT) to whole genome shotgun datasets from the CAMDA 2017 Challenge. Complex in silico gold standards with a simulated bacterial fraction were generated for representative samples of each surface type and city. Using these gold standards, we found the combination of SPAdes and MetaBAT to be optimal for urban metagenome datasets, as it provided the best trade-off between the number of high-quality draft genome bins (MIMAG standards) retrieved and the lowest levels of misassembly and contamination. The assembled draft genomes included known species such as Propionibacterium acnes but also novel species according to their respective ANI values.
CONCLUSIONS: In our work, we showed that, even for datasets with high diversity and low sequencing depth from urban environments, assembly and binning-based methods can provide high-quality genome drafts. Sequencing depth is vital for retrieving high-quality genome drafts, but even more important is a high proportion of bacterial sequences, which is needed to achieve high coverage of bacterial genomes. In contrast to read-based methods relying on database knowledge, genome-centric methods as applied in this study can provide valuable information about unknown species and strains as well as functional contributions of single community members within a sample. Furthermore, we present a method for the generation of sample-specific, highly complex in silico gold standards.
REVIEWERS: This article was reviewed by Craig Herbold, Serghei Mangul and Yana Bromberg.
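The abstract ranks tool combinations by how many bins reach high quality under the MIMAG standards and uses ANI values to flag novel species. The sketch below shows one way such a classification could be expressed. It is not the authors' pipeline: the `BinStats` record and function names are hypothetical, the thresholds are the published MIMAG draft categories, and the 95% ANI species cutoff is a commonly used convention assumed here, not a value stated in the abstract.

```python
from dataclasses import dataclass


@dataclass
class BinStats:
    """Quality estimates for one genome bin (hypothetical record layout)."""
    name: str
    completeness: float            # percent, e.g. from a marker-gene based tool
    contamination: float           # percent
    has_5s_16s_23s_rrna: bool = False
    trna_count: int = 0


def mimag_quality(b: BinStats) -> str:
    """Assign a MIMAG draft category: 'high', 'medium', 'low' or 'unclassified'."""
    # High quality: >90% complete, <5% contaminated, 5S/16S/23S rRNA genes
    # present and at least 18 tRNAs.
    if (b.completeness > 90 and b.contamination < 5
            and b.has_5s_16s_23s_rrna and b.trna_count >= 18):
        return "high"
    # Medium quality: >=50% complete, <10% contaminated.
    if b.completeness >= 50 and b.contamination < 10:
        return "medium"
    # Low quality: <50% complete, <10% contaminated.
    if b.contamination < 10:
        return "low"
    # Bins with >=10% contamination fall outside the MIMAG categories.
    return "unclassified"


def is_putative_novel_species(best_ani: float, cutoff: float = 95.0) -> bool:
    """True if the best ANI hit to any reference genome is below the cutoff.

    The 95% default is an assumption (a widely used species boundary).
    """
    return best_ani < cutoff


bins = [
    BinStats("bin_1", 96.2, 1.4, True, 20),
    BinStats("bin_2", 96.2, 1.4),          # no rRNA/tRNA evidence recorded
    BinStats("bin_3", 35.0, 3.0),
    BinStats("bin_4", 80.0, 15.0),
]
labels = {b.name: mimag_quality(b) for b in bins}
```

Counting the `"high"` labels per assembler and binner combination, alongside misassembly and contamination figures, would give the kind of trade-off comparison the abstract describes.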
journal_name: Biol Direct
journal_title: Biology direct
authors: Gerner SM, Rattei T, Graf AB
doi: 10.1186/s13062-018-0225-6
subject: Has Abstract
pub_date: 2018-10-12 00:00:00
pages: 22
issue: 1
issn: 1745-6150
pii: 10.1186/s13062-018-0225-6
journal_volume: 13
pub_type: Journal article
Related literature
Biology Direct literature collection
abstract:BACKGROUND:Microscopic examination of living cells often reveals that cells from some cell strains appear to be in a permanent state of disarray without obvious reason. In all probability such a disorderly state affects cell functioning. The aim of this study was to establish whether a disorderly state could occur that...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/1745-6150-1-9
update_date:2006-04-02 00:00:00
abstract:BACKGROUND:The translation machinery underlies a multitude of biological processes within the cell. The design and implementation of the modern translation apparatus on even the simplest course of action is extremely complex, and involves different RNA and protein factors. According to the "RNA world" idea, the critica...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/1745-6150-8-17
update_date:2013-07-08 00:00:00
abstract:BACKGROUND:A basic tenet of protein science is that all information about the spatial structure of proteins is present in their sequences. Nonetheless, many proteins fail to attain native structure upon experimental denaturation and refolding in vitro, raising the question of the specific role of cellular machinery in ...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/s13062-017-0186-1
update_date:2017-05-31 00:00:00
abstract:BACKGROUND:In eukaryotes, RNA interference (RNAi) is a major mechanism of defense against viruses and transposable elements as well as of regulating translation of endogenous mRNAs. The RNAi systems recognize the target RNA molecules via small guide RNAs that are completely or partially complementary to a region of the ta...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/1745-6150-4-29
update_date:2009-08-25 00:00:00
abstract:BACKGROUND:The origin of eukaryotic cells was an important transition in evolution. The factors underlying the origin and evolutionary success of the eukaryote lineage are still discussed. One camp argues that mitochondria were essential for eukaryote origin because of the unique configuration of internalized bioenerge...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/s13062-018-0221-x
update_date:2018-10-03 00:00:00
abstract:BACKGROUND:In the presence of horizontal gene transfer (HGT), the concepts of lineage and genealogy in the microbial world become more ambiguous because chimeric genomes trace their ancestry from a myriad of sources, both living and extinct. RESULTS:We present the evolutionary histories of three aminoacyl-tRNA synthet...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/1745-6150-6-47
update_date:2011-09-23 00:00:00
abstract:BACKGROUND:The overwhelming majority of animal species exhibit bilateral symmetry. However, the precise evolutionary importance of bilateral symmetry is unknown, although elements of the understanding of the phenomenon have been present within the scientific community for decades. PRESENTATION OF THE HYPOTHESIS:Here w...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/1745-6150-7-22
update_date:2012-07-12 00:00:00
abstract:Crohn's disease is a chronic inflammatory intestinal disease, first described at the beginning of the last century. The disease is characterized by the alternation of periods of flares and remissions influenced by a complex pathogenesis in which inflammation plays a key role. Crohn's disease evolution is mediated by a...
journal_title:Biology direct
pub_type: Journal article, Review
doi:10.1186/s13062-020-00280-5
update_date:2020-11-07 00:00:00
abstract:All modern cells are bounded by cell membranes best described by the fluid mosaic model. This statement is so widely accepted by biologists that little attention is generally given to the theoretical importance of cell membranes in describing the cell. This has not always been the case. When the Cell Theory was first ...
journal_title:Biology direct
pub_type: Historical article, Journal article, Review
doi:10.1186/s13062-014-0032-7
update_date:2014-12-19 00:00:00
abstract:BACKGROUND:Identifying group-specific characteristics in metabolic networks can provide better insight into evolutionary developments. Here, we present an approach to classify the three domains of life using topological information about the underlying metabolic networks. These networks have been shown to share domain-...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/1745-6150-6-53
update_date:2011-10-13 00:00:00
abstract:BACKGROUND:An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt on suc...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/1745-6150-2-33
update_date:2007-11-27 00:00:00
abstract:BACKGROUND:Telocytes (TCs) are interstitial cells with extremely long and thin telopodes (Tps), composed of thin segments (podomers) and dilations (podoms), that interact with neighboring cells. TCs have been found in different organs, while there is still a lack of TC-specific biomarkers to distinguish TCs from the other cells...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/s13062-015-0042-0
update_date:2015-03-11 00:00:00
abstract:Shannon entropy is used to provide an estimate of the number of interpretable components in a principal component analysis. In addition, several ad hoc stopping rules for dimension determination are reviewed and a modification of the broken stick model is presented. The modification incorporates a test for the presenc...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/1745-6150-2-2
update_date:2007-01-17 00:00:00
abstract:UNLABELLED:In this work we review past articles that have mathematically studied cancer heterogeneity and the impact of this heterogeneity on the structure of optimal therapy. We look at past works on modeling how heterogeneous tumors respond to radiotherapy, and take a particularly close look at how the optimal radiot...
journal_title:Biology direct
pub_type: Journal article, Review
doi:10.1186/s13062-016-0142-5
update_date:2016-08-23 00:00:00
abstract:BACKGROUND:In genetics it is customary to refer to double-stranded DNA as containing a "Watson strand" and a "Crick strand." However, there seems to be no consensus in the literature on the exact meaning of these two terms, and the many usages contradict one another as well as the original definition. Here, we review t...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/1745-6150-6-7
update_date:2011-02-08 00:00:00
abstract:The provenance and biochemical roles of eukaryotic MORC proteins have remained poorly understood since the discovery of their prototype MORC1, which is required for meiotic nuclear division in animals. The MORC family contains a combination of a gyrase, histidine kinase, and MutL (GHKL) and S5 domains that together co...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/1745-6150-3-8
update_date:2008-03-17 00:00:00
abstract:BACKGROUND:Accurate estimation of the isoelectric point (pI) based on the amino acid sequence is useful for many analytical biochemistry and proteomics techniques such as 2-D polyacrylamide gel electrophoresis, or capillary isoelectric focusing used in combination with high-throughput mass spectrometry. Additionally, p...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/s13062-016-0159-9
update_date:2016-10-21 00:00:00
abstract:BACKGROUND:It is common belief that all cellular life forms on earth have a common origin. This view is supported by the universality of the genetic code and the universal conservation of multiple genes, particularly those that encode key components of the translation system. A remarkable recent study claims to provide...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/1745-6150-5-64
update_date:2010-11-18 00:00:00
abstract:BACKGROUND:H. sapiens-M. tuberculosis H37Rv protein-protein interaction (PPI) data are essential for understanding the infection mechanism of the formidable pathogen M. tuberculosis H37Rv. Computational prediction is an important strategy to fill the gap in experimental H. sapiens-M. tuberculosis H37Rv PPI data. Homolo...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/1745-6150-9-5
update_date:2014-04-08 00:00:00
abstract:BACKGROUND:Currently, a huge amount of protein-protein interaction data is available; extracting the meaningful interactions is therefore a challenging task. In a protein-protein interaction network, hubs are considered key proteins maintaining the function and stability of the network. Therefore, studying protein-protein complexes ...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/1745-6150-6-49
update_date:2011-10-05 00:00:00
abstract:Functional biologists, like Claude Bernard, ask "How?", meaning that they investigate the mechanisms underlying the emergence of biological functions (proximal causes), while evolutionary biologists, like Charles Darwin, ask "Why?", meaning that they search the causes of adaptation, survival and evolution (remote cau...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/s13062-016-0109-6
update_date:2016-02-09 00:00:00
abstract:BACKGROUND:The evidence for universal common ancestry (UCA) is vast and persuasive. A phylogenetic test has been proposed for quantifying its odds against independently originated sequences based on the comparison between one versus several trees. This test was successfully applied to a well-supported homologous sequen...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/s13062-016-0120-y
update_date:2016-04-07 00:00:00
abstract:It is well-known that Charles Darwin sketched abstract trees of relationship in his 1837 notebook, and depicted a tree in the Origin of Species (1859). Here I attempt to place Darwin's trees in historical context. By the mid-Eighteenth century the Great Chain of Being was increasingly seen to be an inadequate descript...
journal_title:Biology direct
pub_type: Historical article, Journal article, Review
doi:10.1186/1745-6150-4-43
update_date:2009-11-16 00:00:00
abstract:Plant viruses of the recently recognized family Amalgaviridae have monopartite double-stranded (ds) RNA genomes and encode two proteins: an RNA-dependent RNA polymerase (RdRp) and a putative capsid protein (CP). Whereas the RdRp of amalgaviruses has been found to be most closely related to the RdRps of dsRNA viruses o...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/s13062-015-0047-8
update_date:2015-03-29 00:00:00
abstract:BACKGROUND:Recent studies suggest that gene expression profiles are a promising alternative for clinical cancer classification. One major problem in applying DNA microarrays for classification is the dimension of obtained data sets. In this paper we propose a multiclass gene selection method based on Partial Least Squa...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/1745-6150-7-33
update_date:2012-10-02 00:00:00
abstract:BACKGROUND:The current analysis of transposon elements (TE) in Drosophila melanogaster at Evolution Canyon, (EC), Israel, is based on data and analysis done by our collaborators (Drs. J. Gonzalez, J. Martinez and W. Makalowski, this issue). They estimated the frequencies of 28 TEs (transposon elements) in fruit flies (...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/s13062-015-0074-5
update_date:2015-10-14 00:00:00
abstract:BACKGROUND:A dramatic increase in the prevalence of autism and Autistic Spectrum Disorders (ASD) has been observed over the last two decades in USA, Europe and Asia. Given the accumulating data on the possible role of translation in the etiology of ASD, we analyzed potential effects of rare synonymous substitutions ass...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/1745-6150-9-16
update_date:2014-07-10 00:00:00
abstract:BACKGROUND:The costs and benefits of spliceosomal introns in eukaryotes have not been established. One recognized effect of intron splicing is its known enhancement of gene expression. However, the mechanism regulating such splicing-mediated expression enhancement has not been defined. Previous studies have shown that ...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/1745-6150-6-24
update_date:2011-05-18 00:00:00
abstract:BACKGROUND:The elucidation of the dominant role of horizontal gene transfer (HGT) in the evolution of prokaryotes led to a severe crisis of the Tree of Life (TOL) concept and intense debates on this subject. CONCEPT:Prompted by the crisis of the TOL, we attempt to define the primary units and the fundamental patterns ...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/1745-6150-4-33
update_date:2009-09-29 00:00:00
abstract:UNLABELLED:Primase and GINS are essential factors for chromosomal DNA replication in eukaryotic and archaeal cells. Here we describe a previously undetected relationship between the C-terminal domain of the catalytic subunit (PriS) of archaeal primase and the B-domains of the archaeo-eukaryotic GINS proteins in the for...
journal_title:Biology direct
pub_type: Journal article
doi:10.1186/1745-6150-5-17
update_date:2010-04-12 00:00:00