Abstract:
BACKGROUND: The problem of supervised DNA sequence classification arises in several fields of computational molecular biology. Although this problem has been extensively studied, it is still computationally challenging due to the size of the datasets that modern sequencing technologies can produce.
RESULTS: We introduce CLARK, a novel approach to classify metagenomic reads at the species or genus level with high accuracy and high speed. Extensive experimental results on various metagenomic samples show that the classification accuracy of CLARK is better than or comparable to the best state-of-the-art tools, and that it is significantly faster than any of its competitors. In its fastest single-threaded mode, CLARK classifies, with high accuracy, about 32 million metagenomic short reads per minute. CLARK can also classify BAC clones or transcripts to chromosome arms and centromeric regions.
CONCLUSIONS: CLARK is a versatile, fast and accurate sequence classification method, especially useful for metagenomics and genomics applications. It is freely available at http://clark.cs.ucr.edu/.
journal_name: BMC Genomics
journal_title: BMC genomics
authors: Ounit R, Wanamaker S, Close TJ, Lonardi S
doi: 10.1186/s12864-015-1419-2
subject: Has Abstract
pub_date: 2015-03-25 00:00:00
pages: 236
issn: 1471-2164
pii: 10.1186/s12864-015-1419-2
journal_volume: 16
pub_type: Journal article