Rapid quantification of sequence repeats to resolve the size, structure and contents of bacterial genomes.

Abstract:

BACKGROUND: The numerous classes of repeats often impede the assembly of genome sequences from the short reads provided by new sequencing technologies. We demonstrate a simple and rapid means to ascertain the repeat structure and total size of a bacterial or archaeal genome without the need for assembly by directly analyzing the abundances of distinct k-mers among reads. RESULTS: The sensitivity of this procedure to resolve variation within a bacterial species is demonstrated: genome sizes and repeat structure of five environmental strains of E. coli from short Illumina reads were estimated by this method, and total genome sizes corresponded well with those obtained for the same strains by pulsed-field gel electrophoresis. In addition, this approach was applied to read-sets for completed genomes and shown to be accurate over a wide range of microbial genome sizes. CONCLUSIONS: Application of these procedures, based solely on k-mer abundances in short read data sets, allows aspects of genome structure to be resolved that are not apparent from conventional short read assemblies. This knowledge of the repetitive content of genomes provides insights into genome evolution and diversity.
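
The idea of estimating genome size from k-mer abundances alone can be illustrated with a minimal sketch. This is not the authors' published procedure. It assumes a common simplification: the modal k-mer abundance approximates the sequencing depth of single-copy sequence, so the total number of k-mer occurrences divided by that mode approximates genome length. All function names and the `min_abundance` error cutoff are hypothetical, and only the forward strand is counted.

```python
from collections import Counter


def count_kmers(reads, k):
    """Count every k-length substring across all reads (forward strand only)."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def abundance_spectrum(counts):
    """Map each abundance to the number of distinct k-mers seen that many times."""
    return Counter(counts.values())


def estimate_genome_size(counts, min_abundance=2):
    """Return (estimated genome size, modal abundance).

    K-mers seen fewer than min_abundance times are treated as likely
    sequencing errors and ignored (an assumption of this sketch).
    """
    spectrum = abundance_spectrum(counts)
    solid = {a: n for a, n in spectrum.items() if a >= min_abundance}
    if not solid:
        raise ValueError("no k-mers at or above min_abundance")
    # Modal abundance: the depth at which single-copy k-mers pile up.
    # Ties are broken in favour of the lower abundance.
    peak = max(solid, key=lambda a: (solid[a], -a))
    # Total k-mer occurrences; repeated k-mers contribute in proportion
    # to their copy number, so repeats are included in the size estimate.
    total = sum(a * n for a, n in solid.items())
    return total / peak, peak


def copy_number(counts, kmer, peak):
    """Rough copy number of one k-mer relative to the single-copy depth."""
    return round(counts[kmer] / peak)
```

In this simplified model, k-mers whose abundance is close to an integer multiple of the peak are candidates for repeated sequence, which is how a spectrum can hint at repeat structure without an assembly. Real read sets would also need reverse-complement handling and a more careful treatment of sequencing errors and uneven coverage.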

journal_name

BMC Genomics

authors

Williams D,Trimble WL,Shilts M,Meyer F,Ochman H

doi

10.1186/1471-2164-14-537

pub_date

2013-08-08 00:00:00

pages

537

issn

1471-2164

pii

1471-2164-14-537

journal_volume

14

pub_type

Journal Article
  • Evidence of uneven selective pressure on different subsets of the conserved human genome; implications for the significance of intronic and intergenic DNA.

    abstract:BACKGROUND:Human genetic variation produces the wide range of phenotypic differences that make us individual. However, little is known about the distribution of variation in the most conserved functional regions of the human genome. We examined whether different subsets of the conserved human genome have been subjected...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-614

    authors: Davidson S,Starkey A,MacKenzie A

    update_date:2009-12-16 00:00:00

  • Conditional entropy in variation-adjusted windows detects selection signatures associated with expression quantitative trait loci (eQTLs).

    abstract:BACKGROUND:Over the past 50,000 years, shifts in human-environmental or human-human interactions shaped genetic differences within and among human populations, including variants under positive selection. Shaped by environmental factors, such variants influence the genetics of modern health, disease, and treatment outc...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-16-S8-S8

    authors: Handelman SK,Seweryn M,Smith RM,Hartmann K,Wang D,Pietrzak M,Johnson AD,Kloczkowski A,Sadee W

    update_date:2015-01-01 00:00:00

  • Transcriptome profiling of developmental and xenobiotic responses in a keystone soil animal, the oligochaete annelid Lumbricus rubellus.

    abstract:BACKGROUND:Natural contamination and anthropogenic pollution of soils are likely to be major determinants of functioning and survival of keystone invertebrate taxa. Soil animals will have both evolutionary adaptation and genetically programmed responses to these toxic chemicals, but mechanistic understanding of such is...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-266

    authors: Owen J,Hedley BA,Svendsen C,Wren J,Jonker MJ,Hankard PK,Lister LJ,Stürzenbaum SR,Morgan AJ,Spurgeon DJ,Blaxter ML,Kille P

    update_date:2008-06-03 00:00:00

  • Transcriptome analysis of Sacha Inchi (Plukenetia volubilis L.) seeds at two developmental stages.

    abstract:BACKGROUND:Sacha Inchi (Plukenetia volubilis L., Euphorbiaceae) is a potential oilseed crop because the seeds of this plant are rich in unsaturated fatty acids (FAs). In particular, the fatty acid composition of its seed oil differs markedly in containing large quantities of α-linolenic acid (18C:3, a kind of ω-3 FAs)....

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-716

    authors: Wang X,Xu R,Wang R,Liu A

    update_date:2012-12-20 00:00:00

  • The unique genomic landscape surrounding the EPSPS gene in glyphosate resistant Amaranthus palmeri: a repetitive path to resistance.

    abstract:BACKGROUND:The expanding number and global distributions of herbicide resistant weedy species threaten food, fuel, fiber and bioproduct sustainability and agroecosystem longevity. Amongst the most competitive weeds, Amaranthus palmeri S. Wats has rapidly evolved resistance to glyphosate primarily through massive amplif...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-3336-4

    authors: Molin WT,Wright AA,Lawton-Rauh A,Saski CA

    update_date:2017-01-17 00:00:00

  • Strand-specific transcriptomes of Enterohemorrhagic Escherichia coli in response to interactions with ground beef microbiota: interactions between microorganisms in raw meat.

    abstract:BACKGROUND:Enterohemorrhagic Escherichia coli (EHEC) are zoonotic agents associated with outbreaks worldwide. Growth of EHEC strains in ground beef could be inhibited by background microbiota that is present initially at levels greater than that of the pathogen E. coli. However, how the microbiota outcompetes the patho...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-3957-2

    authors: Galia W,Leriche F,Cruveiller S,Garnier C,Navratil V,Dubost A,Blanquet-Diot S,Thevenot-Sergentet D

    update_date:2017-08-03 00:00:00

  • Construction of relatedness matrices using genotyping-by-sequencing data.

    abstract:BACKGROUND:Genotyping-by-sequencing (GBS) is becoming an attractive alternative to array-based methods for genotyping individuals for a large number of single nucleotide polymorphisms (SNPs). Costs can be lowered by reducing the mean sequencing depth, but this results in genotype calls of lower quality. A common analys...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-2252-3

    authors: Dodds KG,McEwan JC,Brauning R,Anderson RM,van Stijn TC,Kristjánsson T,Clarke SM

    update_date:2015-12-09 00:00:00

  • Genomic differences between cultivated soybean, G. max and its wild relative G. soja.

    abstract:BACKGROUND:Glycine max is an economically important crop and many different varieties of soybean exist around the world. The first draft sequences and gene models of G. max (domesticated soybean) as well as G. soja (wild soybean), both became available in 2010. This opened the door for comprehensive comparative genomic...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-S1-S5

    authors: Joshi T,Valliyodan B,Wu JH,Lee SH,Xu D,Nguyen HT

    update_date:2013-01-01 00:00:00

  • Unravelling the complex trait of harvest index in rapeseed (Brassica napus L.) with association mapping.

    abstract:BACKGROUND:Harvest index (HI), the ratio of grain yield to total biomass, is considered as a measure of biological success in partitioning assimilated photosynthate to the harvestable product. While crop production can be dramatically improved by increasing HI, the underlying molecular genetic mechanism of HI in rapese...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1607-0

    authors: Luo X,Ma C,Yue Y,Hu K,Li Y,Duan Z,Wu M,Tu J,Shen J,Yi B,Fu T

    update_date:2015-05-12 00:00:00

  • Mapping a quantitative trait locus (QTL) conferring pyrethroid resistance in the African malaria vector Anopheles funestus.

    abstract:BACKGROUND:Pyrethroid resistance in Anopheles funestus populations has led to an increase in malaria transmission in southern Africa. Resistance has been attributed to elevated activities of cytochrome P450s but the molecular basis underlying this metabolic resistance is unknown. Microsatellite and SNP markers were use...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-8-34

    authors: Wondji CS,Morgan J,Coetzee M,Hunt RH,Steen K,Black WC 4th,Hemingway J,Ranson H

    update_date:2007-01-29 00:00:00

  • Mosquito transcriptome changes and filarial worm resistance in Armigeres subalbatus.

    abstract:BACKGROUND:Armigeres subalbatus is a natural vector of the filarial worm Brugia pahangi, but it rapidly and proficiently kills Brugia malayi microfilariae by melanotic encapsulation. Because B. malayi and B. pahangi are morphologically and biologically similar, the Armigeres-Brugia system serves as a valuable model for...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-8-463

    authors: Aliota MT,Fuchs JF,Mayhew GF,Chen CC,Christensen BM

    update_date:2007-12-18 00:00:00

  • RNA-seq analysis of an apical meristem time series reveals a critical point in Arabidopsis thaliana flower initiation.

    abstract:BACKGROUND:Floral transition is a critical event in the life cycle of a flowering plant as it determines its reproductive success. Despite extensive studies of specific genes that regulate this process, the global changes in transcript expression profiles at the point when a vegetative meristem transitions into an infl...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1688-9

    authors: Klepikova AV,Logacheva MD,Dmitriev SE,Penin AA

    update_date:2015-06-18 00:00:00

  • Analysis of the Pantoea ananatis pan-genome reveals factors underlying its ability to colonize and interact with plant, insect and vertebrate hosts.

    abstract:BACKGROUND:Pantoea ananatis is found in a wide range of natural environments, including water, soil, as part of the epi- and endophytic flora of various plant hosts, and in the insect gut. Some strains have proven effective as biological control agents and plant-growth promoters, while other strains have been implicate...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-404

    authors: De Maayer P,Chan WY,Rubagotti E,Venter SN,Toth IK,Birch PR,Coutinho TA

    update_date:2014-05-27 00:00:00

  • End-sequencing and characterization of silkworm (Bombyx mori) bacterial artificial chromosome libraries.

    abstract:BACKGROUND:We performed large-scale bacterial artificial chromosome (BAC) end-sequencing of two BAC libraries (an EcoRI- and a BamHI-digested library) and conducted an in silico analysis to characterize the obtained sequence data, to make them a useful resource for genomic research on the silkworm (Bombyx mori). RESUL...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-8-314

    authors: Suetsugu Y,Minami H,Shimomura M,Sasanuma S,Narukawa J,Mita K,Yamamoto K

    update_date:2007-09-07 00:00:00

  • Comparative proteome analysis of Saccharomyces cerevisiae: a global overview of in vivo targets of the yeast activator protein 1.

    abstract:BACKGROUND:The activity of the yeast activator protein 1 (Yap1p) increases under stress conditions, which leads to enhanced transcription of a number of genes encoding protective enzymes or other proteins. To obtain a global overview of changes in expression of Yap1p-targeted proteins, we compared a Yap1p-overexpressin...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-230

    authors: Jun H,Kieselbach T,Jönsson LJ

    update_date:2012-06-09 00:00:00

  • Cis regulatory motifs and antisense transcriptional control in the apicomplexan Theileria parva.

    abstract:BACKGROUND:Theileria parva is an intracellular parasite that causes a lymphoproliferative disease in cattle. It does so by inducing cancer-like phenotypes in the host cells it infects, although the molecular and regulatory mechanisms involved remain poorly understood. RNAseq data, and the resulting updated genome annot...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2444-5

    authors: Tretina K,Pelle R,Silva JC

    update_date:2016-02-20 00:00:00

  • Biclustering of transcriptome sequencing data reveals human tissue-specific circular RNAs.

    abstract:BACKGROUND:Emerging evidence has experimentally confirmed the tissue-specific expression of circular RNAs (circRNAs). Global identification of human tissue-specific circRNAs is crucial for functional studies and facilitates the discovery of circRNAs as potential diagnostic biomarkers. RESULTS:In this study, c...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-4335-9

    authors: Liu YC,Chiu YJ,Li JR,Sun CH,Liu CC,Huang HD

    update_date:2018-01-19 00:00:00

  • Anomaly detection in gene expression via stochastic models of gene regulatory networks.

    abstract:BACKGROUND:The steady-state behaviour of gene regulatory networks (GRNs) can provide crucial evidence for detecting disease-causing genes. However, monitoring the dynamics of GRNs is particularly difficult because biological data only reflects a snapshot of the dynamical behaviour of the living organism. Also most GRN ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-S3-S26

    authors: Kim H,Gelenbe E

    update_date:2009-12-03 00:00:00

  • Next-generation sequencing identifies equine cartilage and subchondral bone miRNAs and suggests their involvement in osteochondrosis physiopathology.

    abstract:BACKGROUND:MicroRNAs (miRNAs) are an abundant class of small single-stranded non-coding RNA molecules ranging from 18 to 24 nucleotides. They negatively regulate gene expression at the post-transcriptional level and play key roles in many biological processes, including skeletal development and cartilage maturation. In...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-798

    authors: Desjardin C,Vaiman A,Mata X,Legendre R,Laubier J,Kennedy SP,Laloe D,Barrey E,Jacques C,Cribiu EP,Schibler L

    update_date:2014-09-17 00:00:00

  • Genome-wide analyses of Epstein-Barr virus reveal conserved RNA structures and a novel stable intronic sequence RNA.

    abstract:BACKGROUND:Epstein-Barr virus (EBV) is a human herpesvirus implicated in cancer and autoimmune disorders. Little is known concerning the roles of RNA structure in this important human pathogen. This study provides the first comprehensive genome-wide survey of RNA and RNA structure in EBV. RESULTS:Novel EBV RNAs and RN...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-543

    authors: Moss WN,Steitz JA

    update_date:2013-08-09 00:00:00

  • Genome-wide profiling of G protein-coupled receptors in cerebellar granule neurons using high-throughput, real-time PCR.

    abstract:BACKGROUND:G protein-coupled receptors (GPCRs) are major players in cell communication, regulate a whole range of physiological functions during development and throughout adult life, are affected in numerous pathological situations, and constitute so far the largest class of druggable targets for human diseases. The co...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-241

    authors: Maurel B,Le Digarcher A,Dantec C,Journot L

    update_date:2011-05-16 00:00:00

  • Determining multiallelic complex copy number and sequence variation from high coverage exome sequencing data.

    abstract:BACKGROUND:Copy number variation (CNV) is a major component of genomic variation, yet methods to accurately type genomic CNV lag behind methods that type single nucleotide variation. High-throughput sequencing can contribute to these methods by using sequence read depth, which takes the number of reads that map to a gi...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-2123-y

    authors: Forni D,Martin D,Abujaber R,Sharp AJ,Sironi M,Hollox EJ

    update_date:2015-11-02 00:00:00

  • Genome-wide genetic structure and selection signatures for color in 10 traditional Chinese yellow-feathered chicken breeds.

    abstract:BACKGROUND:Yellow-feathered chickens (YFCs) have a long history in China. They are well-known for the nutritional and commercial importance attributable to their yellow color phenotype. Currently, little is known about the genetic determinants responsible for phenotypic and biochemical properties of t...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-6736-4

    authors: Huang X,Otecko NO,Peng M,Weng Z,Li W,Chen J,Zhong M,Zhong F,Jin S,Geng Z,Luo W,He D,Ma C,Han J,Ommeh SC,Zhang Y,Zhang X,Du B

    update_date:2020-04-20 00:00:00

  • Steps toward broad-spectrum therapeutics: discovering virulence-associated genes present in diverse human pathogens.

    abstract:BACKGROUND:New and improved antimicrobial countermeasures are urgently needed to counteract increased resistance to existing antimicrobial treatments and to combat currently untreatable or new emerging infectious diseases. We demonstrate that computational comparative genomics, together with experimental screening, can...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-501

    authors: Stubben CJ,Duffield ML,Cooper IA,Ford DC,Gans JD,Karlyshev AV,Lingard B,Oyston PC,de Rochefort A,Song J,Wren BW,Titball RW,Wolinsky M

    update_date:2009-10-29 00:00:00

  • Analysis of functional variants in mitochondrial DNA of Finnish athletes.

    abstract:BACKGROUND:We have previously reported on paucity of mitochondrial DNA (mtDNA) haplogroups J and K among Finnish endurance athletes. Here we aimed to further explore differences in mtDNA variants between elite endurance and sprint athletes. For this purpose, we determined the rate of functional variants and the mutatio...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-6171-6

    authors: Kiiskilä J,Moilanen JS,Kytövuori L,Niemi AK,Majamaa K

    update_date:2019-10-29 00:00:00

  • Evaluation of whole exome sequencing as an alternative to BeadChip and whole genome sequencing in human population genetic analysis.

    abstract:BACKGROUND:Understanding the underlying genetic structure of human populations is of fundamental interest to both biological and social sciences. Advances in high-throughput genotyping technology have markedly improved our understanding of global patterns of human genetic variation. The most widely used methods for col...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-5168-x

    authors: Maróti Z,Boldogkői Z,Tombácz D,Snyder M,Kalmár T

    update_date:2018-10-29 00:00:00

  • Transcriptomic analysis of Macrobrachium rosenbergii (giant fresh water prawn) post-larvae in response to M. rosenbergii nodavirus (MrNV) infection: de novo assembly and functional annotation.

    abstract:BACKGROUND:Macrobrachium rosenbergii is one of the major freshwater prawn species cultured in Southeast Asia. White tail disease (WTD), caused by Macrobrachium rosenbergii nodavirus (MrNV), is a serious problem in farm cultivation and is responsible for up to 100% mortality in the post-larval stage. Molecular data on ho...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-6102-6

    authors: Pasookhush P,Hindmarch C,Sithigorngul P,Longyant S,Bendena WG,Chaivisuthangkura P

    update_date:2019-10-22 00:00:00

  • Transcriptome profiling using pyrosequencing shows genes associated with bast fiber development in ramie (Boehmeria nivea L.).

    abstract:BACKGROUND:Ramie (Boehmeria nivea L.), popularly known as "China grass", is one of the oldest crops in China and the second most important fiber crop in terms of area sown. Ramie fiber, extracted from the plant bast, is important in the textile industry. However, the molecular mechanism of ramie fiber development remai...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-919

    authors: Chen J,Pei Z,Dai L,Wang B,Liu L,An X,Peng D

    update_date:2014-10-22 00:00:00

  • A genome-wide analysis of the phospholipid: diacylglycerol acyltransferase gene family in Gossypium.

    abstract:BACKGROUND:Cotton (Gossypium spp.) is the most important natural fiber crop worldwide, and cottonseed oil is its most important byproduct. Phospholipid: diacylglycerol acyltransferase (PDAT) is important in TAG biosynthesis, as it catalyzes the transfer of a fatty acyl moiety from the sn-2 position of a phospholipid to...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5728-8

    authors: Zang X,Geng X,Ma L,Wang N,Pei W,Wu M,Zhang J,Yu J

    update_date:2019-05-22 00:00:00

  • De novo transcriptome analysis of Perna viridis highlights tissue-specific patterns for environmental studies.

    abstract:BACKGROUND:The tropical green-lipped mussel Perna viridis is a common biomonitor throughout the Indo-Pacific region that is used for environmental monitoring and ecotoxicological investigations. However, there is limited molecular data available regarding this species. We sought to establish a global transcriptome data...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-804

    authors: Leung PT,Ip JC,Mak SS,Qiu JW,Lam PK,Wong CK,Chan LL,Leung KM

    update_date:2014-09-19 00:00:00