Finishing monkeypox genomes from short reads: assembly analysis and a neural network method.

Abstract:

BACKGROUND:Poxviruses constitute one of the largest and most complex animal virus families known. The notorious smallpox disease has been eradicated and the virus contained, but its simian sister, monkeypox is an emerging, untreatable infectious disease, killing 1 to 10 % of its human victims. In the case of poxviruses, the emergence of monkeypox outbreaks in humans and the need to monitor potential malicious release of smallpox virus requires development of methods for rapid virus identification. Whole-genome sequencing (WGS) is an emergent technology with increasing application to the diagnosis of diseases and the identification of outbreak pathogens. But "finishing" such a genome is a laborious and time-consuming process, not easily automated. To date the large, complete poxvirus genomes have not been studied comprehensively in terms of applying WGS techniques and evaluating genome assembly algorithms. RESULTS:To explore the limitations to finishing a poxvirus genome from short reads, we first analyze the repetitive regions in a monkeypox genome and evaluate genome assembly on the simulated reads. We also report on procedures and insights relevant to the assembly (from realistically short reads) of genomes. Finally, we propose a neural network method (namely Neural-KSP) to "finish" the process by closing gaps remaining after conventional assembly, as the final stage in a protocol to elucidate clinical poxvirus genomic sequences. CONCLUSIONS:The protocol may prove useful in any clinical viral isolate (regardless if a reference-strain sequence is available) and especially useful in genomes confounded by many global and local repetitive sequences embedded in them. This work highlights the feasibility of finishing real, complex genomes by systematically analyzing genetic characteristics, thus remedying existing assembly shortcomings with a neural network method. Such finished sequences may enable clinicians to track genetic distance between viral isolates that provides a powerful epidemiological tool.

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Zhao K,Wohlhueter RM,Li Y

doi

10.1186/s12864-016-2826-8

subject

Has Abstract

pub_date

2016-08-31 00:00:00

pages

497

issn

1471-2164

pii

10.1186/s12864-016-2826-8

journal_volume

17 Suppl 5

pub_type

杂志文章
  • Gene expression profiling in peanut using high density oligonucleotide microarrays.

    abstract:BACKGROUND:Transcriptome expression analysis in peanut to date has been limited to a relatively small set of genes and only recently has a significant number of ESTs been released into the public domain. Utilization of these ESTs for oligonucleotide microarrays provides a means to investigate large-scale transcript res...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-10-265

    authors: Payton P,Kottapalli KR,Rowland D,Faircloth W,Guo B,Burow M,Puppala N,Gallo M

    更新日期:2009-06-12 00:00:00

  • Transcriptomic analysis of Chinese bayberry (Myrica rubra) fruit development and ripening using RNA-Seq.

    abstract:BACKGROUND:Chinese bayberry (Myrica rubra Sieb. and Zucc.) is an important subtropical fruit crop and an ideal species for fruit quality research due to the rapid and substantial changes that occur during development and ripening, including changes in fruit color and taste. However, research at the molecular level is l...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-19

    authors: Feng C,Chen M,Xu CJ,Bai L,Yin XR,Li X,Allan AC,Ferguson IB,Chen KS

    更新日期:2012-01-13 00:00:00

  • Global transcriptome and gene regulation network for secondary metabolite biosynthesis of tea plant (Camellia sinensis).

    abstract:BACKGROUND:Major secondary metabolites, including flavonoids, caffeine, and theanine, are important components of tea products and are closely related to the taste, flavor, and health benefits of tea. Secondary metabolite biosynthesis in Camellia sinensis is differentially regulated in different tissues during growth a...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1773-0

    authors: Li CF,Zhu Y,Yu Y,Zhao QY,Wang SJ,Wang XC,Yao MZ,Luo D,Li X,Chen L,Yang YJ

    更新日期:2015-07-29 00:00:00

  • Comprehensive transcriptome analysis of grafting onto Artemisia scoparia W. to affect the aphid resistance of chrysanthemum (Chrysanthemum morifolium T.).

    abstract:BACKGROUND:Aphid (Macrosiphoniella sanbourni) stress drastically influences the yield and quality of chrysanthemum, and grafting has been widely used to improve tolerance to biotic and abiotic stresses. However, the effect of grafting on the resistance of chrysanthemum to aphids remains unclear. Therefore, we used the ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-6158-3

    authors: Zhang XY,Sun XZ,Zhang S,Yang JH,Liu FF,Fan J

    更新日期:2019-10-25 00:00:00

  • Antennal transcriptome analysis of the chemosensory gene families in the tree killing bark beetles, Ips typographus and Dendroctonus ponderosae (Coleoptera: Curculionidae: Scolytinae).

    abstract:BACKGROUND:The European spruce bark beetle, Ips typographus, and the North American mountain pine beetle, Dendroctonus ponderosae (Coleoptera: Curculionidae: Scolytinae), are severe pests of coniferous forests. Both bark beetle species utilize aggregation pheromones to coordinate mass-attacks on host trees, while odora...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-198

    authors: Andersson MN,Grosse-Wilde E,Keeling CI,Bengtsson JM,Yuen MM,Li M,Hillbur Y,Bohlmann J,Hansson BS,Schlyter F

    更新日期:2013-03-21 00:00:00

  • Genes optimized by evolution for accurate and fast translation encode in Archaea and Bacteria a broad and characteristic spectrum of protein functions.

    abstract:BACKGROUND:In many microbial genomes, a strong preference for a small number of codons can be observed in genes whose products are needed by the cell in large quantities. This codon usage bias (CUB) improves translational accuracy and speed and is one of several factors optimizing cell growth. Whereas CUB and the overr...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-617

    authors: von Mandach C,Merkl R

    更新日期:2010-11-04 00:00:00

  • Stoichiometric gene-to-reaction associations enhance model-driven analysis performance: Metabolic response to chronic exposure to Aldrin in prostate cancer.

    abstract:BACKGROUND:Genome-scale metabolic models (GSMM) integrating transcriptomics have been widely used to study cancer metabolism. This integration is achieved through logical rules that describe the association between genes, proteins, and reactions (GPRs). However, current gene-to-reaction formulation lacks the stoichiome...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5979-4

    authors: Marín de Mas I,Torrents L,Bedia C,Nielsen LK,Cascante M,Tauler R

    更新日期:2019-08-15 00:00:00

  • MicroRNA modulate alveolar epithelial response to cyclic stretch.

    abstract:BACKGROUND:MicroRNAs (miRNAs) are post-transcriptional regulators of gene expression implicated in multiple cellular processes. Cyclic stretch of alveoli is characteristic of mechanical ventilation, and is postulated to be partly responsible for the lung injury and inflammation in ventilator-induced lung injury. We pro...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-154

    authors: Yehya N,Yerrapureddy A,Tobias J,Margulies SS

    更新日期:2012-04-26 00:00:00

  • Generation and analysis of expression sequence tags from haustoria of the wheat stripe rust fungus Puccinia striiformis f. sp. Tritici.

    abstract:BACKGROUND:Stripe rust, caused by Puccinia striiformis f. sp. tritici (Pst), is one of the most destructive diseases of wheat (Triticum aestivum L.) worldwide. In spite of its agricultural importance, the genomics and genetics of the pathogen are poorly characterized. Pst transcripts from urediniospores and germinated ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-10-626

    authors: Yin C,Chen X,Wang X,Han Q,Kang Z,Hulbert SH

    更新日期:2009-12-23 00:00:00

  • A new approach for efficient genotype imputation using information from relatives.

    abstract:BACKGROUND:Genotype imputation can help reduce genotyping costs particularly for implementation of genomic selection. In applications entailing large populations, recovering the genotypes of untyped loci using information from reference individuals that were genotyped with a higher density panel is computationally chal...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-478

    authors: Sargolzaei M,Chesnais JP,Schenkel FS

    更新日期:2014-06-17 00:00:00

  • Rapid quantification of sequence repeats to resolve the size, structure and contents of bacterial genomes.

    abstract:BACKGROUND:The numerous classes of repeats often impede the assembly of genome sequences from the short reads provided by new sequencing technologies. We demonstrate a simple and rapid means to ascertain the repeat structure and total size of a bacterial or archaeal genome without the need for assembly by directly anal...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-537

    authors: Williams D,Trimble WL,Shilts M,Meyer F,Ochman H

    更新日期:2013-08-08 00:00:00

  • Mixed evolutionary origins of endogenous biomass-depolymerizing enzymes in animals.

    abstract:BACKGROUND:Animals are thought to achieve lignocellulose digestion via symbiotic associations with gut microbes; this view leads to significant focus on bacteria and fungi for lignocellulolytic systems. The presence of biomass conversion systems hardwired into animal genomes has not yet been unequivocally demonstrated....

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-4861-0

    authors: Chang WH,Lai AG

    更新日期:2018-06-20 00:00:00

  • SNP discovery by high-throughput sequencing in soybean.

    abstract:BACKGROUND:With the advance of new massively parallel genotyping technologies, quantitative trait loci (QTL) fine mapping and map-based cloning become more achievable in identifying genes for important and complex traits. Development of high-density genetic markers in the QTL regions of specific mapping populations is ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-469

    authors: Wu X,Ren C,Joshi T,Vuong T,Xu D,Nguyen HT

    更新日期:2010-08-11 00:00:00

  • A systematic evaluation of expression of HERV-W elements; influence of genomic context, viral structure and orientation.

    abstract:BACKGROUND:One member of the W family of human endogenous retroviruses (HERV) appears to have been functionally adopted by the human host. Nevertheless, a highly diversified and regulated transcription from a range of HERV-W elements has been observed in human tissues and cells. Aberrant expression of members of this f...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-22

    authors: Li F,Nellåker C,Yolken RH,Karlsson H

    更新日期:2011-01-12 00:00:00

  • Global proteomic analysis of the oocyst/sporozoite of Toxoplasma gondii reveals commitment to a host-independent lifestyle.

    abstract:BACKGROUND:Toxoplasmosis is caused by the apicomplexan parasite Toxoplasma gondii and can be acquired either congenitally or via the oral route. In the latter case, transmission is mediated by two distinct invasive stages, i.e., bradyzoites residing in tissue cysts or sporozoites contained in environmentally resistant ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-183

    authors: Possenti A,Fratini F,Fantozzi L,Pozio E,Dubey JP,Ponzi M,Pizzi E,Spano F

    更新日期:2013-03-15 00:00:00

  • Comparison of gene expression of Paramecium bursaria with and without Chlorella variabilis symbionts.

    abstract:BACKGROUND:The ciliate Paramecium bursaria harbors several hundred cells of the green-alga Chlorella sp. in their cytoplasm. Irrespective of the mutual relation between P. bursaria and the symbiotic algae, both cells retain the ability to grow without the partner. They can easily reestablish endosymbiosis when put in c...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-183

    authors: Kodama Y,Suzuki H,Dohra H,Sugii M,Kitazume T,Yamaguchi K,Shigenobu S,Fujishima M

    更新日期:2014-03-10 00:00:00

  • Comparative analyses of the variation of the transcriptome and proteome of Rhodobacter sphaeroides throughout growth.

    abstract:BACKGROUND:In natural environments, bacteria must frequently cope with extremely scarce nutrients. Most studies focus on bacterial growth in nutrient replete conditions, while less is known about the stationary phase. Here, we are interested in global gene expression throughout all growth phases, including the adjustme...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5749-3

    authors: Bathke J,Konzer A,Remes B,McIntosh M,Klug G

    更新日期:2019-05-09 00:00:00

  • Analysis of whole genome sequencing for the Escherichia coli O157:H7 typing phages.

    abstract:BACKGROUND:Shiga toxin producing Escherichia coli O157 can cause severe bloody diarrhea and haemolytic uraemic syndrome. Phage typing of E. coli O157 facilitates public health surveillance and outbreak investigations, certain phage types are more likely to occupy specific niches and are associated with specific age gro...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1470-z

    authors: Cowley LA,Beckett SJ,Chase-Topping M,Perry N,Dallman TJ,Gally DL,Jenkins C

    更新日期:2015-04-08 00:00:00

  • Flux of transcript patterns during soybean seed development.

    abstract:BACKGROUND:To understand gene expression networks leading to functional properties of the soybean seed, we have undertaken a detailed examination of soybean seed development during the stages of major accumulation of oils, proteins, and starches, as well as the desiccating and mature stages, using microarrays consistin...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-136

    authors: Jones SI,Gonzalez DO,Vodkin LO

    更新日期:2010-02-24 00:00:00

  • Microarray-based estimation of SNP allele-frequency in pooled DNA using the Langmuir kinetic model.

    abstract:BACKGROUND:High throughput genotyping of single nucleotide polymorphisms (SNPs) for genome-wide association requires technologies for generating millions of genotypes with relative ease but also at a reasonable cost and with high accuracy. In this work, we have developed a theoretical approach to estimate allele freque...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-605

    authors: Yin BC,Li H,Ye BC

    更新日期:2008-12-16 00:00:00

  • Correction to: De novo profiling of RNA viruses in Anopheles malaria vector mosquitoes from forest ecological zones in Senegal and Cambodia.

    abstract::Following the publication of this article [1], the authors reported that the original shading in columns 3 and 4 of Table 3, which indicated the presence or absence of viruses in each library, had been removed during typesetting. ...

    journal_title:BMC genomics

    pub_type: 杂志文章,已发布勘误

    doi:10.1186/s12864-019-6067-5

    authors: Belda E,Nanfack-Minkeu F,Eiglmeier K,Carissimo G,Holm I,Diallo M,Diallo D,Vantaux A,Kim S,Sharakhov IV,Vernick KD

    更新日期:2019-09-05 00:00:00

  • Combinatorial control of temporal gene expression in the Drosophila wing by enhancers and core promoters.

    abstract:BACKGROUND:The transformation of a developing epithelium into an adult structure is a complex process, which often involves coordinated changes in cell proliferation, metabolism, adhesion, and shape. To identify genetic mechanisms that control epithelial differentiation, we analyzed the temporal patterns of gene expres...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-498

    authors: O'Keefe DD,Thomas SR,Bolin K,Griggs E,Edgar BA,Buttitta LA

    更新日期:2012-09-20 00:00:00

  • Genome-wide association study reveals novel loci associated with body size and carcass yields in Pekin ducks.

    abstract:BACKGROUND:Pekin duck products have become popular in Asia over recent decades and account for an increasing market share. However, the genetic mechanisms affecting carcass growth in Pekin ducks remain unknown. This study aimed to identify quantitative trait loci affecting body size and carcass yields in Pekin ducks. ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-5379-1

    authors: Deng MT,Zhu F,Yang YZ,Yang FX,Hao JP,Chen SR,Hou ZC

    更新日期:2019-01-03 00:00:00

  • Comparing Mycobacterium tuberculosis genomes using genome topology networks.

    abstract:BACKGROUND:Over the last decade, emerging research methods, such as comparative genomic analysis and phylogenetic study, have yielded new insights into genotypes and phenotypes of closely related bacterial strains. Several findings have revealed that genomic structural variations (SVs), including gene gain/loss, gene d...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1259-0

    authors: Jiang J,Gu J,Zhang L,Zhang C,Deng X,Dou T,Zhao G,Zhou Y

    更新日期:2015-02-14 00:00:00

  • Introduction to the proceedings of the Avian Genomics and Gene Ontology Annotation Workshop.

    abstract::The Avian Genomics Conference and Gene Ontology Annotation Workshop brought together researchers and students from around the world to present their latest research addressing the delivery of value from the billions of base-pairs of Archosaur sequence that have become available in the last few years. This editorial de...

    journal_title:BMC genomics

    pub_type:

    doi:10.1186/1471-2164-10-S2-I1

    authors: Bridges SM,Burgess SC,McCarthy FM

    更新日期:2009-07-14 00:00:00

  • A deconvolution method and its application in analyzing the cellular fractions in acute myeloid leukemia samples.

    abstract:BACKGROUND:The identification of cell type-specific genes (markers) is an essential step for the deconvolution of the cellular fractions, primarily, from the gene expression data of a bulk sample. However, the genes with significant changes identified by pair-wise comparisons cannot indeed represent the specificity of ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-06888-1

    authors: Li H,Sharma A,Ming W,Sun X,Liu H

    更新日期:2020-09-23 00:00:00

  • Comparative transcriptomics provide insight into the morphogenesis and evolution of fistular leaves in Allium.

    abstract:BACKGROUND:Fistular leaves frequently appear in Allium species, and previous developmental studies have proposed that the process of fistular leaf formation involves programmed cell death. However, molecular evidence for the role of programmed cell death in the formation of fistular leaf cavities has yet to be reported...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3474-8

    authors: Zhu S,Tang S,Tan Z,Yu Y,Dai Q,Liu T

    更新日期:2017-01-10 00:00:00

  • Differential control of Zap1-regulated genes in response to zinc deficiency in Saccharomyces cerevisiae.

    abstract:BACKGROUND:The Zap1 transcription factor is a central player in the response of yeast to changes in zinc status. We previously used transcriptome profiling with DNA microarrays to identify 46 potential Zap1 target genes in the yeast genome. In this new study, we used complementary methods to identify additional Zap1 ta...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-370

    authors: Wu CY,Bird AJ,Chung LM,Newton MA,Winge DR,Eide DJ

    更新日期:2008-08-01 00:00:00

  • Genome-wide survey of two-component signal transduction systems in the plant growth-promoting bacterium Azospirillum.

    abstract:BACKGROUND:Two-component systems (TCS) play critical roles in sensing and responding to environmental cues. Azospirillum is a plant growth-promoting rhizobacterium living in the rhizosphere of many important crops. Despite numerous studies about its plant beneficial properties, little is known about how the bacterium s...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1962-x

    authors: Borland S,Oudart A,Prigent-Combaret C,Brochier-Armanet C,Wisniewski-Dyé F

    更新日期:2015-10-22 00:00:00

  • Transcriptomic analysis of Macrobrachium rosenbergii (giant fresh water prawn) post-larvae in response to M. rosenbergii nodavirus (MrNV) infection: de novo assembly and functional annotation.

    abstract:BACKGROUND:Macrobrachium rosenbergii, is one of a major freshwater prawn species cultured in Southeast Asia. White tail disease (WTD), caused by Macrobrachium rosenbergii nodavirus (MrNV), is a serious problem in farm cultivation and is responsible for up to 100% mortality in the post larvae stage. Molecular data on ho...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-6102-6

    authors: Pasookhush P,Hindmarch C,Sithigorngul P,Longyant S,Bendena WG,Chaivisuthangkura P

    更新日期:2019-10-22 00:00:00