De novo assembly of highly diverse viral populations.

Abstract:

BACKGROUND:Extensive genetic diversity in viral populations within infected hosts and the divergence of variants from existing reference genomes impede the analysis of deep viral sequencing data. A de novo population consensus assembly is valuable both as a single linear representation of the population and as a backbone on which intra-host variants can be accurately mapped. The availability of consensus assemblies and robustly mapped variants are crucial to the genetic study of viral disease progression, transmission dynamics, and viral evolution. Existing de novo assembly techniques fail to robustly assemble ultra-deep sequence data from genetically heterogeneous populations such as viruses into full-length genomes due to the presence of extensive genetic variability, contaminants, and variable sequence coverage. RESULTS:We present VICUNA, a de novo assembly algorithm suitable for generating consensus assemblies from genetically heterogeneous populations. We demonstrate its effectiveness on Dengue, Human Immunodeficiency and West Nile viral populations, representing a range of intra-host diversity. Compared to state-of-the-art assemblers designed for haploid or diploid systems, VICUNA recovers full-length consensus and captures insertion/deletion polymorphisms in diverse samples. Final assemblies maintain a high base calling accuracy. VICUNA program is publicly available at: http://www.broadinstitute.org/scientific-community/science/projects/viral-genomics/ viral-genomics-analysis-software. CONCLUSIONS:We developed VICUNA, a publicly available software tool, that enables consensus assembly of ultra-deep sequence derived from diverse viral populations. While VICUNA was developed for the analysis of viral populations, its application to other heterogeneous sequence data sets such as metagenomic or tumor cell population samples may prove beneficial in these fields of research.

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Yang X,Charlebois P,Gnerre S,Coole MG,Lennon NJ,Levin JZ,Qu J,Ryan EM,Zody MC,Henn MR

doi

10.1186/1471-2164-13-475

subject

Has Abstract

pub_date

2012-09-13 00:00:00

pages

475

issn

1471-2164

pii

1471-2164-13-475

journal_volume

13

pub_type

杂志文章
  • Selective constraint, background selection, and mutation accumulation variability within and between human populations.

    abstract:BACKGROUND:Regions of the genome that are under evolutionary constraint across multiple species have previously been used to identify functional sequences in the human genome. Furthermore, it is known that there is an inverse relationship between evolutionary constraint and the allele frequency of a mutation segregatin...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-495

    authors: Hodgkinson A,Casals F,Idaghdour Y,Grenier JC,Hernandez RD,Awadalla P

    更新日期:2013-07-23 00:00:00

  • Altered gene expression in the superior temporal gyrus in schizophrenia.

    abstract:BACKGROUND:The superior temporal gyrus (STG), which encompasses the primary auditory cortex, is believed to be a major anatomical substrate for speech, language and communication. The STG connects to the limbic system (hippocampus and amygdala), the thalamus and neocortical association areas in the prefrontal cortex, a...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-199

    authors: Bowden NA,Scott RJ,Tooney PA

    更新日期:2008-04-29 00:00:00

  • scReQTL: an approach to correlate SNVs to gene expression from individual scRNA-seq datasets.

    abstract:BACKGROUND:Recently, pioneering expression quantitative trait loci (eQTL) studies on single cell RNA sequencing (scRNA-seq) data have revealed new and cell-specific regulatory single nucleotide variants (SNVs). Here, we present an alternative QTL-related approach applicable to transcribed SNV loci from scRNA-seq data: ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-07334-y

    authors: Liu H,Prashant NM,Spurr LF,Bousounis P,Alomran N,Ibeawuchi H,Sein J,Słowiński P,Tsaneva-Atanasova K,Horvath A

    更新日期:2021-01-08 00:00:00

  • Identification and evolutionary analysis of the nucleolar proteome of Giardia lamblia.

    abstract:BACKGROUND:The nucleoli, including their proteomes, of higher eukaryotes have been extensively studied, while few studies about the nucleoli of the lower eukaryotes - protists were reported. Giardia lamblia, a protist with the controversy of whether it is an extreme primitive eukaryote or just a highly evolved parasite...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-6679-9

    authors: Feng JM,Yang CL,Tian HF,Wang JX,Wen JF

    更新日期:2020-03-30 00:00:00

  • Monophyly of clade III nematodes is not supported by phylogenetic analysis of complete mitochondrial genome sequences.

    abstract:BACKGROUND:The orders Ascaridida, Oxyurida, and Spirurida represent major components of zooparasitic nematode diversity, including many species of veterinary and medical importance. Phylum-wide nematode phylogenetic hypotheses have mainly been based on nuclear rDNA sequences, but more recently complete mitochondrial (m...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-392

    authors: Park JK,Sultana T,Lee SH,Kang S,Kim HK,Min GS,Eom KS,Nadler SA

    更新日期:2011-08-03 00:00:00

  • The Zygosaccharomyces bailii transcription factor Haa1 is required for acetic acid and copper stress responses suggesting subfunctionalization of the ancestral bifunctional protein Haa1/Cup2.

    abstract:BACKGROUND:The food spoilage yeast species Zygosaccharomyces bailii exhibits an extraordinary capacity to tolerate weak acids, in particular acetic acid. In Saccharomyces cerevisiae, the transcription factor Haa1 (ScHaa1) is considered the main player in genomic expression reprogramming in response to acetic acid stres...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3443-2

    authors: Palma M,Dias PJ,Roque FC,Luzia L,Guerreiro JF,Sá-Correia I

    更新日期:2017-01-13 00:00:00

  • A crustacean annotated transcriptome (CAT) database.

    abstract:BACKGROUND:Decapods are an order of crustaceans which includes shrimps, crabs, lobsters and crayfish. They occur worldwide and are of great scientific interest as well as being of ecological and economic importance in fisheries and aquaculture. However, our knowledge of their biology mainly comes from the group which i...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-6433-3

    authors: Nong W,Chai ZYH,Jiang X,Qin J,Ma KY,Chan KM,Chan TF,Chow BKC,Kwan HS,Wong CKC,Qiu JW,Hui JHL,Chu KH

    更新日期:2020-01-09 00:00:00

  • Exploring the genetics of trotting racing ability in horses using a unique Nordic horse model.

    abstract:BACKGROUND:Horses have been strongly selected for speed, strength, and endurance-exercise traits since the onset of domestication. As a result, highly specialized horse breeds have developed with many modern horse breeds often representing closed populations with high phenotypic and genetic uniformity. However, a great...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5484-9

    authors: Velie BD,Lillie M,Fegraeus KJ,Rosengren MK,Solé M,Wiklund M,Ihler CF,Strand E,Lindgren G

    更新日期:2019-02-04 00:00:00

  • Flashy flagella: flagellin modification is relatively common and highly versatile among the Enterobacteriaceae.

    abstract:BACKGROUND:Post-translational glycosylation of the flagellin protein is relatively common among Gram-negative bacteria, and has been linked to several phenotypes, including flagellar biosynthesis and motility, biofilm formation, host immune evasion and manipulation and virulence. However to date, despite extensive phys...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2735-x

    authors: De Maayer P,Cowan DA

    更新日期:2016-05-20 00:00:00

  • Identification, characterization, and utilization of genome-wide simple sequence repeats to identify a QTL for acidity in apple.

    abstract:BACKGROUND:Apple is an economically important fruit crop worldwide. Developing a genetic linkage map is a critical step towards mapping and cloning of genes responsible for important horticultural traits in apple. To facilitate linkage map construction, we surveyed and characterized the distribution and frequency of pe...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-537

    authors: Zhang Q,Ma B,Li H,Chang Y,Han Y,Li J,Wei G,Zhao S,Khan MA,Zhou Y,Gu C,Zhang X,Han Z,Korban SS,Li S,Han Y

    更新日期:2012-10-07 00:00:00

  • Quantitative proteomic analysis of host--pathogen interactions: a study of Acinetobacter baumannii responses to host airways.

    abstract:BACKGROUND:Acinetobacter baumannii is a major health problem. The most common infection caused by A. baumannii is hospital acquired pneumonia, and the associated mortality rate is approximately 50%. Neither in vivo nor ex vivo expression profiling has been performed at the proteomic or transcriptomic level for pneumoni...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1608-z

    authors: Méndez JA,Mateos J,Beceiro A,Lopez M,Tomás M,Poza M,Bou G

    更新日期:2015-05-30 00:00:00

  • Acute systemic inflammatory response to lipopolysaccharide stimulation in pigs divergently selected for residual feed intake.

    abstract:BACKGROUND:It is unclear whether improving feed efficiency by selection for low residual feed intake (RFI) compromises pigs' immunocompetence. Here, we aimed at investigating whether pig lines divergently selected for RFI had different inflammatory responses to lipopolysaccharide (LPS) exposure, regarding to clinical p...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-6127-x

    authors: Liu H,Feye KM,Nguyen YT,Rakhshandeh A,Loving CL,Dekkers JCM,Gabler NK,Tuggle CK

    更新日期:2019-10-11 00:00:00

  • Modular assembly of transposable element arrays by microsatellite targeting in the guayule and rice genomes.

    abstract:BACKGROUND:Guayule (Parthenium argentatum A. Gray) is a rubber-producing desert shrub native to Mexico and the United States. Guayule represents an alternative to Hevea brasiliensis as a source for commercial natural rubber. The efficient application of modern molecular/genetic tools to guayule improvement requires cha...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-4653-6

    authors: Valdes Franco JA,Wang Y,Huo N,Ponciano G,Colvin HA,McMahan CM,Gu YQ,Belknap WR

    更新日期:2018-04-19 00:00:00

  • SeAMotE: a method for high-throughput motif discovery in nucleic acid sequences.

    abstract:BACKGROUND:The large amount of data produced by high-throughput sequencing poses new computational challenges. In the last decade, several tools have been developed for the identification of transcription and splicing factor binding sites. RESULTS:Here, we introduce the SeAMotE (Sequence Analysis of Motifs Enrichment)...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-925

    authors: Agostini F,Cirillo D,Ponti RD,Tartaglia GG

    更新日期:2014-10-23 00:00:00

  • Genome-wide binding analysis of the transcriptional regulator TrmBL1 in Pyrococcus furiosus.

    abstract:BACKGROUND:Several in vitro studies document the function of the transcriptional regulator TrmBL1 of Pyrococcus furiosus. These data indicate that the protein can act as repressor or activator and is mainly involved in transcriptional control of sugar uptake and in the switch between glycolysis and gluconeogenesis. The...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-2360-0

    authors: Reichelt R,Gindner A,Thomm M,Hausner W

    更新日期:2016-01-08 00:00:00

  • Recurrence-associated pathways in hepatitis B virus-positive hepatocellular carcinoma.

    abstract:BACKGROUND:Despite the recent identification of several prognostic gene signatures, the lack of common genes among experimental cohorts has posed a considerable challenge in uncovering the molecular basis underlying hepatocellular carcinoma (HCC) recurrence for application in clinical purposes. To overcome the limitati...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1472-x

    authors: Kim BY,Choi DW,Woo SR,Park ER,Lee JG,Kim SH,Koo I,Park SH,Han CJ,Kim SB,Yeom YI,Yang SJ,Yu A,Lee JW,Jang JJ,Cho MH,Jeon WK,Park YN,Suh KS,Lee KH

    更新日期:2015-04-10 00:00:00

  • High throughput sequencing in mice: a platform comparison identifies a preponderance of cryptic SNPs.

    abstract:BACKGROUND:Allelic variation is the cornerstone of genetically determined differences in gene expression, gene product structure, physiology, and behavior. However, allelic variation, particularly cryptic (unknown or not annotated) variation, is problematic for follow up analyses. Polymorphisms result in a high inciden...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-10-379

    authors: Walter NA,Bottomly D,Laderas T,Mooney MA,Darakjian P,Searles RP,Harrington CA,McWeeney SK,Hitzemann R,Buck KJ

    更新日期:2009-08-17 00:00:00

  • MicroRNA modulate alveolar epithelial response to cyclic stretch.

    abstract:BACKGROUND:MicroRNAs (miRNAs) are post-transcriptional regulators of gene expression implicated in multiple cellular processes. Cyclic stretch of alveoli is characteristic of mechanical ventilation, and is postulated to be partly responsible for the lung injury and inflammation in ventilator-induced lung injury. We pro...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-154

    authors: Yehya N,Yerrapureddy A,Tobias J,Margulies SS

    更新日期:2012-04-26 00:00:00

  • The diversity of small non-coding RNAs in the diatom Phaeodactylum tricornutum.

    abstract:BACKGROUND:Marine diatoms constitute a major component of eukaryotic phytoplankton and stand at the crossroads of several evolutionary lineages. These microalgae possess peculiar genomic features and novel combinations of genes acquired from bacterial, animal and plant ancestors. Furthermore, they display both DNA meth...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-698

    authors: Rogato A,Richard H,Sarazin A,Voss B,Cheminant Navarro S,Champeimont R,Navarro L,Carbone A,Hess WR,Falciatore A

    更新日期:2014-08-20 00:00:00

  • Whole transcriptome analyses of six thoroughbred horses before and after exercise using RNA-Seq.

    abstract:BACKGROUND:Thoroughbred horses are the most expensive domestic animals, and their running ability and knowledge about their muscle-related diseases are important in animal genetics. While the horse reference genome is available, there has been no large-scale functional annotation of the genome using expressed genes der...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-473

    authors: Park KD,Park J,Ko J,Kim BC,Kim HS,Ahn K,Do KT,Choi H,Kim HM,Song S,Lee S,Jho S,Kong HS,Yang YM,Jhun BH,Kim C,Kim TH,Hwang S,Bhak J,Lee HK,Cho BW

    更新日期:2012-09-12 00:00:00

  • BAC library resources for map-based cloning and physical map construction in barley (Hordeum vulgare L.).

    abstract:BACKGROUND:Although second generation sequencing (2GS) technologies allow re-sequencing of previously gold-standard-sequenced genomes, whole genome shotgun sequencing and de novo assembly of large and complex eukaryotic genomes is still difficult. Availability of a genome-wide physical map is therefore still a prerequi...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-247

    authors: Schulte D,Ariyadasa R,Shi B,Fleury D,Saski C,Atkins M,deJong P,Wu CC,Graner A,Langridge P,Stein N

    更新日期:2011-05-19 00:00:00

  • Transcriptome analysis of the oil-rich seed of the bioenergy crop Jatropha curcas L.

    abstract:BACKGROUND:To date, oil-rich plants are the main source of biodiesel products. Because concerns have been voiced about the impact of oil-crop cultivation on the price of food commodities, the interest in oil plants not used for food production and amenable to cultivation on non-agricultural land has soared. As a non-fo...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-462

    authors: Costa GG,Cardoso KC,Del Bem LE,Lima AC,Cunha MA,de Campos-Leite L,Vicentini R,Papes F,Moreira RC,Yunes JA,Campos FA,Da Silva MJ

    更新日期:2010-08-06 00:00:00

  • Ostreococcus tauri is a new model green alga for studying iron metabolism in eukaryotic phytoplankton.

    abstract:BACKGROUND:Low iron bioavailability is a common feature of ocean surface water and therefore micro-algae developed original strategies to optimize iron uptake and metabolism. The marine picoeukaryotic green alga Ostreococcus tauri is a very good model for studying physiological and genetic aspects of the adaptation of ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2666-6

    authors: Lelandais G,Scheiber I,Paz-Yepes J,Lozano JC,Botebol H,Pilátová J,Žárský V,Léger T,Blaiseau PL,Bowler C,Bouget FY,Camadro JM,Sutak R,Lesuisse E

    更新日期:2016-05-03 00:00:00

  • Computational approach to predict species-specific type III secretion system (T3SS) effectors using single and multiple genomes.

    abstract:BACKGROUND:Many gram-negative bacteria use type III secretion systems (T3SSs) to translocate effector proteins into host cells. T3SS effectors can give some bacteria a competitive edge over others within the same environment and can help bacteria to invade the host cells and allow them to multiply rapidly within the ho...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3363-1

    authors: Hobbs CK,Porter VL,Stow ML,Siame BA,Tsang HH,Leung KY

    更新日期:2016-12-19 00:00:00

  • XenDB: full length cDNA prediction and cross species mapping in Xenopus laevis.

    abstract:BACKGROUND:Research using the model system Xenopus laevis has provided critical insights into the mechanisms of early vertebrate development and cell biology. Large scale sequencing efforts have provided an increasingly important resource for researchers. To provide full advantage of the available sequence, we have ana...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-6-123

    authors: Sczyrba A,Beckstette M,Brivanlou AH,Giegerich R,Altmann CR

    更新日期:2005-09-14 00:00:00

  • A new allele for aluminium tolerance gene in barley (Hordeum vulgare L.).

    abstract:BACKGROUND:Aluminium (Al) toxicity is the main factor limiting the crop production in acid soils and barley (Hordeum vulgare L.) is one of the most Al-sensitive of the small-grained cereals. The major gene for Al tolerance in barley is HvAACT1 (HvMATE) on chromosome 4H which encodes a multidrug and toxic compound extru...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2551-3

    authors: Ma Y,Li C,Ryan PR,Shabala S,You J,Liu J,Liu C,Zhou M

    更新日期:2016-03-05 00:00:00

  • Ontology and diversity of transcript-associated microsatellites mined from a globe artichoke EST database.

    abstract:BACKGROUND:The globe artichoke (Cynara cardunculus var. scolymus L.) is a significant crop in the Mediterranean basin. Despite its commercial importance and its both dietary and pharmaceutical value, knowledge of its genetics and genomics remains scant. Microsatellite markers have become a key tool in genetic and genom...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-10-454

    authors: Scaglione D,Acquadro A,Portis E,Taylor CA,Lanteri S,Knapp SJ

    更新日期:2009-09-28 00:00:00

  • Transcriptomic and proteomic analyses of a new cytoplasmic male sterile line with a wild Gossypium bickii genetic background.

    abstract:BACKGROUND:Cotton is an important fiber crop but has serious heterosis effects, and cytoplasmic male sterility (CMS) is the major cause of heterosis in plants. However, to the best of our knowledge, no studies have investigated CMS Yamian A in cotton with the genetic background of Australian wild Gossypium bickii. Conj...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-07261-y

    authors: Zhao H,Wang J,Qu Y,Peng R,Magwanga RO,Liu F,Huang J

    更新日期:2020-12-02 00:00:00

  • Secondary structure in the target as a confounding factor in synthetic oligomer microarray design.

    abstract:BACKGROUND:Secondary structure in the target is a property not usually considered in software applications for design of optimal custom oligonucleotide probes. It is frequently assumed that eliminating self-complementarity, or screening for secondary structure in the probe, is sufficient to avoid interference with hybr...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-6-31

    authors: Ratushna VG,Weller JW,Gibas CJ

    更新日期:2005-03-08 00:00:00

  • Characterization of SR3 reveals abundance of non-LTR retrotransposons of the RTE clade in the genome of the human blood fluke, Schistosoma mansoni.

    abstract:BACKGROUND:It is becoming apparent that perhaps as much as half of the genome of the human blood fluke Schistosoma mansoni is constituted of mobile genetic element-related sequences. Non-long terminal repeat (LTR) retrotransposons, related to the LINE elements of mammals, comprise much of this repetitive component of t...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-6-154

    authors: Laha T,Kewgrai N,Loukas A,Brindley PJ

    更新日期:2005-11-04 00:00:00