Accurate single nucleotide variant detection in viral populations by combining probabilistic clustering with a statistical test of strand bias.

Abstract:

BACKGROUND:Deep sequencing is a powerful tool for assessing viral genetic diversity. Such experiments harness the high coverage afforded by next generation sequencing protocols by treating sequencing reads as a population sample. Distinguishing true single nucleotide variants (SNVs) from sequencing errors remains challenging, however. Current protocols are characterised by high false positive rates, with results requiring time consuming manual checking. RESULTS:By statistical modelling, we show that if multiple variant sites are considered at once, SNVs can be called reliably from high coverage viral deep sequencing data at frequencies lower than the error rate of the sequencing technology, and that SNV calling accuracy increases as true sequence diversity within a read length increases. We demonstrate these findings on two control data sets, showing that SNV detection is more reliable on a high diversity human immunodeficiency virus sample as compared to a moderate diversity sample of hepatitis C virus. Finally, we show that in situations where probabilistic clustering retains false positive SNVs (for instance due to insufficient sample diversity or systematic errors), applying a strand bias test based on a beta-binomial model of forward read distribution can improve precision, with negligible cost to true positive recall. CONCLUSIONS:By combining probabilistic clustering (implemented in the program ShoRAH) with a statistical test of strand bias, SNVs may be called from deeply sequenced viral populations with high accuracy.

journal_name

BMC Genomics

journal_title

BMC genomics

authors

McElroy K,Zagordi O,Bull R,Luciani F,Beerenwinkel N

doi

10.1186/1471-2164-14-501

subject

Has Abstract

pub_date

2013-07-24 00:00:00

pages

501

issn

1471-2164

pii

1471-2164-14-501

journal_volume

14

pub_type

杂志文章
  • Inferring linkage disequilibrium from non-random samples.

    abstract:BACKGROUND:Linkage disequilibrium (LD) plays a fundamental role in population genetics and in the current surge of studies to screen for subtle genetic variants affecting complex traits. Methods widely implemented in LD analyses require samples to be randomly collected, which, however, are usually ignored and thus rais...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-328

    authors: Wang M,Jia T,Jiang N,Wang L,Hu X,Luo Z

    更新日期:2010-05-26 00:00:00

  • Accumulation of CTCF-binding sites drives expression divergence between tandemly duplicated genes in humans.

    abstract:BACKGROUND:During eukaryotic genome evolution, tandem gene duplication is the most frequent event giving rise to clustered gene families. However, how expression divergence between tandemly duplicated genes has emerged and maintained remain unclear. In particular, it is unknown if epigenetic regulators have been involv...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-S1-S8

    authors: Liao BY,Chang A

    更新日期:2014-01-01 00:00:00

  • MRCNN: a deep learning model for regression of genome-wide DNA methylation.

    abstract:BACKGROUND:Determination of genome-wide DNA methylation is significant for both basic research and drug development. As a key epigenetic modification, this biochemical process can modulate gene expression to influence the cell differentiation which can possibly lead to cancer. Due to the involuted biochemical mechanism...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5488-5

    authors: Tian Q,Zou J,Tang J,Fang Y,Yu Z,Fan S

    更新日期:2019-04-04 00:00:00

  • Comparative mitogenomic analysis of the superfamily Pentatomoidea (Insecta: Hemiptera: Heteroptera) and phylogenetic implications.

    abstract:BACKGROUND:Insect mitochondrial genomes (mitogenomes) are the most extensively used genetic marker for evolutionary and population genetics studies of insects. The Pentatomoidea superfamily is economically important and the largest superfamily within Pentatomomorpha with over 7,000 species. To better understand the div...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1679-x

    authors: Yuan ML,Zhang QL,Guo ZL,Wang J,Shen YY

    更新日期:2015-06-16 00:00:00

  • Identification of dysfunctional modules and disease genes in congenital heart disease by a network-based approach.

    abstract:BACKGROUND:The incidence of congenital heart disease (CHD) is continuously increasing among infants born alive nowadays, making it one of the leading causes of infant morbidity worldwide. Various studies suggest that both genetic and environmental factors lead to CHD, and therefore identifying its candidate genes and d...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-592

    authors: He D,Liu ZP,Chen L

    更新日期:2011-12-02 00:00:00

  • Environment sensing and response mediated by ABC transporters.

    abstract:BACKGROUND:Transporter proteins are one of an organism's primary interfaces with the environment. The expressed set of transporters mediates cellular metabolic capabilities and influences signal transduction pathways and regulatory networks. The functional annotation of most transporters is currently limited to general...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-S1-S8

    authors: Giuliani SE,Frank AM,Corgliano DM,Seifert C,Hauser L,Collart FR

    更新日期:2011-06-15 00:00:00

  • Mitochondrial dysregulation and oxidative stress in patients with chronic kidney disease.

    abstract:BACKGROUND:Chronic renal disease (CKD) is characterized by complex changes in cell metabolism leading to an increased production of oxygen radicals, that, in turn has been suggested to play a key role in numerous clinical complications of this pathological condition. Several reports have focused on the identification o...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-10-388

    authors: Granata S,Zaza G,Simone S,Villani G,Latorre D,Pontrelli P,Carella M,Schena FP,Grandaliano G,Pertosa G

    更新日期:2009-08-21 00:00:00

  • Nicotiana attenuata Data Hub (NaDH): an integrative platform for exploring genomic, transcriptomic and metabolomic data in wild tobacco.

    abstract:BACKGROUND:Nicotiana attenuata (coyote tobacco) is an ecological model for studying plant-environment interactions and plant gene function under real-world conditions. During the last decade, large amounts of genomic, transcriptomic and metabolomic data have been generated with this plant which has provided new insight...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3465-9

    authors: Brockmöller T,Ling Z,Li D,Gaquerel E,Baldwin IT,Xu S

    更新日期:2017-01-13 00:00:00

  • RNA-seq analysis provides insights into cold stress responses of Xanthomonas citri pv. citri.

    abstract:BACKGROUND:Xanthomonas citri pv. citri (Xcc) is a citrus canker causing Gram-negative bacteria. Currently, little is known about the biological and molecular responses of Xcc to low temperatures. RESULTS:Results depicted that low temperature significantly reduced growth and increased biofilm formation and unsaturated ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-6193-0

    authors: Liao JX,Li KH,Wang JP,Deng JR,Liu QG,Chang CQ

    更新日期:2019-11-06 00:00:00

  • Comparing de novo assemblers for 454 transcriptome data.

    abstract:BACKGROUND:Roche 454 pyrosequencing has become a method of choice for generating transcriptome data from non-model organisms. Once the tens to hundreds of thousands of short (250-450 base) reads have been produced, it is important to correctly assemble these to estimate the sequence of all the transcripts. Most transcr...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-571

    authors: Kumar S,Blaxter ML

    更新日期:2010-10-16 00:00:00

  • Genome-wide genetic aberrations of thymoma using cDNA microarray based comparative genomic hybridization.

    abstract:BACKGROUND:Thymoma is a heterogeneous group of tumors in biology and clinical behavior. Even though thymoma is divided into five subgroups following the World Health Organization classification, the nature of the disease is mixed within the subgroups. RESULTS:We investigated the molecular characteristics of genetic ch...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-8-305

    authors: Lee GY,Yang WI,Jeung HC,Kim SC,Seo MY,Park CH,Chung HC,Rha SY

    更新日期:2007-09-03 00:00:00

  • First comprehensive analysis of lysine acetylation in Alvinocaris longirostris from the deep-sea hydrothermal vents.

    abstract:BACKGROUND:Deep-sea hydrothermal vents are unique chemoautotrophic ecosystems with harsh conditions. Alvinocaris longirostris is one of the dominant crustacean species inhabiting in these extreme environments. It is significant to clarify mechanisms in their adaptation to the vents. Lysine acetylation has been known to...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-4745-3

    authors: Hui M,Cheng J,Sha Z

    更新日期:2018-05-10 00:00:00

  • Cloning and characterization of the mouse Mcoln1 gene reveals an alternatively spliced transcript not seen in humans.

    abstract:BACKGROUND:Mucolipidosis type IV (MLIV) is an autosomal recessive lysosomal storage disorder characterized by severe neurologic and ophthalmologic abnormalities. Recently the MLIV gene, MCOLN1, has been identified as a new member of the transient receptor potential (TRP) cation channel superfamily. Here we report the c...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-3-3

    authors: Falardeau JL,Kennedy JC,Acierno JS Jr,Sun M,Stahl S,Goldin E,Slaugenhaupt SA

    更新日期:2002-01-01 00:00:00

  • Transcriptomic time-series analysis of early development in olive from germinated embryos to juvenile tree.

    abstract:BACKGROUND:Despite its relevance, almost no studies account for the genetic control in the early stages of tree development, i.e. from germination on. This study seeks to make a quite complete transcriptome for olive development and to elucidate the dynamic regulation of the transcriptomic response during the early-juv...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-5232-6

    authors: Jiménez-Ruiz J,de la O Leyva-Pérez M,Vidoy-Mercado I,Barceló A,Luque F

    更新日期:2018-11-19 00:00:00

  • Gene expression analyses in Atlantic salmon challenged with infectious salmon anemia virus reveal differences between individuals with early, intermediate and late mortality.

    abstract:BACKGROUND:Infectious salmon anemia virus (ISAV) causes a multisystemic disease responsible for severe losses in salmon aquaculture. Better understanding of factors that explain variations in resistance between individuals and families is essential for development of strategies for disease control. To approach this, we...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-179

    authors: Jørgensen SM,Afanasyev S,Krasnov A

    更新日期:2008-04-18 00:00:00

  • Correction to: Proteotranscriptomics assisted gene annotation and spatial proteomics of Bombyx mori BmN4 cell line.

    abstract::An amendment to this paper has been published and can be accessed via the original article. ...

    journal_title:BMC genomics

    pub_type: 已发布勘误

    doi:10.1186/s12864-020-07211-8

    authors: Levin M,Scheibe M,Butter F

    更新日期:2020-11-12 00:00:00

  • Comparative genomics of the family Vibrionaceae reveals the wide distribution of genes encoding virulence-associated proteins.

    abstract:BACKGROUND:Species of the family Vibrionaceae are ubiquitous in marine environments. Several of these species are important pathogens of humans and marine species. Evidence indicates that genetic exchange plays an important role in the emergence of new pathogenic strains within this family. Data from the sequenced geno...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-369

    authors: Lilburn TG,Gu J,Cai H,Wang Y

    更新日期:2010-06-10 00:00:00

  • Vaginal microbiome variances in sample groups categorized by clinical criteria of bacterial vaginosis.

    abstract:BACKGROUND:One of the most common and recurrent vaginal infections is bacterial vaginosis (BV). The diagnosis is based on changes to the "normal" vaginal microbiome; however, the normal microbiome appears to differ according to reproductive status and ethnicity, and even among individuals within these groups. The Amsel...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-5284-7

    authors: Chen HM,Chang TH,Lin FM,Liang C,Chiu CM,Yang TL,Yang T,Huang CY,Cheng YN,Chang YA,Chang PY,Weng SL

    更新日期:2018-12-31 00:00:00

  • Comparative genomics of European avian pathogenic E. Coli (APEC).

    abstract:BACKGROUND:Avian pathogenic Escherichia coli (APEC) causes colibacillosis, which results in significant economic losses to the poultry industry worldwide. However, the diversity between isolates remains poorly understood. Here, a total of 272 APEC isolates collected from the United Kingdom (UK), Italy and Germany were ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3289-7

    authors: Cordoni G,Woodward MJ,Wu H,Alanazi M,Wallis T,La Ragione RM

    更新日期:2016-11-22 00:00:00

  • A graph-theoretic approach for classification and structure prediction of transmembrane β-barrel proteins.

    abstract:BACKGROUND:Transmembrane β-barrel proteins are a special class of transmembrane proteins which play several key roles in human body and diseases. Due to experimental difficulties, the number of transmembrane β-barrel proteins with known structures is very small. Over the years, a number of learning-based methods have b...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-S2-S5

    authors: Tran Vdu T,Chassignet P,Sheikh S,Steyaert JM

    更新日期:2012-04-12 00:00:00

  • Waking the sleeping dragon: gene expression profiling reveals adaptive strategies of the hibernating reptile Pogona vitticeps.

    abstract:BACKGROUND:Hibernation is a physiological state exploited by many animals exposed to prolonged adverse environmental conditions associated with winter. Large changes in metabolism and cellular function occur, with many stress response pathways modulated to tolerate physiological challenges that might otherwise be letha...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5750-x

    authors: Capraro A,O'Meally D,Waters SA,Patel HR,Georges A,Waters PD

    更新日期:2019-06-06 00:00:00

  • Conditional entropy in variation-adjusted windows detects selection signatures associated with expression quantitative trait loci (eQTLs).

    abstract:BACKGROUND:Over the past 50,000 years, shifts in human-environmental or human-human interactions shaped genetic differences within and among human populations, including variants under positive selection. Shaped by environmental factors, such variants influence the genetics of modern health, disease, and treatment outc...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-16-S8-S8

    authors: Handelman SK,Seweryn M,Smith RM,Hartmann K,Wang D,Pietrzak M,Johnson AD,Kloczkowski A,Sadee W

    更新日期:2015-01-01 00:00:00

  • Transcriptome profiling of developmental and xenobiotic responses in a keystone soil animal, the oligochaete annelid Lumbricus rubellus.

    abstract:BACKGROUND:Natural contamination and anthropogenic pollution of soils are likely to be major determinants of functioning and survival of keystone invertebrate taxa. Soil animals will have both evolutionary adaptation and genetically programmed responses to these toxic chemicals, but mechanistic understanding of such is...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-266

    authors: Owen J,Hedley BA,Svendsen C,Wren J,Jonker MJ,Hankard PK,Lister LJ,Stürzenbaum SR,Morgan AJ,Spurgeon DJ,Blaxter ML,Kille P

    更新日期:2008-06-03 00:00:00

  • Haplotype analysis of sucrose synthase gene family in three Saccharum species.

    abstract:BACKGROUND:Sugarcane is an economically important crop contributing about 80% and 40% to the world sugar and ethanol production, respectively. The complicated genetics consequential to its complex polyploid genome, however, have impeded efforts to improve sugar yield and related important agronomic traits. Modern sugar...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-314

    authors: Zhang J,Arro J,Chen Y,Ming R

    更新日期:2013-05-10 00:00:00

  • Comparative analysis of fungal protein kinases and associated domains.

    abstract:BACKGROUND:Protein phosphorylation is responsible for a large portion of the regulatory functions of eukaryotic cells. Although the list of sequenced genomes of filamentous fungi has grown rapidly, the kinomes of recently sequenced species have not yet been studied in detail. The objective of this study is to apply a c...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-133

    authors: Kosti I,Mandel-Gutfreund Y,Glaser F,Horwitz BA

    更新日期:2010-02-24 00:00:00

  • Avoidance of recognition sites of restriction-modification systems is a widespread but not universal anti-restriction strategy of prokaryotic viruses.

    abstract:BACKGROUND:Restriction-modification (R-M) systems protect bacteria and archaea from attacks by bacteriophages and archaeal viruses. An R-M system specifically recognizes short sites in foreign DNA and cleaves it, while such sites in the host DNA are protected by methylation. Prokaryotic viruses have developed a number ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-5324-3

    authors: Rusinov IS,Ershova AS,Karyagina AS,Spirin SA,Alexeevski AV

    更新日期:2018-12-07 00:00:00

  • Extensive structural variations between mitochondrial genomes of CMS and normal peppers (Capsicum annuum L.) revealed by complete nucleotide sequencing.

    abstract:BACKGROUND:Cytoplasmic male sterility (CMS) is an inability to produce functional pollen that is caused by mutation of the mitochondrial genome. Comparative analyses of mitochondrial genomes of lines with and without CMS in several species have revealed structural differences between genomes, including extensive rearra...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-561

    authors: Jo YD,Choi Y,Kim DH,Kim BD,Kang BC

    更新日期:2014-07-04 00:00:00

  • Gene duplications in the E. coli genome: common themes among pathotypes.

    abstract:BACKGROUND:Gene duplication underlies a significant proportion of gene functional diversity and genome complexity in both eukaryotes and prokaryotes. Although several reports in the literature described the duplication of specific genes in E. coli, a detailed analysis of the extent of gene duplications in this microorg...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5683-4

    authors: Bernabeu M,Sánchez-Herrero JF,Huedo P,Prieto A,Hüttener M,Rozas J,Juárez A

    更新日期:2019-04-24 00:00:00

  • Time course of the response to ACTH in pig: biological and transcriptomic study.

    abstract:BACKGROUND:HPA axis plays a major role in physiological homeostasis. It is also involved in stress and adaptive response to the environment. In farm animals in general and specifically in pigs, breeding strategies have highly favored production traits such as lean growth rate, feed efficiency and prolificacy at the cos...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-2118-8

    authors: Sautron V,Terenina E,Gress L,Lippi Y,Billon Y,Larzul C,Liaubet L,Villa-Vialaneix N,Mormède P

    更新日期:2015-11-17 00:00:00

  • Tracing the genetic history of the 'Cañaris' from Ecuador and Peru using uniparental DNA markers.

    abstract:BACKGROUND:According to history, in the pre-Hispanic period, during the conquest and Inka expansion in Ecuador, many Andean families of the Cañar region would have been displaced to several places of Tawantinsuyu, including Kañaris, a Quechua-speaking community located at the highlands of the Province of Ferreñafe, Lam...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-06834-1

    authors: Sandoval JR,Lacerda DR,Jota MMS,Robles-Ruiz P,Danos P,Paz-Y-Miño C,Wells S,Santos FR,Fujita R

    更新日期:2020-09-10 00:00:00