Genotyping structural variants in pangenome graphs using the vg toolkit.

Abstract:

:Structural variants (SVs) remain challenging to represent and study relative to point mutations despite their demonstrated importance. We show that variation graphs, as implemented in the vg toolkit, provide an effective means for leveraging SV catalogs for short-read SV genotyping experiments. We benchmark vg against state-of-the-art SV genotypers using three sequence-resolved SV catalogs generated by recent long-read sequencing studies. In addition, we use assemblies from 12 yeast strains to show that graphs constructed directly from aligned de novo assemblies improve genotyping compared to graphs built from intermediate SV catalogs in the VCF format.

journal_name

Genome Biol

journal_title

Genome biology

authors

Hickey G,Heller D,Monlong J,Sibbesen JA,Sirén J,Eizenga J,Dawson ET,Garrison E,Novak AM,Paten B

doi

10.1186/s13059-020-1941-7

subject

Has Abstract

pub_date

2020-02-12 00:00:00

pages

35

issue

1

eissn

1474-7596

issn

1474-760X

pii

10.1186/s13059-020-1941-7

journal_volume

21

pub_type

杂志文章
  • Whole-genome resequencing of Escherichia coli K-12 MG1655 undergoing short-term laboratory evolution in lactate minimal media reveals flexible selection of adaptive mutations.

    abstract:BACKGROUND:Short-term laboratory evolution of bacteria followed by genomic sequencing provides insight into the mechanism of adaptive evolution, such as the number of mutations needed for adaptation, genotype-phenotype relationships, and the reproducibility of adaptive outcomes. RESULTS:In the present study, we descri...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2009-10-10-r118

    authors: Conrad TM,Joyce AR,Applebee MK,Barrett CL,Xie B,Gao Y,Palsson BØ

    更新日期:2009-01-01 00:00:00

  • An integrated computational pipeline and database to support whole-genome sequence annotation.

    abstract::We describe here our experience in annotating the Drosophila melanogaster genome sequence, in the course of which we developed several new open-source software tools and a database schema to support large-scale genome annotation. We have developed these into an integrated and reusable software system for whole-genome ...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2002-3-12-research0081

    authors: Mungall CJ,Misra S,Berman BP,Carlson J,Frise E,Harris N,Marshall B,Shu S,Kaminker JS,Prochnik SE,Smith CD,Smith E,Tupy JL,Wiel C,Rubin GM,Lewis SE

    更新日期:2002-01-01 00:00:00

  • Protein recoding by ADAR1-mediated RNA editing is not essential for normal development and homeostasis.

    abstract:BACKGROUND:Adenosine-to-inosine (A-to-I) editing of dsRNA by ADAR proteins is a pervasive epitranscriptome feature. Tens of thousands of A-to-I editing events are defined in the mouse, yet the functional impact of most is unknown. Editing causing protein recoding is the essential function of ADAR2, but an essential rol...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-017-1301-4

    authors: Heraud-Farlow JE,Chalk AM,Linder SE,Li Q,Taylor S,White JM,Pang L,Liddicoat BJ,Gupte A,Li JB,Walkley CR

    更新日期:2017-09-05 00:00:00

  • Anticipating the 1,000 dollar genome.

    abstract::A new generation of DNA-sequencing platforms will become commercially available over the next few years. These instruments will enable re-sequencing of human genomes at a previously unimagined throughput and low cost. Here, I examine why the 1,000 dollar human genome is an important goal for research and clinical diag...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2006-7-7-112

    authors: Mardis ER

    更新日期:2006-01-01 00:00:00

  • Evidence from comparative genomics for a complete sexual cycle in the 'asexual' pathogenic yeast Candida glabrata.

    abstract:BACKGROUND:Candida glabrata is a pathogenic yeast of increasing medical concern. It has been regarded as asexual since it was first described in 1917, yet phylogenetic analyses have revealed that it is more closely related to sexual yeasts than other Candida species. We show here that the C. glabrata genome contains ma...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2003-4-2-r10

    authors: Wong S,Fares MA,Zimmermann W,Butler G,Wolfe KH

    更新日期:2003-01-01 00:00:00

  • Evolution enters the genomic era.

    abstract::A report on the 18th Congress of the European Society for Evolutionary Biology (ESEB), Aarhus, Denmark, 20-25 August, 2001. ...

    journal_title:Genome biology

    pub_type:

    doi:10.1186/gb-2001-2-11-reports4026

    authors: Liberles DA

    更新日期:2001-01-01 00:00:00

  • Plant immunity from A to Z.

    abstract::A report of The Keystone Symposium on Plant Innate Immunity, Keystone, USA, 10-15 February 2008. ...

    journal_title:Genome biology

    pub_type:

    doi:10.1186/gb-2008-9-4-304

    authors: Robatzek S,Saijo Y

    更新日期:2008-01-01 00:00:00

  • Chromatin Central: towards the comparative proteome by accurate mapping of the yeast proteomic environment.

    abstract:BACKGROUND:Understanding the design logic of living systems requires the understanding and comparison of proteomes. Proteomes define the commonalities between organisms more precisely than genomic sequences. Because uncertainties remain regarding the accuracy of proteomic data, several issues need to be resolved before...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2008-9-11-r167

    authors: Shevchenko A,Roguev A,Schaft D,Buchanan L,Habermann B,Sakalar C,Thomas H,Krogan NJ,Shevchenko A,Stewart AF

    更新日期:2008-01-01 00:00:00

  • A new recruit for the army of the men of death.

    abstract::The army of the men of death, in John Bunyan's memorable phrase, has a new recruit, and fear has a new face: a face wearing a surgical mask. ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2003-4-7-113

    authors: Petsko GA

    更新日期:2003-01-01 00:00:00

  • Domain atrophy creates rare cases of functional partial protein domains.

    abstract:BACKGROUND:Protein domains display a range of structural diversity, with numerous additions and deletions of secondary structural elements between related domains. We have observed a small number of cases of surprising large-scale deletions of core elements of structural domains. We propose a new concept called domain ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-015-0655-8

    authors: Prakash A,Bateman A

    更新日期:2015-04-30 00:00:00

  • A novel and universal method for microRNA RT-qPCR data normalization.

    abstract::Gene expression analysis of microRNA molecules is becoming increasingly important. In this study we assess the use of the mean expression value of all expressed microRNAs in a given sample as a normalization factor for microRNA real-time quantitative PCR data and compare its performance to the currently adopted approa...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2009-10-6-r64

    authors: Mestdagh P,Van Vlierberghe P,De Weer A,Muth D,Westermann F,Speleman F,Vandesompele J

    更新日期:2009-01-01 00:00:00

  • A comparison of programmed cell death between species.

    abstract::Key components of the programmed cell death pathway are conserved between Caenorhabditis elegans, Drosophila melanogaster and humans. The search for additional homologs has been facilitated by the availability of the entire genomic sequence for each of these organisms. ...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2000-1-3-reviews0003

    authors: Tittel JN,Steller H

    更新日期:2000-01-01 00:00:00

  • MicroRNAs and their isomiRs function cooperatively to target common biological pathways.

    abstract:BACKGROUND:Variants of microRNAs (miRNAs), called isomiRs, are commonly reported in deep-sequencing studies; however, the functional significance of these variants remains controversial. Observational studies show that isomiR patterns are non-random, hinting that these molecules could be regulated and therefore functio...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2011-12-12-r126

    authors: Cloonan N,Wani S,Xu Q,Gu J,Lea K,Heater S,Barbacioru C,Steptoe AL,Martin HC,Nourbakhsh E,Krishnan K,Gardiner B,Wang X,Nones K,Steen JA,Matigian NA,Wood DL,Kassahn KS,Waddell N,Shepherd J,Lee C,Ichikawa J,McKer

    更新日期:2011-12-30 00:00:00

  • A network perspective on the evolution of metabolism by gene duplication.

    abstract:BACKGROUND:Gene duplication followed by divergence is one of the main sources of metabolic versatility. The patchwork and stepwise models of metabolic evolution help us to understand these processes, but their assumptions are relatively simplistic. We used a network-based approach to determine the influence of metaboli...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2007-8-2-r26

    authors: Díaz-Mejía JJ,Pérez-Rueda E,Segovia L

    更新日期:2007-01-01 00:00:00

  • Dating branches on the tree of life using DNA.

    abstract::The use of DNA sequences to estimate the timing of evolutionary events is increasingly popular, although it is fraught with practical difficulties. But the exponential growth of relevant information and improved methods of analysis are providing increasingly reliable sequence-derived dates, and it may become possible ...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2001-3-1-reviews0001

    authors: Wray GA

    更新日期:2002-01-01 00:00:00

  • The draft genome of the C3 panicoid grass species Dichanthelium oligosanthes.

    abstract:BACKGROUND:Comparisons between C3 and C4 grasses often utilize C3 species from the subfamilies Ehrhartoideae or Pooideae and C4 species from the subfamily Panicoideae, two clades that diverged over 50 million years ago. The divergence of the C3 panicoid grass Dichanthelium oligosanthes from the independent C4 lineages ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-016-1080-3

    authors: Studer AJ,Schnable JC,Weissmann S,Kolbe AR,McKain MR,Shao Y,Cousins AB,Kellogg EA,Brutnell TP

    更新日期:2016-10-28 00:00:00

  • Molecular archeology of L1 insertions in the human genome.

    abstract:BACKGROUND:As the rough draft of the human genome sequence nears a finished product and other genome-sequencing projects accumulate sequence data exponentially, bioinformatics is emerging as an important tool for studies of transposon biology. In particular, L1 elements exhibit a variety of sequence structures after in...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2002-3-10-research0052

    authors: Szak ST,Pickeral OK,Makalowski W,Boguski MS,Landsman D,Boeke JD

    更新日期:2002-09-19 00:00:00

  • Modeling double strand break susceptibility to interrogate structural variation in cancer.

    abstract:BACKGROUND:Structural variants (SVs) are known to play important roles in a variety of cancers, but their origins and functional consequences are still poorly understood. Many SVs are thought to emerge from errors in the repair processes following DNA double strand breaks (DSBs). RESULTS:We used experimentally quantif...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-019-1635-1

    authors: Ballinger TJ,Bouwman BAM,Mirzazadeh R,Garnerone S,Crosetto N,Semple CA

    更新日期:2019-02-08 00:00:00

  • The first aurochs genome reveals the breeding history of British and European cattle.

    abstract::The first genome sequence of the extinct European wild aurochs reveals the genetic foundation of native British and Irish landraces of cattle.See related Research article: www.dx.doi.org/10.1186/s13059-015-0790-2. ...

    journal_title:Genome biology

    pub_type: 评论,杂志文章

    doi:10.1186/s13059-015-0793-z

    authors: Orlando L

    更新日期:2015-10-26 00:00:00

  • Reproducible inference of transcription factor footprints in ATAC-seq and DNase-seq datasets using protocol-specific bias modeling.

    abstract:BACKGROUND:DNase-seq and ATAC-seq are broadly used methods to assay open chromatin regions genome-wide. The single nucleotide resolution of DNase-seq has been further exploited to infer transcription factor binding sites (TFBSs) in regulatory regions through footprinting. Recent studies have demonstrated the sequence b...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-019-1654-y

    authors: Karabacak Calviello A,Hirsekorn A,Wurmus R,Yusuf D,Ohler U

    更新日期:2019-02-21 00:00:00

  • Reduced intrinsic DNA curvature leads to increased mutation rate.

    abstract:BACKGROUND:Mutation rates vary across the genome. Many trans factors that influence mutation rates have been identified, as have specific sequence motifs at the 1-7-bp scale, but cis elements remain poorly characterized. The lack of understanding regarding why different sequences have different mutation rates hampers o...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-018-1525-y

    authors: Duan C,Huan Q,Chen X,Wu S,Carey LB,He X,Qian W

    更新日期:2018-09-14 00:00:00

  • Functions, structure, and read-through alternative splicing of feline APOBEC3 genes.

    abstract:BACKGROUND:Over the past years a variety of host restriction genes have been identified in human and mammals that modulate retrovirus infectivity, replication, assembly, and/or cross-species transmission. Among these host-encoded restriction factors, the APOBEC3 (A3; apolipoprotein B mRNA-editing catalytic polypeptide ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2008-9-3-r48

    authors: Münk C,Beck T,Zielonka J,Hotz-Wagenblatt A,Chareza S,Battenberg M,Thielebein J,Cichutek K,Bravo IG,O'Brien SJ,Löchelt M,Yuhki N

    更新日期:2008-01-01 00:00:00

  • Survey of human mitochondrial diseases using new genomic/proteomic tools.

    abstract:BACKGROUND:We have constructed Bayesian prior-based, amino-acid sequence profiles for the complete yeast mitochondrial proteome and used them to develop methods for identifying and characterizing the context of protein mutations that give rise to human mitochondrial diseases. (Bayesian priors are conditional probabilit...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2001-2-6-research0021

    authors: Plasterer TN,Smith TF,Mohr SC

    更新日期:2001-01-01 00:00:00

  • A showcase of future plant biology: moving towards next-generation plant genetics assisted by genome sequencing and systems biology.

    abstract::A report on the Cold Spring Harbor Asia conference on Genome Assisted Biology of Crops and Model Plant Systems Meeting, held in Suzhou, China, April 21-25, 2014. ...

    journal_title:Genome biology

    pub_type:

    doi:10.1186/gb4176

    authors: Lee I

    更新日期:2014-05-23 00:00:00

  • Exploiting microarrays to reveal differential gene expression in the nervous system.

    abstract::Microarrays have been used in a wide variety of experimental systems, but realizing their full potential is contingent on sophisticated and rigorous experimental design and data analysis. This article highlights what is needed to get the most out of microarrays in terms of accurately and effectively revealing differen...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2003-4-2-105

    authors: Griffin RS,Mills CD,Costigan M,Woolf CJ

    更新日期:2003-01-01 00:00:00

  • Going beyond genetics to discover cancer targets.

    abstract::Two recent studies demonstrate the power of integrating tumor genotype information with epigenetic and proteomic studies to discover potential therapeutic targets in breast cancer. ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-017-1238-7

    authors: Sandoval GJ,Hahn WC

    更新日期:2017-05-22 00:00:00

  • Clustered CTCF binding is an evolutionary mechanism to maintain topologically associating domains.

    abstract:BACKGROUND:CTCF binding contributes to the establishment of a higher-order genome structure by demarcating the boundaries of large-scale topologically associating domains (TADs). However, despite the importance and conservation of TADs, the role of CTCF binding in their evolution and stability remains elusive. RESULTS...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-019-1894-x

    authors: Kentepozidou E,Aitken SJ,Feig C,Stefflova K,Ibarra-Soria X,Odom DT,Roller M,Flicek P

    更新日期:2020-01-07 00:00:00

  • Transposable elements in the mammalian embryo: pioneers surviving through stealth and service.

    abstract::Transposable elements (TEs) are notable drivers of genetic innovation. Over evolutionary time, TE insertions can supply new promoter, enhancer, and insulator elements to protein-coding genes and establish novel, species-specific gene regulatory networks. Conversely, ongoing TE-driven insertional mutagenesis, nonhomolo...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/s13059-016-0965-5

    authors: Gerdes P,Richardson SR,Mager DL,Faulkner GJ

    更新日期:2016-05-09 00:00:00

  • Transcriptional profiling of long non-coding RNAs and novel transcribed regions across a diverse panel of archived human cancers.

    abstract:BACKGROUND:Molecular characterization of tumors has been critical for identifying important genes in cancer biology and for improving tumor classification and diagnosis. Long non-coding RNAs, as a new, relatively unstudied class of transcripts, provide a rich opportunity to identify both functional drivers and cancer-t...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2012-13-8-r75

    authors: Brunner AL,Beck AH,Edris B,Sweeney RT,Zhu SX,Li R,Montgomery K,Varma S,Gilks T,Guo X,Foley JW,Witten DM,Giacomini CP,Flynn RA,Pollack JR,Tibshirani R,Chang HY,van de Rijn M,West RB

    更新日期:2012-08-28 00:00:00

  • The Adult Mouse Anatomical Dictionary: a tool for annotating and integrating data.

    abstract::We have developed an ontology to provide standardized nomenclature for anatomical terms in the postnatal mouse. The Adult Mouse Anatomical Dictionary is structured as a directed acyclic graph, and is organized hierarchically both spatially and functionally. The ontology will be used to annotate and integrate different...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2005-6-3-r29

    authors: Hayamizu TF,Mangan M,Corradi JP,Kadin JA,Ringwald M

    更新日期:2005-01-01 00:00:00