Replicate exome-sequencing in a multiple-generation family: improved interpretation of next-generation sequencing data.

Abstract:

BACKGROUND: Whole-exome sequencing (WES) is rapidly evolving into a tool of choice for rapid and inexpensive identification of molecular genetic lesions within targeted regions of the human genome. While biases in WES coverage of nucleotides in targeted regions are recognized, it is not well understood how repetition of WES improves the interpretation of sequencing results in a clinical diagnostic setting.
METHOD: To address this, we compared independently generated exome captures of six individuals from three generations, each sequenced in triplicate. This generated 48x-86x mean target depth of high-quality mapped bases (>Q20) for each technical replicate library. Cumulatively, we achieved 179x-208x average target coverage for each individual in the pedigree. Using this experimental design, we evaluated stochastics in WES interpretation, genotyping sensitivity, and accuracy in detecting de novo variants.
RESULTS: We show that repetition of WES improved the interpretation of the capture target regions after aggregating the data (93.5-93.9 %). Compared to 81.2-89.6 % (50.2-55.4 Mb of 61.7 Mb) coverage of targeted bases at ≥20x in the individual technical replicates, the aggregated data covered 93.5-93.9 % of targeted bases (57.7-58.0 Mb of 61.7 Mb) at the ≥20x threshold, suggesting a 4.3-12.7 % improvement in coverage. Each individual's aggregate dataset recovered 3.4-6.4 million bases within variable targeted regions. We uncovered technical variability (2-5 %) inherent to the WES technique. We also show improved interpretation of clinically important regions that lack interpretation under current conditions, affecting 12-16 of the 56 genes recommended for secondary analysis by the American College of Medical Genetics (ACMG). We demonstrate that comparing technical replicate WES datasets and their derived aggregate data can effectively address overall WES genotyping discrepancies.
CONCLUSION: We describe a method to evaluate the reproducibility and stochastics in exome library preparation, and delineate the advantages of aggregating the data derived from technical replicates. The implications of this study are directly applicable to improved experimental design and provide an opportunity to rapidly, efficiently, and accurately arrive at reliable candidate nucleotide variants.
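
The coverage comparison in the abstract (fraction of targeted bases at ≥20x per replicate versus in the aggregate) can be illustrated with a minimal sketch. It assumes that aggregating technical replicates amounts to summing per-base depth across replicates, which ignores duplicate removal and any re-filtering the authors may have applied. The function names and the toy depth values are hypothetical and are not taken from the study.

```python
from typing import List, Sequence


def fraction_at_depth(depths: Sequence[int], threshold: int = 20) -> float:
    """Return the fraction of target positions with depth >= threshold."""
    if not depths:
        raise ValueError("depths must be non-empty")
    return sum(d >= threshold for d in depths) / len(depths)


def aggregate_depths(replicates: Sequence[Sequence[int]]) -> List[int]:
    """Sum per-base depth position-wise across technical replicates.

    Assumption: pooled depth is approximated by the simple sum of the
    replicate depths at each target position.
    """
    if len({len(r) for r in replicates}) != 1:
        raise ValueError("all replicates must cover the same target positions")
    return [sum(column) for column in zip(*replicates)]


# Toy per-base depths for 10 target positions in 3 replicates (invented data).
rep1 = [25, 18, 30, 5, 22, 19, 40, 8, 21, 12]
rep2 = [22, 21, 28, 7, 15, 23, 35, 6, 19, 10]
rep3 = [20, 16, 33, 4, 24, 17, 38, 9, 25, 3]

aggregate = aggregate_depths([rep1, rep2, rep3])

for name, depths in [("rep1", rep1), ("rep2", rep2), ("rep3", rep3),
                     ("aggregate", aggregate)]:
    print(f"{name}: {fraction_at_depth(depths):.0%} of positions at >=20x")
```

In this toy data each replicate alone reaches ≥20x at half of the positions, while the summed depth reaches it at nine of ten. This is the same kind of gain the abstract reports at exome scale, although the real magnitude depends on the capture kit and sequencing depth.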

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Cherukuri PF,Maduro V,Fuentes-Fajardo KV,Lam K,NISC Comparative Sequencing Program,Adams DR,Tifft CJ,Mullikin JC,Gahl WA,Boerkoel CF

doi

10.1186/s12864-015-2107-y

subject

Has Abstract

pub_date

2015-11-25 00:00:00

pages

998

issn

1471-2164

pii

10.1186/s12864-015-2107-y

journal_volume

16

pub_type

Journal Article
  • Genome sequence of an aflatoxigenic pathogen of Argentinian peanut, Aspergillus arachidicola.

    abstract:BACKGROUND:Aspergillus arachidicola is an aflatoxigenic fungal species, first isolated from the leaves of a wild peanut species native to Argentina. It has since been reported in maize, Brazil nut and human sputum samples. This aflatoxigenic species is capable of secreting both B and G aflatoxins, similar to A. parasit...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-4576-2

    authors: Moore GG,Mack BM,Beltz SB,Puel O

    update_date:2018-03-09 00:00:00

  • Staphylococci phages display vast genomic diversity and evolutionary relationships.

    abstract:BACKGROUND:Bacteriophages are the most abundant and diverse entities in the biosphere, and this diversity is driven by constant predator-prey evolutionary dynamics and horizontal gene transfer. Phage genome sequences are under-sampled and therefore present an untapped and uncharacterized source of genetic diversity, ty...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5647-8

    authors: Oliveira H,Sampaio M,Melo LDR,Dias O,Pope WH,Hatfull GF,Azeredo J

    update_date:2019-05-09 00:00:00

  • A genomic perspective on the potential of Actinobacillus succinogenes for industrial succinate production.

    abstract:BACKGROUND:Succinate is produced petrochemically from maleic anhydride to satisfy a small specialty chemical market. If succinate could be produced fermentatively at a price competitive with that of maleic anhydride, though, it could replace maleic anhydride as the precursor of many bulk chemicals, transforming a multi...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-680

    authors: McKinlay JB,Laivenieks M,Schindler BD,McKinlay AA,Siddaramappa S,Challacombe JF,Lowry SR,Clum A,Lapidus AL,Burkhart KB,Harkins V,Vieille C

    update_date:2010-11-30 00:00:00

  • Monophyly of clade III nematodes is not supported by phylogenetic analysis of complete mitochondrial genome sequences.

    abstract:BACKGROUND:The orders Ascaridida, Oxyurida, and Spirurida represent major components of zooparasitic nematode diversity, including many species of veterinary and medical importance. Phylum-wide nematode phylogenetic hypotheses have mainly been based on nuclear rDNA sequences, but more recently complete mitochondrial (m...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-392

    authors: Park JK,Sultana T,Lee SH,Kang S,Kim HK,Min GS,Eom KS,Nadler SA

    update_date:2011-08-03 00:00:00

  • Genome-wide analyses of Epstein-Barr virus reveal conserved RNA structures and a novel stable intronic sequence RNA.

    abstract:BACKGROUND:Epstein-Barr virus (EBV) is a human herpesvirus implicated in cancer and autoimmune disorders. Little is known concerning the roles of RNA structure in this important human pathogen. This study provides the first comprehensive genome-wide survey of RNA and RNA structure in EBV. RESULTS:Novel EBV RNAs and RN...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-543

    authors: Moss WN,Steitz JA

    update_date:2013-08-09 00:00:00

  • Speckle reducing bilateral filter for cattle follicle segmentation.

    abstract:BACKGROUND:Ultrasound imaging technology has wide applications in cattle reproduction and has been used to monitor individual follicles and determine the patterns of follicular development. However, the speckles in ultrasound images affect the post-processing, such as follicle segmentation and finally affect the measur...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-S2-S9

    authors: Tang J,Guo S,Sun Q,Deng Y,Zhou D

    update_date:2010-11-02 00:00:00

  • Hyper-expansion of large DNA segments in the genome of kuruma shrimp, Marsupenaeus japonicus.

    abstract:BACKGROUND:Higher crustaceans (class Malacostraca) represent the most species-rich and morphologically diverse group of non-insect arthropods and many of its members are commercially important. Although the crustacean DNA sequence information is growing exponentially, little is known about the genome organization of Ma...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-141

    authors: Koyama T,Asakawa S,Katagiri T,Shimizu A,Fagutao FF,Mavichak R,Santos MD,Fuji K,Sakamoto T,Kitakado T,Kondo H,Shimizu N,Aoki T,Hirono I

    update_date:2010-02-26 00:00:00

  • Analysis of salivary transcripts and antigens of the sand fly Phlebotomus arabicus.

    abstract:BACKGROUND:Sand fly saliva plays an important role in blood feeding and Leishmania transmission as it was shown to increase parasite virulence. On the other hand, immunity to salivary components impedes the establishment of infection. Therefore, it is most desirable to gain a deeper insight into the composition of sali...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-282

    authors: Hostomská J,Volfová V,Mu J,Garfield M,Rohousová I,Volf P,Valenzuela JG,Jochim RC

    update_date:2009-06-25 00:00:00

  • Genomes of Helicobacter pylori from native Peruvians suggest admixture of ancestral and modern lineages and reveal a western type cag-pathogenicity island.

    abstract:BACKGROUND:Helicobacter pylori is presumed to be co-evolved with its human host and is a highly diverse gastric pathogen at genetic levels. Ancient origins of H. pylori in the New World are still debatable. It is not clear how different waves of human migrations in South America contributed to the evolution of strain d...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-7-191

    authors: Devi SM,Ahmed I,Khan AA,Rahman SA,Alvi A,Sechi LA,Ahmed N

    update_date:2006-07-27 00:00:00

  • Association of the matrix attachment region recognition signature with coding regions in Caenorhabditis elegans.

    abstract:BACKGROUND:Matrix attachment regions (MAR) are the sites on genomic DNA that interact with the nuclear matrix. There is increasing evidence for the involvement of MAR in regulation of gene expression. The unsuitability of experimental detection of MAR for genome-wide analyses has led to the development of computational...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-8-418

    authors: Anthony A,Blaxter M

    update_date:2007-11-15 00:00:00

  • Generation and analysis of expression sequence tags from haustoria of the wheat stripe rust fungus Puccinia striiformis f. sp. tritici.

    abstract:BACKGROUND:Stripe rust, caused by Puccinia striiformis f. sp. tritici (Pst), is one of the most destructive diseases of wheat (Triticum aestivum L.) worldwide. In spite of its agricultural importance, the genomics and genetics of the pathogen are poorly characterized. Pst transcripts from urediniospores and germinated ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-626

    authors: Yin C,Chen X,Wang X,Han Q,Kang Z,Hulbert SH

    update_date:2009-12-23 00:00:00

  • Population structure of Apodemus flavicollis and comparison to Apodemus sylvaticus in northern Poland based on RAD-seq.

    abstract:BACKGROUND:Mice of the genus Apodemus are one of the most common mammals in the Palaearctic region. Despite their broad range and long history of ecological observations, there are no whole-genome data available for Apodemus, hindering our ability to further exploit the genus in evolutionary and ecological genomics contex...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-6603-3

    authors: Martin Cerezo ML,Kucka M,Zub K,Chan YF,Bryk J

    update_date:2020-03-18 00:00:00

  • The genome of the emerging barley pathogen Ramularia collo-cygni.

    abstract:BACKGROUND:Ramularia collo-cygni is a newly important, foliar fungal pathogen of barley that causes the disease Ramularia leaf spot. The fungus exhibits a prolonged endophytic growth stage before switching life habit to become an aggressive, necrotrophic pathogen that causes significant losses to green leaf area and he...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2928-3

    authors: McGrann GR,Andongabo A,Sjökvist E,Trivedi U,Dussart F,Kaczmarek M,Mackenzie A,Fountaine JM,Taylor JM,Paterson LJ,Gorniak K,Burnett F,Kanyuka K,Hammond-Kosack KE,Rudd JJ,Blaxter M,Havis ND

    update_date:2016-08-09 00:00:00

  • Population-based rare variant detection via pooled exome or custom hybridization capture with or without individual indexing.

    abstract:BACKGROUND:Rare genetic variation in the human population is a major source of pathophysiological variability and has been implicated in a host of complex phenotypes and diseases. Finding disease-related genes harboring disparate functional rare variants requires sequencing of many individuals across many genomic regio...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-683

    authors: Ramos E,Levinson BT,Chasnoff S,Hughes A,Young AL,Thornton K,Li A,Vallania FL,Province M,Druley TE

    update_date:2012-12-06 00:00:00

  • Computational discovery and RT-PCR validation of novel Burkholderia conserved and Burkholderia pseudomallei unique sRNAs.

    abstract:BACKGROUND:The sRNAs of bacterial pathogens are known to be involved in various cellular roles including environmental adaptation as well as regulation of virulence and pathogenicity. It is expected that sRNAs may also have similar functions for Burkholderia pseudomallei, a soil bacterium that can adapt to diverse envi...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-S7-S13

    authors: Khoo JS,Chai SF,Mohamed R,Nathan S,Firdaus-Raih M

    update_date:2012-01-01 00:00:00

  • Expanding dynamics of the virulence-related gene variations in the toxigenic Vibrio cholerae serogroup O1.

    abstract:BACKGROUND:Toxigenic Vibrio cholerae serogroup O1 is the causative pathogen in the sixth and seventh cholera pandemics. Cholera toxin is the major virulence factor but other virulence and virulence-related factors play certain roles in the pathogenesis and survival in the host. Along with the evolution of the epidemic s...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5725-y

    authors: Li Z,Pang B,Wang D,Li J,Xu J,Fang Y,Lu X,Kan B

    update_date:2019-05-09 00:00:00

  • An analysis of the transcriptome of Teladorsagia circumcincta: its biological and biotechnological implications.

    abstract:BACKGROUND:Teladorsagia circumcincta (order Strongylida) is an economically important parasitic nematode of small ruminants (including sheep and goats) in temperate climatic regions of the world. Improved insights into the molecular biology of this parasite could underpin alternative methods required to control this an...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-S7-S10

    authors: Menon R,Gasser RB,Mitreva M,Ranganathan S

    update_date:2012-01-01 00:00:00

  • Global analyses of Ceratocystis cacaofunesta mitochondria: from genome to proteome.

    abstract:BACKGROUND:The ascomycete fungus Ceratocystis cacaofunesta is the causal agent of wilt disease in cacao, which results in significant economic losses in the affected producing areas. Despite the economic importance of the Ceratocystis complex of species, no genomic data are available for any of its members. Given that ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-91

    authors: Ambrosio AB,do Nascimento LC,Oliveira BV,Teixeira PJ,Tiburcio RA,Toledo Thomazella DP,Leme AF,Carazzolle MF,Vidal RO,Mieczkowski P,Meinhardt LW,Pereira GA,Cabrera OG

    update_date:2013-02-11 00:00:00

  • Generation of a reference transcriptome for evaluating rainbow trout responses to various stressors.

    abstract:BACKGROUND:Fish under intensive culture conditions are exposed to a variety of acute and chronic stressors, including high rearing densities, sub-optimal water quality, and severe thermal fluctuations. Such stressors are inherent in aquaculture production and can induce physiological responses with adverse effects on t...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-626

    authors: Sánchez CC,Weber GM,Gao G,Cleveland BM,Yao J,Rexroad CE 3rd

    update_date:2011-12-21 00:00:00

  • Efficient calculation of exact probability distributions of integer features on RNA secondary structures.

    abstract:BACKGROUND:Although the need for analyses of secondary structures of RNAs is increasing, prediction of the secondary structures of RNAs is not always reliable. Because an RNA may have a complicated energy landscape, comprehensive representations of the whole ensemble of the secondary structures, such as the probabil...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-S10-S6

    authors: Mori R,Hamada M,Asai K

    update_date:2014-01-01 00:00:00

  • Evolutionary origin of regulatory regions of retrogenes in Drosophila.

    abstract:BACKGROUND:Retrogenes are processed copies of other genes. This duplication mechanism produces a copy of the parental gene that should not contain introns, and usually does not contain cis-regulatory regions. Here, we computationally address the evolutionary origin of promoter and other cis-regulatory regions in retrog...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-241

    authors: Bai Y,Casola C,Betrán E

    update_date:2008-05-22 00:00:00

  • Complete haplotype phasing of the MHC and KIR loci with targeted HaploSeq.

    abstract:BACKGROUND:The MHC and KIR loci are clinically relevant regions of the genome. Typing the sequence of these loci has a wide range of applications including organ transplantation, drug discovery, pharmacogenomics and furthering fundamental research in immune genetics. Rapid advances in biochemical and next-generation se...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1949-7

    authors: Selvaraj S,Schmitt AD,Dixon JR,Ren B

    update_date:2015-11-05 00:00:00

  • Comparative transcriptomics provide insight into the morphogenesis and evolution of fistular leaves in Allium.

    abstract:BACKGROUND:Fistular leaves frequently appear in Allium species, and previous developmental studies have proposed that the process of fistular leaf formation involves programmed cell death. However, molecular evidence for the role of programmed cell death in the formation of fistular leaf cavities has yet to be reported...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-3474-8

    authors: Zhu S,Tang S,Tan Z,Yu Y,Dai Q,Liu T

    update_date:2017-01-10 00:00:00

  • The intestinal microbiome of fish under starvation.

    abstract:BACKGROUND:Starvation not only affects the nutritional and health status of the animals, but also the microbial composition in the host's intestine. Next-generation sequencing provides a unique opportunity to explore gut microbial communities and their interactions with hosts. However, studies on gut microbiomes have b...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-266

    authors: Xia JH,Lin G,Fu GH,Wan ZY,Lee M,Wang L,Liu XJ,Yue GH

    update_date:2014-04-05 00:00:00

  • Hsf and Hsp gene families in Populus: genome-wide identification, organization and correlated expression during development and in stress responses.

    abstract:BACKGROUND:Heat shock proteins (Hsps) are molecular chaperones that are involved in many normal cellular processes and stress responses, and heat shock factors (Hsfs) are the transcriptional activators of Hsps. Hsfs and Hsps are widely coordinated in various biological processes. Although the roles of Hsfs and Hsps in ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1398-3

    authors: Zhang J,Liu B,Li J,Zhang L,Wang Y,Zheng H,Lu M,Chen J

    update_date:2015-03-14 00:00:00

  • Deep sequencing of the uterine immune response to bacteria during the equine oestrous cycle.

    abstract:BACKGROUND:The steroid hormone environment in healthy horses seems to have a significant impact on the efficiency of their uterine immune response. The objective of this study was to characterize the changes in gene expression in the equine endometrium in response to the introduction of bacterial pathogens and the infl...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-2139-3

    authors: Marth CD,Young ND,Glenton LY,Noden DM,Browning GF,Krekeler N

    update_date:2015-11-14 00:00:00

  • HLA-VBSeq: accurate HLA typing at full resolution from whole-genome sequencing data.

    abstract:BACKGROUND:Human leucocyte antigen (HLA) genes play an important role in determining the outcome of organ transplantation and are linked to many human diseases. Because of the diversity and polymorphisms of HLA loci, HLA typing at high resolution is challenging even with whole-genome sequencing data. RESULTS:We have d...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-16-S2-S7

    authors: Nariai N,Kojima K,Saito S,Mimori T,Sato Y,Kawai Y,Yamaguchi-Kabata Y,Yasuda J,Nagasaki M

    update_date:2015-01-01 00:00:00

  • Quantitative phosphoproteomic profiling of fiber differentiation and initiation in a fiberless mutant of cotton.

    abstract:BACKGROUND:The cotton (Gossypium spp.) fiber cell is an important unicellular model for studying cell differentiation. There is evidence suggesting that phosphorylation is a critical post-translational modification involved in regulation of a wide range of cell activities. Nevertheless, the sites of phosphorylation in ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-466

    authors: Ma Q,Wu M,Pei W,Li H,Li X,Zhang J,Yu J,Yu S

    update_date:2014-06-12 00:00:00

  • A comprehensive evaluation of Ensembl, RefSeq, and UCSC annotations in the context of RNA-seq read mapping and gene quantification.

    abstract:BACKGROUND:RNA-Seq has become increasingly popular in transcriptome profiling. One aspect of transcriptome research is to quantify the expression levels of genomic elements, such as genes, their transcripts and exons. Acquiring a transcriptome expression profile requires genomic elements to be defined in the context of...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1308-8

    authors: Zhao S,Zhang B

    update_date:2015-02-18 00:00:00

  • Skipper genome sheds light on unique phenotypic traits and phylogeny.

    abstract:BACKGROUND:Butterflies and moths are emerging as model organisms in genetics and evolutionary studies. The family Hesperiidae (skippers) was traditionally viewed as a sister to other butterflies based on its moth-like morphology and darting flight habits with fast wing beats. However, DNA studies suggest that the famil...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1846-0

    authors: Cong Q,Borek D,Otwinowski Z,Grishin NV

    update_date:2015-08-27 00:00:00