Population-based rare variant detection via pooled exome or custom hybridization capture with or without individual indexing.

Abstract:

BACKGROUND:Rare genetic variation in the human population is a major source of pathophysiological variability and has been implicated in a host of complex phenotypes and diseases. Finding disease-related genes harboring disparate functional rare variants requires sequencing of many individuals across many genomic regions and comparing against unaffected cohorts. However, despite persistent declines in sequencing costs, population-based rare variant detection across large genomic target regions remains cost prohibitive for most investigators. In addition, DNA samples are often precious and hybridization methods typically require large amounts of input DNA. Pooled sample DNA sequencing is a cost and time-efficient strategy for surveying populations of individuals for rare variants. We set out to 1) create a scalable, multiplexing method for custom capture with or without individual DNA indexing that was amenable to low amounts of input DNA and 2) expand the functionality of the SPLINTER algorithm for calling substitutions, insertions and deletions across either candidate genes or the entire exome by integrating the variant calling algorithm with the dynamic programming aligner, Novoalign. RESULTS:We report methodology for pooled hybridization capture with pre-enrichment, indexed multiplexing of up to 48 individuals or non-indexed pooled sequencing of up to 92 individuals with as little as 70 ng of DNA per person. Modified solid phase reversible immobilization bead purification strategies enable no sample transfers from sonication in 96-well plates through adapter ligation, resulting in 50% less library preparation reagent consumption. Custom Y-shaped adapters containing novel 7 base pair index sequences with a Hamming distance of ≥2 were directly ligated onto fragmented source DNA eliminating the need for PCR to incorporate indexes, and was followed by a custom blocking strategy using a single oligonucleotide regardless of index sequence. These results were obtained aligning raw reads against the entire genome using Novoalign followed by variant calling of non-indexed pools using SPLINTER or SAMtools for indexed samples. With these pipelines, we find sensitivity and specificity of 99.4% and 99.7% for pooled exome sequencing. Sensitivity, and to a lesser degree specificity, proved to be a function of coverage. For rare variants (≤2% minor allele frequency), we achieved sensitivity and specificity of ≥94.9% and ≥99.99% for custom capture of 2.5 Mb in multiplexed libraries of 22-48 individuals with only ≥5-fold coverage/chromosome, but these parameters improved to ≥98.7 and 100% with 20-fold coverage/chromosome. CONCLUSIONS:This highly scalable methodology enables accurate rare variant detection, with or without individual DNA sample indexing, while reducing the amount of required source DNA and total costs through less hybridization reagent consumption, multi-sample sonication in a standard PCR plate, multiplexed pre-enrichment pooling with a single hybridization and lesser sequencing coverage required to obtain high sensitivity.

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Ramos E,Levinson BT,Chasnoff S,Hughes A,Young AL,Thornton K,Li A,Vallania FL,Province M,Druley TE

doi

10.1186/1471-2164-13-683

subject

Has Abstract

pub_date

2012-12-06 00:00:00

pages

683

issn

1471-2164

pii

1471-2164-13-683

journal_volume

13

pub_type

杂志文章
  • High-throughput cis-regulatory element discovery in the vector mosquito Aedes aegypti.

    abstract:BACKGROUND:Despite substantial progress in mosquito genomic and genetic research, few cis-regulatory elements (CREs), DNA sequences that control gene expression, have been identified in mosquitoes or other non-model insects. Formaldehyde-assisted isolation of regulatory elements paired with DNA sequencing, FAIRE-seq, i...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2468-x

    authors: Behura SK,Sarro J,Li P,Mysore K,Severson DW,Emrich SJ,Duman-Scheel M

    更新日期:2016-05-10 00:00:00

  • Genome-wide analysis of consistently RNA edited sites in human blood reveals interactions with mRNA processing genes and suggests correlations with cell types and biological variables.

    abstract:BACKGROUND:A-to-I RNA editing is a co-/post-transcriptional modification catalyzed by ADAR enzymes, that deaminates Adenosines (A) into Inosines (I). Most of known editing events are located within inverted ALU repeats, but they also occur in coding sequences and may alter the function of encoded proteins. RNA editing ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-5364-8

    authors: Giacopuzzi E,Gennarelli M,Sacco C,Filippini A,Mingardi J,Magri C,Barbon A

    更新日期:2018-12-27 00:00:00

  • Transcriptome profiling of two maize inbreds with distinct responses to Gibberella ear rot disease to identify candidate resistance genes.

    abstract:BACKGROUND:Gibberella ear rot (GER) is one of the most economically important fungal diseases of maize in the temperate zone due to moldy grain contaminated with health threatening mycotoxins. To develop resistant genotypes and control the disease, understanding the host-pathogen interaction is essential. RESULTS:RNA-...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-4513-4

    authors: Kebede AZ,Johnston A,Schneiderman D,Bosnich W,Harris LJ

    更新日期:2018-02-09 00:00:00

  • Development of the first marmoset-specific DNA microarray (EUMAMA): a new genetic tool for large-scale expression profiling in a non-human primate.

    abstract:BACKGROUND:The common marmoset monkey (Callithrix jacchus), a small non-endangered New World primate native to eastern Brazil, is becoming increasingly used as a non-human primate model in biomedical research, drug development and safety assessment. In contrast to the growing interest for the marmoset as an animal mode...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-8-190

    authors: Datson NA,Morsink MC,Atanasova S,Armstrong VW,Zischler H,Schlumbohm C,Dutilh BE,Huynen MA,Waegele B,Ruepp A,de Kloet ER,Fuchs E

    更新日期:2007-06-25 00:00:00

  • Cloning and characterization of the mouse Mcoln1 gene reveals an alternatively spliced transcript not seen in humans.

    abstract:BACKGROUND:Mucolipidosis type IV (MLIV) is an autosomal recessive lysosomal storage disorder characterized by severe neurologic and ophthalmologic abnormalities. Recently the MLIV gene, MCOLN1, has been identified as a new member of the transient receptor potential (TRP) cation channel superfamily. Here we report the c...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-3-3

    authors: Falardeau JL,Kennedy JC,Acierno JS Jr,Sun M,Stahl S,Goldin E,Slaugenhaupt SA

    更新日期:2002-01-01 00:00:00

  • Quantitative analysis of chromatin interaction changes upon a 4.3 Mb deletion at mouse 4E2.

    abstract:BACKGROUND:Circular chromosome conformation capture (4C) has provided important insights into three dimensional (3D) genome organization and its critical impact on the regulation of gene expression. We developed a new quantitative framework based on polymer physics for the analysis of paired-end sequencing 4C (PE-4Cseq...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-2137-5

    authors: Zepeda-Mendoza CJ,Mukhopadhyay S,Wong ES,Harder N,Splinter E,de Wit E,Eckersley-Maslin MA,Ried T,Eils R,Rohr K,Mills A,de Laat W,Flicek P,Sengupta AM,Spector DL

    更新日期:2015-11-21 00:00:00

  • Unlocking the mystery of the hard-to-sequence phage genome: PaP1 methylome and bacterial immunity.

    abstract:BACKGROUND:Whole-genome sequencing is an important method to understand the genetic information, gene function, biological characteristics and survival mechanisms of organisms. Sequencing large genomes is very simple at present. However, we encountered a hard-to-sequence genome of Pseudomonas aeruginosa phage PaP1. Sho...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-803

    authors: Lu S,Le S,Tan Y,Li M,Liu C,Zhang K,Huang J,Chen H,Rao X,Zhu J,Zou L,Ni Q,Li S,Wang J,Jin X,Hu Q,Yao X,Zhao X,Zhang L,Huang G,Hu F

    更新日期:2014-09-19 00:00:00

  • LtrDetector: A tool-suite for detecting long terminal repeat retrotransposons de-novo.

    abstract:BACKGROUND:Long terminal repeat retrotransposons are the most abundant transposons in plants. They play important roles in alternative splicing, recombination, gene regulation, and defense mechanisms. Large-scale sequencing projects for plant genomes are currently underway. Software tools are important for annotating l...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5796-9

    authors: Valencia JD,Girgis HZ

    更新日期:2019-06-03 00:00:00

  • Annotation and expression of carboxylesterases in the silkworm, Bombyx mori.

    abstract:BACKGROUND:Carboxylesterase is a multifunctional superfamily and ubiquitous in all living organisms, including animals, plants, insects, and microbes. It plays important roles in xenobiotic detoxification, and pheromone degradation, neurogenesis and regulating development. Previous studies mainly used Dipteran Drosophi...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-10-553

    authors: Yu QY,Lu C,Li WL,Xiang ZH,Zhang Z

    更新日期:2009-11-24 00:00:00

  • TSCC: Two-Stage Combinatorial Clustering for virtual screening using protein-ligand interactions and physicochemical features.

    abstract:BACKGROUND:The increasing numbers of 3D compounds and protein complexes stored in databases contribute greatly to current advances in biotechnology, being employed in several pharmaceutical and industrial applications. However, screening and retrieving appropriate candidates as well as handling false positives presents...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-S4-S26

    authors: Clinciu DL,Chen YF,Ko CN,Lo CC,Yang JM

    更新日期:2010-12-02 00:00:00

  • L-lactate induces specific genome wide alterations of gene expression in cultured bovine granulosa cells.

    abstract:BACKGROUND:Previously, we could show that L-lactate affects cultured bovine granulosa cells (GC) in a specific manner driving the cells into an early pre-ovulatory phenotype. Here we studied genome wide effects in L-lactate-treated GC to further elucidate the underlying mechanisms that are responsible for the L-lactate...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5657-6

    authors: Baufeld A,Koczan D,Vanselow J

    更新日期:2019-04-05 00:00:00

  • Transcriptional insights into key genes and pathways controlling muscle lipid metabolism in broiler chickens.

    abstract:BACKGROUND:Intramuscular fat (IMF) is one of the most important factors positively associated with meat quality. Triglycerides (TGs), as the main component of IMF, play an essential role in muscle lipid metabolism. This transcriptome analysis of pectoralis muscle tissue aimed to identify functional genes and biological...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-6221-0

    authors: Liu L,Liu X,Cui H,Liu R,Zhao G,Wen J

    更新日期:2019-11-15 00:00:00

  • Quantitative proteomic analysis of host--pathogen interactions: a study of Acinetobacter baumannii responses to host airways.

    abstract:BACKGROUND:Acinetobacter baumannii is a major health problem. The most common infection caused by A. baumannii is hospital acquired pneumonia, and the associated mortality rate is approximately 50%. Neither in vivo nor ex vivo expression profiling has been performed at the proteomic or transcriptomic level for pneumoni...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1608-z

    authors: Méndez JA,Mateos J,Beceiro A,Lopez M,Tomás M,Poza M,Bou G

    更新日期:2015-05-30 00:00:00

  • Large scale genome-wide association and LDLA mapping study identifies QTLs for boar taint and related sex steroids.

    abstract:BACKGROUND:Boar taint is observed in a high proportion of uncastrated male pigs and is characterized by an unpleasant odor/flavor in cooked meat, primarily caused by elevated levels of androstenone and skatole. Androstenone is a steroid produced in the testis in parallel with biosynthesis of other sex steroids like tes...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-362

    authors: Grindflek E,Lien S,Hamland H,Hansen MH,Kent M,van Son M,Meuwissen TH

    更新日期:2011-07-13 00:00:00

  • MRCNN: a deep learning model for regression of genome-wide DNA methylation.

    abstract:BACKGROUND:Determination of genome-wide DNA methylation is significant for both basic research and drug development. As a key epigenetic modification, this biochemical process can modulate gene expression to influence the cell differentiation which can possibly lead to cancer. Due to the involuted biochemical mechanism...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5488-5

    authors: Tian Q,Zou J,Tang J,Fang Y,Yu Z,Fan S

    更新日期:2019-04-04 00:00:00

  • Haplotype analysis of sucrose synthase gene family in three Saccharum species.

    abstract:BACKGROUND:Sugarcane is an economically important crop contributing about 80% and 40% to the world sugar and ethanol production, respectively. The complicated genetics consequential to its complex polyploid genome, however, have impeded efforts to improve sugar yield and related important agronomic traits. Modern sugar...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-314

    authors: Zhang J,Arro J,Chen Y,Ming R

    更新日期:2013-05-10 00:00:00

  • Integrative phenotyping framework (iPF): integrative clustering of multiple omics data identifies novel lung disease subphenotypes.

    abstract:BACKGROUND:The increased multi-omics information on carefully phenotyped patients in studies of complex diseases requires novel methods for data integration. Unlike continuous intensity measurements from most omics data sets, phenome data contain clinical variables that are binary, ordinal and categorical. RESULTS:In ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-2170-4

    authors: Kim S,Herazo-Maya JD,Kang DD,Juan-Guardela BM,Tedrow J,Martinez FJ,Sciurba FC,Tseng GC,Kaminski N

    更新日期:2015-11-11 00:00:00

  • Linkage disequilibrium and genome-wide association analysis for anthocyanin pigmentation and fruit color in eggplant.

    abstract:BACKGROUND:The genome-wide association (GWA) approach represents an alternative to biparental linkage mapping for determining the genetic basis of trait variation. Both approaches rely on recombination to re-arrange the genome, and seek to establish correlations between phenotype and genotype. The major advantages of G...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-896

    authors: Cericola F,Portis E,Lanteri S,Toppino L,Barchi L,Acciarri N,Pulcini L,Sala T,Rotino GL

    更新日期:2014-10-14 00:00:00

  • Whole genome resequencing of the Iranian native dogs and wolves to unravel variome during dog domestication.

    abstract:BACKGROUND:Advances in genome technology have simplified a new comprehension of the genetic and historical processes crucial to rapid phenotypic evolution under domestication. To get new insight into the genetic basis of the dog domestication process, we conducted whole-genome sequence analysis of three wolves and thre...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-6619-8

    authors: Amiri Ghanatsaman Z,Wang GD,Asadollahpour Nanaei H,Asadi Fozi M,Peng MS,Esmailizadeh A,Zhang YP

    更新日期:2020-03-04 00:00:00

  • Genetic variation and population structure of maize inbred lines adapted to the mid-altitude sub-humid maize agro-ecology of Ethiopia using single nucleotide polymorphic (SNP) markers.

    abstract:BACKGROUND:Molecular characterization is important for efficient utilization of germplasm and development of improved varieties. In the present study, we investigated the genetic purity, relatedness and population structure of 265 maize inbred lines from the Ethiopian Institute of Agricultural Research (EIAR), the Inte...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-4173-9

    authors: Ertiro BT,Semagn K,Das B,Olsen M,Labuschagne M,Worku M,Wegary D,Azmach G,Ogugo V,Keno T,Abebe B,Chibsa T,Menkir A

    更新日期:2017-10-12 00:00:00

  • Genes optimized by evolution for accurate and fast translation encode in Archaea and Bacteria a broad and characteristic spectrum of protein functions.

    abstract:BACKGROUND:In many microbial genomes, a strong preference for a small number of codons can be observed in genes whose products are needed by the cell in large quantities. This codon usage bias (CUB) improves translational accuracy and speed and is one of several factors optimizing cell growth. Whereas CUB and the overr...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-617

    authors: von Mandach C,Merkl R

    更新日期:2010-11-04 00:00:00

  • Highly expressed captured genes and cross-kingdom domains present in Helitrons create novel diversity in Pleurotus ostreatus and other fungi.

    abstract:BACKGROUND:Helitrons are class-II eukaryotic transposons that transpose via a rolling circle mechanism. Due to their ability to capture and mobilize gene fragments, they play an important role in the evolution of their host genomes. We have used a bioinformatics approach for the identification of helitrons in two Pleur...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-1071

    authors: Castanera R,Pérez G,López L,Sancho R,Santoyo F,Alfaro M,Gabaldón T,Pisabarro AG,Oguiza JA,Ramírez L

    更新日期:2014-12-05 00:00:00

  • Transferring knowledge of bacterial protein interaction networks to predict pathogen targeted human genes and immune signaling pathways: a case study on M. tuberculosis.

    abstract:BACKGROUND:Bacterial invasive infection and host immune response is fundamental to the understanding of pathogen pathogenesis and the discovery of effective therapeutic drugs. However, there are very few experimental studies on the signaling cross-talks between bacteria and human host to date. METHODS:In this work, ta...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-4873-9

    authors: Mei S,Flemington EK,Zhang K

    更新日期:2018-06-28 00:00:00

  • Optimization of cDNA microarrays procedures using criteria that do not rely on external standards.

    abstract:BACKGROUND:The measurement of gene expression using microarray technology is a complicated process in which a large number of factors can be varied. Due to the lack of standard calibration samples such as are used in traditional chemical analysis it may be a problem to evaluate whether changes done to the microarray pr...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-8-377

    authors: Bruland T,Anderssen E,Doseth B,Bergum H,Beisvag V,Laegreid A

    更新日期:2007-10-18 00:00:00

  • Strand bias structure in mouse DNA gives a glimpse of how chromatin structure affects gene expression.

    abstract:BACKGROUND:On a single strand of genomic DNA the number of As is usually about equal to the number of Ts (and similarly for Gs and Cs), but deviations have been noted for transcribed regions and origins of replication. RESULTS:The mouse genome is shown to have a segmented structure defined by strand bias. Transcriptio...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-16

    authors: Evans KJ

    更新日期:2008-01-14 00:00:00

  • Response of the mosquito protein interaction network to dengue infection.

    abstract:BACKGROUND:Two fifths of the world's population is at risk from dengue. The absence of effective drugs and vaccines leaves vector control as the primary intervention tool. Understanding dengue virus (DENV) host interactions is essential for the development of novel control strategies. The availability of genome sequenc...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-380

    authors: Guo X,Xu Y,Bian G,Pike AD,Xie Y,Xi Z

    更新日期:2010-06-16 00:00:00

  • Comparative analysis of Mycobacterium and related Actinomycetes yields insight into the evolution of Mycobacterium tuberculosis pathogenesis.

    abstract:BACKGROUND:The sequence of the pathogen Mycobacterium tuberculosis (Mtb) strain H37Rv has been available for over a decade, but the biology of the pathogen remains poorly understood. Genome sequences from other Mtb strains and closely related bacteria present an opportunity to apply the power of comparative genomics to...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-120

    authors: McGuire AM,Weiner B,Park ST,Wapinski I,Raman S,Dolganov G,Peterson M,Riley R,Zucker J,Abeel T,White J,Sisk P,Stolte C,Koehrsen M,Yamamoto RT,Iacobelli-Martinez M,Kidd MJ,Maer AM,Schoolnik GK,Regev A,Galagan J

    更新日期:2012-03-28 00:00:00

  • Comparative analysis of fungal protein kinases and associated domains.

    abstract:BACKGROUND:Protein phosphorylation is responsible for a large portion of the regulatory functions of eukaryotic cells. Although the list of sequenced genomes of filamentous fungi has grown rapidly, the kinomes of recently sequenced species have not yet been studied in detail. The objective of this study is to apply a c...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-133

    authors: Kosti I,Mandel-Gutfreund Y,Glaser F,Horwitz BA

    更新日期:2010-02-24 00:00:00

  • Identification of prohormones and pituitary neuropeptides in the African cichlid, Astatotilapia burtoni.

    abstract:BACKGROUND:Cichlid fishes have evolved remarkably diverse reproductive, social, and feeding behaviors. Cell-to-cell signaling molecules, notably neuropeptides and peptide hormones, are known to regulate these behaviors across vertebrates. This class of signaling molecules derives from prohormone genes that have undergo...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2914-9

    authors: Hu CK,Southey BR,Romanova EV,Maruska KP,Sweedler JV,Fernald RD

    更新日期:2016-08-19 00:00:00

  • Production and utilization of a high-density oligonucleotide microarray in channel catfish, Ictalurus punctatus.

    abstract:BACKGROUND:Functional analysis of the catfish genome will be useful for the identification of genes controlling traits of economic importance, especially innate disease resistance. However, this species lacks a platform for global gene expression profiling, so we designed a first generation high-density oligonucleotide...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-7-134

    authors: Li RW,Waldbieser GC

    更新日期:2006-06-01 00:00:00