Abstract:
BACKGROUND:Rare genetic variation in the human population is a major source of pathophysiological variability and has been implicated in a host of complex phenotypes and diseases. Finding disease-related genes harboring disparate functional rare variants requires sequencing of many individuals across many genomic regions and comparing against unaffected cohorts. However, despite persistent declines in sequencing costs, population-based rare variant detection across large genomic target regions remains cost prohibitive for most investigators. In addition, DNA samples are often precious and hybridization methods typically require large amounts of input DNA. Pooled sample DNA sequencing is a cost and time-efficient strategy for surveying populations of individuals for rare variants. We set out to 1) create a scalable, multiplexing method for custom capture with or without individual DNA indexing that was amenable to low amounts of input DNA and 2) expand the functionality of the SPLINTER algorithm for calling substitutions, insertions and deletions across either candidate genes or the entire exome by integrating the variant calling algorithm with the dynamic programming aligner, Novoalign. RESULTS:We report methodology for pooled hybridization capture with pre-enrichment, indexed multiplexing of up to 48 individuals or non-indexed pooled sequencing of up to 92 individuals with as little as 70 ng of DNA per person. Modified solid phase reversible immobilization bead purification strategies enable no sample transfers from sonication in 96-well plates through adapter ligation, resulting in 50% less library preparation reagent consumption. Custom Y-shaped adapters containing novel 7 base pair index sequences with a Hamming distance of ≥2 were directly ligated onto fragmented source DNA eliminating the need for PCR to incorporate indexes, and was followed by a custom blocking strategy using a single oligonucleotide regardless of index sequence. These results were obtained aligning raw reads against the entire genome using Novoalign followed by variant calling of non-indexed pools using SPLINTER or SAMtools for indexed samples. With these pipelines, we find sensitivity and specificity of 99.4% and 99.7% for pooled exome sequencing. Sensitivity, and to a lesser degree specificity, proved to be a function of coverage. For rare variants (≤2% minor allele frequency), we achieved sensitivity and specificity of ≥94.9% and ≥99.99% for custom capture of 2.5 Mb in multiplexed libraries of 22-48 individuals with only ≥5-fold coverage/chromosome, but these parameters improved to ≥98.7 and 100% with 20-fold coverage/chromosome. CONCLUSIONS:This highly scalable methodology enables accurate rare variant detection, with or without individual DNA sample indexing, while reducing the amount of required source DNA and total costs through less hybridization reagent consumption, multi-sample sonication in a standard PCR plate, multiplexed pre-enrichment pooling with a single hybridization and lesser sequencing coverage required to obtain high sensitivity.
journal_name
BMC Genomicsjournal_title
BMC genomicsauthors
Ramos E,Levinson BT,Chasnoff S,Hughes A,Young AL,Thornton K,Li A,Vallania FL,Province M,Druley TEdoi
10.1186/1471-2164-13-683subject
Has Abstractpub_date
2012-12-06 00:00:00pages
683issn
1471-2164pii
1471-2164-13-683journal_volume
13pub_type
杂志文章相关文献
BMC GENOMICS文献大全abstract:BACKGROUND:Despite substantial progress in mosquito genomic and genetic research, few cis-regulatory elements (CREs), DNA sequences that control gene expression, have been identified in mosquitoes or other non-model insects. Formaldehyde-assisted isolation of regulatory elements paired with DNA sequencing, FAIRE-seq, i...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-2468-x
更新日期:2016-05-10 00:00:00
abstract:BACKGROUND:A-to-I RNA editing is a co-/post-transcriptional modification catalyzed by ADAR enzymes, that deaminates Adenosines (A) into Inosines (I). Most of known editing events are located within inverted ALU repeats, but they also occur in coding sequences and may alter the function of encoded proteins. RNA editing ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-018-5364-8
更新日期:2018-12-27 00:00:00
abstract:BACKGROUND:Gibberella ear rot (GER) is one of the most economically important fungal diseases of maize in the temperate zone due to moldy grain contaminated with health threatening mycotoxins. To develop resistant genotypes and control the disease, understanding the host-pathogen interaction is essential. RESULTS:RNA-...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-018-4513-4
更新日期:2018-02-09 00:00:00
abstract:BACKGROUND:The common marmoset monkey (Callithrix jacchus), a small non-endangered New World primate native to eastern Brazil, is becoming increasingly used as a non-human primate model in biomedical research, drug development and safety assessment. In contrast to the growing interest for the marmoset as an animal mode...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-8-190
更新日期:2007-06-25 00:00:00
abstract:BACKGROUND:Mucolipidosis type IV (MLIV) is an autosomal recessive lysosomal storage disorder characterized by severe neurologic and ophthalmologic abnormalities. Recently the MLIV gene, MCOLN1, has been identified as a new member of the transient receptor potential (TRP) cation channel superfamily. Here we report the c...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-3-3
更新日期:2002-01-01 00:00:00
abstract:BACKGROUND:Circular chromosome conformation capture (4C) has provided important insights into three dimensional (3D) genome organization and its critical impact on the regulation of gene expression. We developed a new quantitative framework based on polymer physics for the analysis of paired-end sequencing 4C (PE-4Cseq...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-2137-5
更新日期:2015-11-21 00:00:00
abstract:BACKGROUND:Whole-genome sequencing is an important method to understand the genetic information, gene function, biological characteristics and survival mechanisms of organisms. Sequencing large genomes is very simple at present. However, we encountered a hard-to-sequence genome of Pseudomonas aeruginosa phage PaP1. Sho...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-803
更新日期:2014-09-19 00:00:00
abstract:BACKGROUND:Long terminal repeat retrotransposons are the most abundant transposons in plants. They play important roles in alternative splicing, recombination, gene regulation, and defense mechanisms. Large-scale sequencing projects for plant genomes are currently underway. Software tools are important for annotating l...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-019-5796-9
更新日期:2019-06-03 00:00:00
abstract:BACKGROUND:Carboxylesterase is a multifunctional superfamily and ubiquitous in all living organisms, including animals, plants, insects, and microbes. It plays important roles in xenobiotic detoxification, and pheromone degradation, neurogenesis and regulating development. Previous studies mainly used Dipteran Drosophi...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-10-553
更新日期:2009-11-24 00:00:00
abstract:BACKGROUND:The increasing numbers of 3D compounds and protein complexes stored in databases contribute greatly to current advances in biotechnology, being employed in several pharmaceutical and industrial applications. However, screening and retrieving appropriate candidates as well as handling false positives presents...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-S4-S26
更新日期:2010-12-02 00:00:00
abstract:BACKGROUND:Previously, we could show that L-lactate affects cultured bovine granulosa cells (GC) in a specific manner driving the cells into an early pre-ovulatory phenotype. Here we studied genome wide effects in L-lactate-treated GC to further elucidate the underlying mechanisms that are responsible for the L-lactate...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-019-5657-6
更新日期:2019-04-05 00:00:00
abstract:BACKGROUND:Intramuscular fat (IMF) is one of the most important factors positively associated with meat quality. Triglycerides (TGs), as the main component of IMF, play an essential role in muscle lipid metabolism. This transcriptome analysis of pectoralis muscle tissue aimed to identify functional genes and biological...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-019-6221-0
更新日期:2019-11-15 00:00:00
abstract:BACKGROUND:Acinetobacter baumannii is a major health problem. The most common infection caused by A. baumannii is hospital acquired pneumonia, and the associated mortality rate is approximately 50%. Neither in vivo nor ex vivo expression profiling has been performed at the proteomic or transcriptomic level for pneumoni...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-1608-z
更新日期:2015-05-30 00:00:00
abstract:BACKGROUND:Boar taint is observed in a high proportion of uncastrated male pigs and is characterized by an unpleasant odor/flavor in cooked meat, primarily caused by elevated levels of androstenone and skatole. Androstenone is a steroid produced in the testis in parallel with biosynthesis of other sex steroids like tes...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-12-362
更新日期:2011-07-13 00:00:00
abstract:BACKGROUND:Determination of genome-wide DNA methylation is significant for both basic research and drug development. As a key epigenetic modification, this biochemical process can modulate gene expression to influence the cell differentiation which can possibly lead to cancer. Due to the involuted biochemical mechanism...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-019-5488-5
更新日期:2019-04-04 00:00:00
abstract:BACKGROUND:Sugarcane is an economically important crop contributing about 80% and 40% to the world sugar and ethanol production, respectively. The complicated genetics consequential to its complex polyploid genome, however, have impeded efforts to improve sugar yield and related important agronomic traits. Modern sugar...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-14-314
更新日期:2013-05-10 00:00:00
abstract:BACKGROUND:The increased multi-omics information on carefully phenotyped patients in studies of complex diseases requires novel methods for data integration. Unlike continuous intensity measurements from most omics data sets, phenome data contain clinical variables that are binary, ordinal and categorical. RESULTS:In ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-2170-4
更新日期:2015-11-11 00:00:00
abstract:BACKGROUND:The genome-wide association (GWA) approach represents an alternative to biparental linkage mapping for determining the genetic basis of trait variation. Both approaches rely on recombination to re-arrange the genome, and seek to establish correlations between phenotype and genotype. The major advantages of G...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-896
更新日期:2014-10-14 00:00:00
abstract:BACKGROUND:Advances in genome technology have simplified a new comprehension of the genetic and historical processes crucial to rapid phenotypic evolution under domestication. To get new insight into the genetic basis of the dog domestication process, we conducted whole-genome sequence analysis of three wolves and thre...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-020-6619-8
更新日期:2020-03-04 00:00:00
abstract:BACKGROUND:Molecular characterization is important for efficient utilization of germplasm and development of improved varieties. In the present study, we investigated the genetic purity, relatedness and population structure of 265 maize inbred lines from the Ethiopian Institute of Agricultural Research (EIAR), the Inte...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-017-4173-9
更新日期:2017-10-12 00:00:00
abstract:BACKGROUND:In many microbial genomes, a strong preference for a small number of codons can be observed in genes whose products are needed by the cell in large quantities. This codon usage bias (CUB) improves translational accuracy and speed and is one of several factors optimizing cell growth. Whereas CUB and the overr...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-617
更新日期:2010-11-04 00:00:00
abstract:BACKGROUND:Helitrons are class-II eukaryotic transposons that transpose via a rolling circle mechanism. Due to their ability to capture and mobilize gene fragments, they play an important role in the evolution of their host genomes. We have used a bioinformatics approach for the identification of helitrons in two Pleur...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-1071
更新日期:2014-12-05 00:00:00
abstract:BACKGROUND:Bacterial invasive infection and host immune response is fundamental to the understanding of pathogen pathogenesis and the discovery of effective therapeutic drugs. However, there are very few experimental studies on the signaling cross-talks between bacteria and human host to date. METHODS:In this work, ta...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-018-4873-9
更新日期:2018-06-28 00:00:00
abstract:BACKGROUND:The measurement of gene expression using microarray technology is a complicated process in which a large number of factors can be varied. Due to the lack of standard calibration samples such as are used in traditional chemical analysis it may be a problem to evaluate whether changes done to the microarray pr...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-8-377
更新日期:2007-10-18 00:00:00
abstract:BACKGROUND:On a single strand of genomic DNA the number of As is usually about equal to the number of Ts (and similarly for Gs and Cs), but deviations have been noted for transcribed regions and origins of replication. RESULTS:The mouse genome is shown to have a segmented structure defined by strand bias. Transcriptio...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-9-16
更新日期:2008-01-14 00:00:00
abstract:BACKGROUND:Two fifths of the world's population is at risk from dengue. The absence of effective drugs and vaccines leaves vector control as the primary intervention tool. Understanding dengue virus (DENV) host interactions is essential for the development of novel control strategies. The availability of genome sequenc...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-380
更新日期:2010-06-16 00:00:00
abstract:BACKGROUND:The sequence of the pathogen Mycobacterium tuberculosis (Mtb) strain H37Rv has been available for over a decade, but the biology of the pathogen remains poorly understood. Genome sequences from other Mtb strains and closely related bacteria present an opportunity to apply the power of comparative genomics to...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-120
更新日期:2012-03-28 00:00:00
abstract:BACKGROUND:Protein phosphorylation is responsible for a large portion of the regulatory functions of eukaryotic cells. Although the list of sequenced genomes of filamentous fungi has grown rapidly, the kinomes of recently sequenced species have not yet been studied in detail. The objective of this study is to apply a c...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-133
更新日期:2010-02-24 00:00:00
abstract:BACKGROUND:Cichlid fishes have evolved remarkably diverse reproductive, social, and feeding behaviors. Cell-to-cell signaling molecules, notably neuropeptides and peptide hormones, are known to regulate these behaviors across vertebrates. This class of signaling molecules derives from prohormone genes that have undergo...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-2914-9
更新日期:2016-08-19 00:00:00
abstract:BACKGROUND:Functional analysis of the catfish genome will be useful for the identification of genes controlling traits of economic importance, especially innate disease resistance. However, this species lacks a platform for global gene expression profiling, so we designed a first generation high-density oligonucleotide...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-7-134
更新日期:2006-06-01 00:00:00