Evaluation of whole exome sequencing as an alternative to BeadChip and whole genome sequencing in human population genetic analysis.

Abstract:

BACKGROUND: Understanding the underlying genetic structure of human populations is of fundamental interest to both the biological and social sciences. Advances in high-throughput genotyping technology have markedly improved our understanding of global patterns of human genetic variation. The most widely used methods for collecting variant information at the DNA level are whole genome sequencing, which remains costly, and the more economical array-based techniques, which simultaneously genotype a pre-selected set of variable DNA sites in the human genome. The largest publicly accessible set of human genomic sequence data available today originates from exome sequencing, which covers around 1.2% of the whole genome (approximately 30 million base pairs).

RESULTS: To compare the effect of SNP selection strategies on population genetic analysis without bias, we subsampled the variants of the same highly curated 1 K Genome dataset to mimic whole genome sequencing, exome sequencing and array data, thereby eliminating the effect of the different chemistry and error profiles of these approaches. We then compared the exome dataset with the array-based dataset and with the gold-standard whole genome dataset using the same population genetic analysis methods.

CONCLUSIONS: Our results draw attention to some of the inherent problems that arise from using pre-selected SNP sets for population genetic analysis. We also demonstrate that exome sequencing provides a better alternative to array-based methods for population genetic analysis. In this study, we propose a strategy for unbiased variant collection from exome data and offer a bioinformatics protocol for proper data processing.

journal_name

BMC Genomics

authors

Maróti Z,Boldogkői Z,Tombácz D,Snyder M,Kalmár T

doi

10.1186/s12864-018-5168-x

pub_date

2018-10-29 00:00:00

pages

778

issue

1

issn

1471-2164

pii

10.1186/s12864-018-5168-x

journal_volume

19

pub_type

Journal Article
  • Phylogenetic reconstruction from transpositions.

    abstract:BACKGROUND:Because of the advent of high-throughput sequencing and the consequent reduction in the cost of sequencing, many organisms have been completely sequenced and most of their genes identified. It thus has become possible to represent whole genomes as ordered lists of gene identifiers and to study the rearrangem...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-S2-S15

    authors: Yue F,Zhang M,Tang J

    update_date:2008-09-16 00:00:00

  • Effects of elevated seawater pCO(2) on gene expression patterns in the gills of the green crab, Carcinus maenas.

    abstract:BACKGROUND:The green crab Carcinus maenas is known for its high acclimation potential to varying environmental abiotic conditions. A high ability for ion and acid-base regulation is mainly based on an efficient regulation apparatus located in gill epithelia. However, at present it is neither known which ion transport p...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-488

    authors: Fehsenfeld S,Kiko R,Appelhans Y,Towle DW,Zimmer M,Melzner F

    update_date:2011-10-06 00:00:00

  • Identification of "pathologs" (disease-related genes) from the RIKEN mouse cDNA dataset using human curation plus FACTS, a new biological information extraction system.

    abstract:BACKGROUND:A major goal in the post-genomic era is to identify and characterise disease susceptibility genes and to apply this knowledge to disease prevention and treatment. Rodents and humans have remarkably similar genomes and share closely related biochemical, physiological and pathological pathways. In this work we...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-5-28

    authors: Silva DG,Schönbach C,Brusic V,Socha LA,Nagashima T,Petrovsky N

    update_date:2004-04-29 00:00:00

  • Genetic variation and population structure of maize inbred lines adapted to the mid-altitude sub-humid maize agro-ecology of Ethiopia using single nucleotide polymorphic (SNP) markers.

    abstract:BACKGROUND:Molecular characterization is important for efficient utilization of germplasm and development of improved varieties. In the present study, we investigated the genetic purity, relatedness and population structure of 265 maize inbred lines from the Ethiopian Institute of Agricultural Research (EIAR), the Inte...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-4173-9

    authors: Ertiro BT,Semagn K,Das B,Olsen M,Labuschagne M,Worku M,Wegary D,Azmach G,Ogugo V,Keno T,Abebe B,Chibsa T,Menkir A

    update_date:2017-10-12 00:00:00

  • Comparing de novo assemblers for 454 transcriptome data.

    abstract:BACKGROUND:Roche 454 pyrosequencing has become a method of choice for generating transcriptome data from non-model organisms. Once the tens to hundreds of thousands of short (250-450 base) reads have been produced, it is important to correctly assemble these to estimate the sequence of all the transcripts. Most transcr...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-571

    authors: Kumar S,Blaxter ML

    update_date:2010-10-16 00:00:00

  • Identification and analysis of long non-coding RNAs that are involved in inflammatory process in response to transmissible gastroenteritis virus infection.

    abstract:BACKGROUND:Transmissible gastroenteritis virus (TGEV) infection can cause acute inflammation. Long noncoding RNAs (lncRNAs) play important roles in a number of biological process including inflammation response. However, whether lncRNAs participate in TGEV-induced inflammation in porcine intestinal epithelial cells (IP...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-6156-5

    authors: Ma X,Zhao X,Wang K,Tang X,Guo J,Mi M,Qi Y,Chang L,Huang Y,Tong D

    update_date:2019-11-04 00:00:00

  • QTLs associated with dry matter intake, metabolic mid-test weight, growth and feed efficiency have little overlap across 4 beef cattle studies.

    abstract:BACKGROUND:The identification of genetic markers associated with complex traits that are expensive to record such as feed intake or feed efficiency would allow these traits to be included in selection programs. To identify large-effect QTL, we performed a series of genome-wide association studies and functional analyse...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-1004

    authors: Saatchi M,Beever JE,Decker JE,Faulkner DB,Freetly HC,Hansen SL,Yampara-Iquise H,Johnson KA,Kachman SD,Kerley MS,Kim J,Loy DD,Marques E,Neibergs HL,Pollak EJ,Schnabel RD,Seabury CM,Shike DW,Snelling WM,Spangler ML,

    update_date:2014-11-20 00:00:00

  • Gastrointestinal microbial populations can distinguish pediatric and adolescent Acute Lymphoblastic Leukemia (ALL) at the time of disease diagnosis.

    abstract:BACKGROUND:An estimated 15,000 children and adolescents under the age of 19 years are diagnosed with leukemia, lymphoma and other tumors in the USA every year. All children and adolescent acute leukemia patients will undergo chemotherapy as part of their treatment regimen. Fortunately, survival rates for most pediatric...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2965-y

    authors: Rajagopala SV,Yooseph S,Harkins DM,Moncera KJ,Zabokrtsky KB,Torralba MG,Tovchigrechko A,Highlander SK,Pieper R,Sender L,Nelson KE

    update_date:2016-08-15 00:00:00

  • Alternative splicing enriched cDNA libraries identify breast cancer-associated transcripts.

    abstract:BACKGROUND:Alternative splicing (AS) is a central mechanism in the generation of genomic complexity and is a major contributor to transcriptome and proteome diversity. Alterations of the splicing process can lead to deregulation of crucial cellular processes and have been associated with a large spectrum of human disea...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-S5-S4

    authors: Ferreira EN,Rangel MC,Galante PF,de Souza JE,Molina GC,de Souza SJ,Carraro DM

    update_date:2010-12-22 00:00:00

  • Correction to: RNA-sequencing reveals positional memory of multipotent mesenchymal stromal cells from oral and maxillofacial tissue transcriptomes.

    abstract:An amendment to this paper has been published and can be accessed via the original article. ...

    journal_title:BMC genomics

    pub_type: Journal Article, Published Erratum

    doi:10.1186/s12864-020-06939-7

    authors: Onizuka S,Yamazaki Y,Park SJ,Sugimoto T,Sone Y,Sjöqvist S,Usui M,Takeda A,Nakai K,Nakashima K,Iwata T

    update_date:2020-08-11 00:00:00

  • Complete plastid genome sequence of Daucus carota: implications for biotechnology and phylogeny of angiosperms.

    abstract:BACKGROUND:Carrot (Daucus carota) is a major food crop in the US and worldwide. Its capacity for storage and its lifecycle as a biennial make it an attractive species for the introduction of foreign genes, especially for oral delivery of vaccines and other therapeutic proteins. Until recently efforts to express recombi...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-7-222

    authors: Ruhlman T,Lee SB,Jansen RK,Hostetler JB,Tallon LJ,Town CD,Daniell H

    update_date:2006-08-31 00:00:00

  • Metabolite and transcriptome analysis during fasting suggest a role for the p53-Ddit4 axis in major metabolic tissues.

    abstract:BACKGROUND:Fasting induces specific molecular and metabolic adaptions in most organisms. In biomedical research fasting is used in metabolic studies to synchronize nutritional states of study subjects. Because there is a lack of standardization for this procedure, we need a deeper understanding of the dynamics and the ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-758

    authors: Schupp M,Chen F,Briggs ER,Rao S,Pelzmann HJ,Pessentheiner AR,Bogner-Strauss JG,Lazar MA,Baldwin D,Prokesch A

    update_date:2013-11-05 00:00:00

  • Reductive evolution and the loss of PDC/PAS domains from the genus Staphylococcus.

    abstract:BACKGROUND:The Per-Arnt-Sim (PAS) domain represents a ubiquitous structural fold that is involved in bacterial sensing and adaptation systems, including several virulence related functions. Although PAS domains and the subclass of PhoQ-DcuS-CitA (PDC) domains have a common structure, there is limited amino acid sequenc...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-524

    authors: Shah N,Gaupp R,Moriyama H,Eskridge KM,Moriyama EN,Somerville GA

    update_date:2013-07-31 00:00:00

  • Effects of pathogenic CNVs on physical traits in participants of the UK Biobank.

    abstract:BACKGROUND:Copy number variants (CNVs) have been shown to increase risk for physical anomalies, developmental, psychiatric and medical disorders. Some of them have been associated with changes in weight, height, and other physical traits. As most studies have been performed on children and young people, these effects o...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-5292-7

    authors: Owen D,Bracher-Smith M,Kendall KM,Rees E,Einon M,Escott-Price V,Owen MJ,O'Donovan MC,Kirov G

    update_date:2018-12-04 00:00:00

  • A study of inter-lab and inter-platform agreement of DNA microarray data.

    abstract:As gene expression profile data from DNA microarrays accumulate rapidly, there is a natural need to compare data across labs and platforms. Comparisons of microarray data can be quite challenging due to data complexity and variability. Different labs may adopt different technology platforms. One may ask about the degr...

    journal_title:BMC genomics

    pub_type: Journal Article, Multicenter Study

    doi:10.1186/1471-2164-6-71

    authors: Wang H,He X,Band M,Wilson C,Liu L

    update_date:2005-05-11 00:00:00

  • Comparison of leaf transcriptome in response to Rhizoctonia solani infection between resistant and susceptible rice cultivars.

    abstract:BACKGROUND:Sheath blight (SB), caused by Rhizoctonia solani, is a common rice disease worldwide. Currently, rice cultivars with robust resistance to R. solani are still lacking. To provide theoretic basis for molecular breeding of R. solani-resistant rice cultivars, the changes of transcriptome profiles in response to ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-6645-6

    authors: Shi W,Zhao SL,Liu K,Sun YB,Ni ZB,Zhang GY,Tang HS,Zhu JW,Wan BJ,Sun HQ,Dai JY,Sun MF,Yan GH,Wang AM,Zhu GY

    update_date:2020-03-19 00:00:00

  • A comparison of gene transcription profiles of domesticated and wild Atlantic salmon (Salmo salar L.) at early life stages, reared under controlled conditions.

    abstract:BACKGROUND:Atlantic salmon have been subject to domestication for approximately ten generations, beginning in the early 1970s. This process of artificial selection will have created various genetic differences between wild and farmed stocks. Each year, hundreds of thousands of farmed fish escape into the wild. These es...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-884

    authors: Bicskei B,Bron JE,Glover KA,Taggart JB

    update_date:2014-10-09 00:00:00

  • Building a model: developing genomic resources for common milkweed (Asclepias syriaca) with low coverage genome sequencing.

    abstract:BACKGROUND:Milkweeds (Asclepias L.) have been extensively investigated in diverse areas of evolutionary biology and ecology; however, there are few genetic resources available to facilitate and complement these studies. This study explored how low coverage genome sequencing of the common milkweed (Asclepias syriaca L.)...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-211

    authors: Straub SC,Fishbein M,Livshultz T,Foster Z,Parks M,Weitemier K,Cronn RC,Liston A

    update_date:2011-05-04 00:00:00

  • In silico characterization of a novel putative aerotaxis chemosensory system in the myxobacterium, Corallococcus coralloides.

    abstract:BACKGROUND:An efficient signal transduction system allows a bacterium to sense environmental cues and then to respond positively or negatively to those signals; this process is referred to as taxis. In addition to external cues, the internal metabolic state of any bacterium plays a major role in determining its ability...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-5151-6

    authors: Sharma G,Parales R,Singer M

    update_date:2018-10-19 00:00:00

  • Development and validation of a high density SNP genotyping array for Atlantic salmon (Salmo salar).

    abstract:BACKGROUND:Dense single nucleotide polymorphism (SNP) genotyping arrays provide extensive information on polymorphic variation across the genome of species of interest. Such information can be used in studies of the genetic architecture of quantitative traits and to improve the accuracy of selection in breeding program...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-90

    authors: Houston RD,Taggart JB,Cézard T,Bekaert M,Lowe NR,Downing A,Talbot R,Bishop SC,Archibald AL,Bron JE,Penman DJ,Davassi A,Brew F,Tinch AE,Gharbi K,Hamilton A

    update_date:2014-02-06 00:00:00

  • Ontology and diversity of transcript-associated microsatellites mined from a globe artichoke EST database.

    abstract:BACKGROUND:The globe artichoke (Cynara cardunculus var. scolymus L.) is a significant crop in the Mediterranean basin. Despite its commercial importance and its both dietary and pharmaceutical value, knowledge of its genetics and genomics remains scant. Microsatellite markers have become a key tool in genetic and genom...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-454

    authors: Scaglione D,Acquadro A,Portis E,Taylor CA,Lanteri S,Knapp SJ

    update_date:2009-09-28 00:00:00

  • NovelFam3000--uncharacterized human protein domains conserved across model organisms.

    abstract:BACKGROUND:Despite significant efforts from the research community, an extensive portion of the proteins encoded by human genes lack an assigned cellular function. Most metazoan proteins are composed of structural and/or functional domains, of which many appear in multiple proteins. Once a domain is characterized in on...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-7-48

    authors: Kemmer D,Podowski RM,Arenillas D,Lim J,Hodges E,Roth P,Sonnhammer EL,Höög C,Wasserman WW

    update_date:2006-03-13 00:00:00

  • Bcheck: a wrapper tool for detecting RNase P RNA genes.

    abstract:BACKGROUND:Effective bioinformatics solutions are needed to tackle challenges posed by industrial-scale genome annotation. We present Bcheck, a wrapper tool which predicts RNase P RNA genes by combining the speed of pattern matching and sensitivity of covariance models. The core of Bcheck is a library of subfamily spec...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-432

    authors: Yusuf D,Marz M,Stadler PF,Hofacker IL

    update_date:2010-07-13 00:00:00

  • Local hopping mobile DNA implicated in pseudogene formation and reductive evolution in an obligate cyanobacteria-plant symbiosis.

    abstract:BACKGROUND:Insertion sequences (ISs) are approximately 1 kbp long "jumping" genes found in prokaryotes. ISs encode the protein Transposase, which facilitates the excision and reinsertion of ISs in genomes, making these sequences a type of class I ("cut-and-paste") Mobile Genetic Elements. ISs are proposed to be involve...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1386-7

    authors: Vigil-Stenman T,Larsson J,Nylander JA,Bergman B

    update_date:2015-03-17 00:00:00

  • The genomes of three stocks comprising the most widely utilized live sporozoite Theileria parva vaccine exhibit very different degrees and patterns of sequence divergence.

    abstract:BACKGROUND:There are no commercially available vaccines against human protozoan parasitic diseases, despite the success of vaccination-induced long-term protection against infectious diseases. East Coast fever, caused by the protist Theileria parva, kills one million cattle each year in sub-Saharan Africa, and contribu...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1910-9

    authors: Norling M,Bishop RP,Pelle R,Qi W,Henson S,Drábek EF,Tretina K,Odongo D,Mwaura S,Njoroge T,Bongcam-Rudloff E,Daubenberger CA,Silva JC

    update_date:2015-09-24 00:00:00

  • In silico analysis of the core signaling proteome from the barley powdery mildew pathogen (Blumeria graminis f.sp. hordei).

    abstract:BACKGROUND:Compared to other ascomycetes, the barley powdery mildew pathogen Blumeria graminis f.sp. hordei (Bgh) has a large genome (ca. 120 Mbp) that harbors a relatively small number of protein-coding genes (ca. 6500). This genomic assemblage is thought to be the result of numerous gene losses, which likely represen...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-843

    authors: Kusch S,Ahmadinejad N,Panstruga R,Kuhn H

    update_date:2014-10-02 00:00:00

  • Effect of paleopolyploidy and allopolyploidy on gene expression in banana.

    abstract:BACKGROUND:Bananas (Musa spp.) are an important crop worldwide. Most modern cultivars resulted from a complex polyploidization history that comprised three whole genome duplications (WGDs) shaping the haploid Musa genome, followed by inter- and intra-specific crosses between Musa acuminata and M. balbisiana (A and B ge...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5618-0

    authors: Cenci A,Hueber Y,Zorrilla-Fontanesi Y,van Wesemael J,Kissel E,Gislard M,Sardos J,Swennen R,Roux N,Carpentier SC,Rouard M

    update_date:2019-03-27 00:00:00

  • An analysis of the transcriptome of Teladorsagia circumcincta: its biological and biotechnological implications.

    abstract:BACKGROUND:Teladorsagia circumcincta (order Strongylida) is an economically important parasitic nematode of small ruminants (including sheep and goats) in temperate climatic regions of the world. Improved insights into the molecular biology of this parasite could underpin alternative methods required to control this an...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-S7-S10

    authors: Menon R,Gasser RB,Mitreva M,Ranganathan S

    update_date:2012-01-01 00:00:00

  • MRCNN: a deep learning model for regression of genome-wide DNA methylation.

    abstract:BACKGROUND:Determination of genome-wide DNA methylation is significant for both basic research and drug development. As a key epigenetic modification, this biochemical process can modulate gene expression to influence the cell differentiation which can possibly lead to cancer. Due to the involuted biochemical mechanism...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5488-5

    authors: Tian Q,Zou J,Tang J,Fang Y,Yu Z,Fan S

    update_date:2019-04-04 00:00:00

  • A network-based integrative approach to prioritize reliable hits from multiple genome-wide RNAi screens in Drosophila.

    abstract:BACKGROUND:The recently developed RNA interference (RNAi) technology has created an unprecedented opportunity which allows the function of individual genes in whole organisms or cell lines to be interrogated at genome-wide scale. However, multiple issues, such as off-target effects or low efficacies in knocking down ce...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-220

    authors: Wang L,Tu Z,Sun F

    update_date:2009-05-12 00:00:00