Generation of SNP datasets for orangutan population genomics using improved reduced-representation sequencing and direct comparisons of SNP calling algorithms.

Abstract:

BACKGROUND:High-throughput sequencing has opened up exciting possibilities in population and conservation genetics by enabling the assessment of genetic variation at genome-wide scales. One approach to reduce genome complexity, i.e. investigating only parts of the genome, is reduced-representation library (RRL) sequencing. Like similar approaches, RRL sequencing reduces ascertainment bias due to simultaneous discovery and genotyping of single-nucleotide polymorphisms (SNPs) and does not require reference genomes. Yet, generating such datasets remains challenging due to laboratory and bioinformatical issues. In the laboratory, current protocols require improvements with regards to sequencing homologous fragments to reduce the number of missing genotypes. From the bioinformatical perspective, the reliance of most studies on a single SNP caller disregards the possibility that different algorithms may produce disparate SNP datasets. RESULTS:We present an improved RRL (iRRL) protocol that maximizes the generation of homologous DNA sequences, thus achieving improved genotyping-by-sequencing efficiency. Our modifications facilitate generation of single-sample libraries, enabling individual genotype assignments instead of pooled-sample analysis. We sequenced ~1% of the orangutan genome with 41-fold median coverage in 31 wild-born individuals from two populations. SNPs and genotypes were called using three different algorithms. We obtained substantially different SNP datasets depending on the SNP caller. Genotype validations revealed that the Unified Genotyper of the Genome Analysis Toolkit and SAMtools performed significantly better than a caller from CLC Genomics Workbench (CLC). Of all conflicting genotype calls, CLC was only correct in 17% of the cases. Furthermore, conflicting genotypes between two algorithms showed a systematic bias in that one caller almost exclusively assigned heterozygotes, while the other one almost exclusively assigned homozygotes. CONCLUSIONS:Our enhanced iRRL approach greatly facilitates genotyping-by-sequencing and thus direct estimates of allele frequencies. Our direct comparison of three commonly used SNP callers emphasizes the need to question the accuracy of SNP and genotype calling, as we obtained considerably different SNP datasets depending on caller algorithms, sequencing depths and filtering criteria. These differences affected scans for signatures of natural selection, but will also exert undue influences on demographic inferences. This study presents the first effort to generate a population genomic dataset for wild-born orangutans with known population provenance.

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Greminger MP,Stölting KN,Nater A,Goossens B,Arora N,Bruggmann R,Patrignani A,Nussberger B,Sharma R,Kraus RH,Ambu LN,Singleton I,Chikhi L,van Schaik CP,Krützen M

doi

10.1186/1471-2164-15-16

subject

Has Abstract

pub_date

2014-01-10 00:00:00

pages

16

issn

1471-2164

pii

1471-2164-15-16

journal_volume

15

pub_type

杂志文章
  • RNA sequencing identifies common pathways between cigarette smoke exposure and replicative senescence in human airway epithelia.

    abstract:BACKGROUND:Aging is affected by genetic and environmental factors, and cigarette smoking is strongly associated with accumulation of senescent cells. In this study, we wanted to identify genes that may potentially be beneficial for cell survival in response to cigarette smoke and thereby may contribute to development o...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-5409-z

    authors: Voic H,Li X,Jang JH,Zou C,Sundd P,Alder J,Rojas M,Chandra D,Randell S,Mallampalli RK,Tesfaigzi Y,Ryba T,Nyunoya T

    更新日期:2019-01-09 00:00:00

  • ATACseqQC: a Bioconductor package for post-alignment quality assessment of ATAC-seq data.

    abstract:BACKGROUND:ATAC-seq (Assays for Transposase-Accessible Chromatin using sequencing) is a recently developed technique for genome-wide analysis of chromatin accessibility. Compared to earlier methods for assaying chromatin accessibility, ATAC-seq is faster and easier to perform, does not require cross-linking, has higher...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-4559-3

    authors: Ou J,Liu H,Yu J,Kelliher MA,Castilla LH,Lawson ND,Zhu LJ

    更新日期:2018-03-01 00:00:00

  • Genome-wide survey and expression profiles of the AP2/ERF family in castor bean (Ricinus communis L.).

    abstract:BACKGROUND:The AP2/ERF transcription factor, one of the largest gene families in plants, plays a crucial role in the regulation of growth and development, metabolism, and responses to biotic and abiotic stresses. Castor bean (Ricinus communis L., Euphobiaceae) is one of most important non-edible oilseed crops and its s...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-785

    authors: Xu W,Li F,Ling L,Liu A

    更新日期:2013-11-13 00:00:00

  • Disease gene identification by random walk on multigraphs merging heterogeneous genomic and phenotype data.

    abstract:BACKGROUND:High throughput experiments resulted in many genomic datasets and hundreds of candidate disease genes. To discover the real disease genes from a set of candidate genes, computational methods have been proposed and worked on various types of genomic data sources. As a single source of genomic data is prone of...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-S7-S27

    authors: Li Y,Li J

    更新日期:2012-01-01 00:00:00

  • Survey of cryptic unstable transcripts in yeast.

    abstract:BACKGROUND:Cryptic unstable transcripts (CUTs) are a largely unexplored class of nuclear exosome degraded, non-coding RNAs in budding yeast. It is highly debated whether CUT transcription has a functional role in the cell or whether CUTs represent noise in the yeast transcriptome. We sought to ascertain the extent of c...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2622-5

    authors: Vera JM,Dowell RD

    更新日期:2016-04-26 00:00:00

  • Haplotyping and copy number estimation of the highly polymorphic human beta-defensin locus on 8p23 by 454 amplicon sequencing.

    abstract:BACKGROUND:The beta-defensin gene cluster (DEFB) at chromosome 8p23.1 is one of the most copy number (CN) variable regions of the human genome. Whereas individual DEFB CNs have been suggested as independent genetic risk factors for several diseases (e.g. psoriasis and Crohn's disease), the role of multisite sequence va...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-252

    authors: Taudien S,Groth M,Huse K,Petzold A,Szafranski K,Hampe J,Rosenstiel P,Schreiber S,Platzer M

    更新日期:2010-04-19 00:00:00

  • A transcriptional blueprint for a spiral-cleaving embryo.

    abstract:BACKGROUND:The spiral cleavage mode of early development is utilized in over one-third of all animal phyla and generates embryonic cells of different size, position, and fate through a conserved set of stereotypic and invariant asymmetric cell divisions. Despite the widespread use of spiral cleavage, regulatory and mol...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2860-6

    authors: Chou HC,Pruitt MM,Bastin BR,Schneider SQ

    更新日期:2016-08-05 00:00:00

  • Altered patterns of gene duplication and differential gene gain and loss in fungal pathogens.

    abstract:BACKGROUND:Duplication, followed by fixation or random loss of novel genes, contributes to genome evolution. Particular outcomes of duplication events are possibly associated with pathogenic life histories in fungi. To date, differential gene gain and loss have not been studied at genomic scales in fungal pathogens, de...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-147

    authors: Powell AJ,Conant GC,Brown DE,Carbone I,Dean RA

    更新日期:2008-03-28 00:00:00

  • Finishing monkeypox genomes from short reads: assembly analysis and a neural network method.

    abstract:BACKGROUND:Poxviruses constitute one of the largest and most complex animal virus families known. The notorious smallpox disease has been eradicated and the virus contained, but its simian sister, monkeypox is an emerging, untreatable infectious disease, killing 1 to 10 % of its human victims. In the case of poxviruses...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2826-8

    authors: Zhao K,Wohlhueter RM,Li Y

    更新日期:2016-08-31 00:00:00

  • Production and utilization of a high-density oligonucleotide microarray in channel catfish, Ictalurus punctatus.

    abstract:BACKGROUND:Functional analysis of the catfish genome will be useful for the identification of genes controlling traits of economic importance, especially innate disease resistance. However, this species lacks a platform for global gene expression profiling, so we designed a first generation high-density oligonucleotide...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-7-134

    authors: Li RW,Waldbieser GC

    更新日期:2006-06-01 00:00:00

  • Improved resolution in the position of drought-related QTLs in a single mapping population of rice by meta-analysis.

    abstract:BACKGROUND:Meta-analysis of QTLs combines the results of several QTL detection studies and provides narrow confidence intervals for meta-QTLs, permitting easier positional candidate gene identification. It is usually applied to multiple mapping populations, but can be applied to one. Here, a meta-analysis of drought re...

    journal_title:BMC genomics

    pub_type: 杂志文章,meta分析

    doi:10.1186/1471-2164-10-276

    authors: Khowaja FS,Norton GJ,Courtois B,Price AH

    更新日期:2009-06-22 00:00:00

  • Differences in transcription between free-living and CO2-activated third-stage larvae of Haemonchus contortus.

    abstract:BACKGROUND:The disease caused by Haemonchus contortus, a blood-feeding nematode of small ruminants, is of major economic importance worldwide. The infective third-stage larva (L3) of this gastric nematode is enclosed in a cuticle (sheath) and, once ingested with herbage by the host, undergoes an exsheathment process th...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-266

    authors: Cantacessi C,Campbell BE,Young ND,Jex AR,Hall RS,Presidente PJ,Zawadzki JL,Zhong W,Aleman-Meza B,Loukas A,Sternberg PW,Gasser RB

    更新日期:2010-04-27 00:00:00

  • Mitochondrial lineage M1 traces an early human backflow to Africa.

    abstract:BACKGROUND:The out of Africa hypothesis has gained generalized consensus. However, many specific questions remain unsettled. To know whether the two M and N macrohaplogroups that colonized Eurasia were already present in Africa before the exit is puzzling. It has been proposed that the east African clade M1 supports a ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-8-223

    authors: González AM,Larruga JM,Abu-Amero KK,Shi Y,Pestano J,Cabrera VM

    更新日期:2007-07-09 00:00:00

  • Identification of microRNA-mRNA modules using microarray data.

    abstract:BACKGROUND:MicroRNAs (miRNAs) are post-transcriptional regulators of mRNA expression and are involved in numerous cellular processes. Consequently, miRNAs are an important component of gene regulatory networks and an improved understanding of miRNAs will further our knowledge of these networks. There is a many-to-many ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-138

    authors: Jayaswal V,Lutherborrow M,Ma DD,Yang YH

    更新日期:2011-03-06 00:00:00

  • Antennal transcriptome analysis of the chemosensory gene families in the tree killing bark beetles, Ips typographus and Dendroctonus ponderosae (Coleoptera: Curculionidae: Scolytinae).

    abstract:BACKGROUND:The European spruce bark beetle, Ips typographus, and the North American mountain pine beetle, Dendroctonus ponderosae (Coleoptera: Curculionidae: Scolytinae), are severe pests of coniferous forests. Both bark beetle species utilize aggregation pheromones to coordinate mass-attacks on host trees, while odora...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-198

    authors: Andersson MN,Grosse-Wilde E,Keeling CI,Bengtsson JM,Yuen MM,Li M,Hillbur Y,Bohlmann J,Hansson BS,Schlyter F

    更新日期:2013-03-21 00:00:00

  • Waking the sleeping dragon: gene expression profiling reveals adaptive strategies of the hibernating reptile Pogona vitticeps.

    abstract:BACKGROUND:Hibernation is a physiological state exploited by many animals exposed to prolonged adverse environmental conditions associated with winter. Large changes in metabolism and cellular function occur, with many stress response pathways modulated to tolerate physiological challenges that might otherwise be letha...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5750-x

    authors: Capraro A,O'Meally D,Waters SA,Patel HR,Georges A,Waters PD

    更新日期:2019-06-06 00:00:00

  • Gene ontology analysis of expanded porcine blastocysts from gilts fed organic or inorganic selenium combined with pyridoxine.

    abstract:BACKGROUND:Gene ontology analysis using the microarray database generated in a previous study by this laboratory was used to further evaluate how maternal dietary supplementation with pyridoxine combined with different sources of selenium (Se) affected global gene expression of expanded porcine blastocysts. Data were g...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-5237-1

    authors: Dalto DB,Tsoi S,Dyck MK,Matte JJ

    更新日期:2018-11-21 00:00:00

  • Genome-wide expression profiling shows transcriptional reprogramming in Fusarium graminearum by Fusarium graminearum virus 1-DK21 infection.

    abstract:BACKGROUND:Fusarium graminearum virus 1 strain-DK21 (FgV1-DK21) is a mycovirus that confers hypovirulence to F. graminearum, which is the primary phytopathogenic fungus that causes Fusarium head blight (FHB) disease in many cereals. Understanding the interaction between mycoviruses and plant pathogenic fungi is necessa...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-173

    authors: Cho WK,Yu J,Lee KM,Son M,Min K,Lee YW,Kim KH

    更新日期:2012-05-06 00:00:00

  • Tandem repeats derived from centromeric retrotransposons.

    abstract:BACKGROUND:Tandem repeats are ubiquitous and abundant in higher eukaryotic genomes and constitute, along with transposable elements, much of DNA underlying centromeres and other heterochromatic domains. In maize, centromeric satellite repeat (CentC) and centromeric retrotransposons (CR), a class of Ty3/gypsy retrotrans...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-142

    authors: Sharma A,Wolfgruber TK,Presting GG

    更新日期:2013-03-04 00:00:00

  • Transversions have larger regulatory effects than transitions.

    abstract:BACKGROUND:Transversions (Tv's) are more likely to alter the amino acid sequence of proteins than transitions (Ts's), and local deviations in the Ts:Tv ratio are indicative of evolutionary selection on genes. Whether the two different types of mutations have different effects in non-protein-coding sequences remains unk...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-3785-4

    authors: Guo C,McDowell IC,Nodzenski M,Scholtens DM,Allen AS,Lowe WL,Reddy TE

    更新日期:2017-05-19 00:00:00

  • Comparative analysis of mitochondrial genomes between the hau cytoplasmic male sterility (CMS) line and its iso-nuclear maintainer line in Brassica juncea to reveal the origin of the CMS-associated gene orf288.

    abstract:BACKGROUND:Cytoplasmic male sterility (CMS) is not only important for exploiting heterosis in crop plants, but also as a model for investigating nuclear-cytoplasmic interaction. CMS may be caused by mutations, rearrangement or recombination in the mitochondrial genome. Understanding the mitochondrial genome is often th...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-322

    authors: Heng S,Wei C,Jing B,Wan Z,Wen J,Yi B,Ma C,Tu J,Fu T,Shen J

    更新日期:2014-04-30 00:00:00

  • Genomic comparison of serogroups O159 and O170 with other Vibrio cholerae serogroups.

    abstract:BACKGROUND:Of the hundreds of Vibrio cholerae serogroups, O1 and O139 are the main epidemic-causing ones. Although non-O1/non-O139 serogroups rarely cause epidemics, the possibility exists for strains within them to have pathogenic potential. RESULTS:We selected 25 representative strains within 16 V. cholerae serogrou...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5603-7

    authors: Li Z,Lu X,Wang D,Liang WL,Zhang J,Li J,Xu J,Pang B,Kan B

    更新日期:2019-03-25 00:00:00

  • Liver proteome response of pre-harvest Atlantic salmon following exposure to elevated temperature.

    abstract:BACKGROUND:Atlantic salmon production in Tasmania (Southern Australia) occurs near the upper limits of the species thermal tolerance. Summer water temperatures can average over 19 °C over several weeks and have negative effects on performance and health. Liver tissue exerts important metabolic functions in thermal adap...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-4517-0

    authors: Nuez-Ortín WG,Carter CG,Nichols PD,Cooke IR,Wilson R

    更新日期:2018-02-12 00:00:00

  • Short-term genome evolution of Listeria monocytogenes in a non-controlled environment.

    abstract:BACKGROUND:While increasing data on bacterial evolution in controlled environments are available, our understanding of bacterial genome evolution in natural environments is limited. We thus performed full genome analyses on four Listeria monocytogenes, including human and food isolates from both a 1988 case of sporadic...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-539

    authors: Orsi RH,Borowsky ML,Lauer P,Young SK,Nusbaum C,Galagan JE,Birren BW,Ivy RA,Sun Q,Graves LM,Swaminathan B,Wiedmann M

    更新日期:2008-11-13 00:00:00

  • Comparative transcriptomics provide insight into the morphogenesis and evolution of fistular leaves in Allium.

    abstract:BACKGROUND:Fistular leaves frequently appear in Allium species, and previous developmental studies have proposed that the process of fistular leaf formation involves programmed cell death. However, molecular evidence for the role of programmed cell death in the formation of fistular leaf cavities has yet to be reported...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3474-8

    authors: Zhu S,Tang S,Tan Z,Yu Y,Dai Q,Liu T

    更新日期:2017-01-10 00:00:00

  • Tracking the best reference genes for RT-qPCR data normalization in filamentous fungi.

    abstract:BACKGROUND:A critical step in the RT-qPCR workflow for studying gene expression is data normalization, one of the strategies being the use of reference genes. This study aimed to identify and validate a selection of reference genes for relative quantification in Talaromyces versatilis, a relevant industrial filamentous...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1224-y

    authors: Llanos A,François JM,Parrou JL

    更新日期:2015-02-14 00:00:00

  • dCaP: detecting differential binding events in multiple conditions and proteins.

    abstract:BACKGROUND:Current ChIP-seq studies are interested in comparing multiple epigenetic profiles across several cell types and tissues simultaneously for studying constitutive and differential regulation. Simultaneous analysis of multiple epigenetic features in many samples can gain substantial power and specificity than a...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-S9-S12

    authors: Chen KB,Hardison R,Zhang Y

    更新日期:2014-01-01 00:00:00

  • Evaluation of viral genome assembly and diversity estimation in deep metagenomes.

    abstract:BACKGROUND:Viruses have unique properties, small genome and regions of high similarity, whose effects on metagenomic assemblies have not been characterized so far. This study uses diverse in silico simulated viromes to evaluate how extensively genomes can be assembled using different sequencing platforms and assemblers...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-989

    authors: Aguirre de Cárcer D,Angly FE,Alcamí A

    更新日期:2014-11-18 00:00:00

  • Identification of sex determination genes and their evolution in Phlebotominae sand flies (Diptera, Nematocera).

    abstract:BACKGROUND:Phlebotomine sand flies (Diptera, Nematocera) are important vectors of several pathogens, including Leishmania parasites, causing serious diseases of humans and dogs. Despite their importance as disease vectors, most aspects of sand fly biology remain unknown including the molecular basis of their reproducti...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5898-4

    authors: Petrella V,Aceto S,Colonna V,Saccone G,Sanges R,Polanska N,Volf P,Gradoni L,Bongiorno G,Salvemini M

    更新日期:2019-06-25 00:00:00

  • Spread of avian pathogenic Escherichia coli ST117 O78:H4 in Nordic broiler production.

    abstract:BACKGROUND:Escherichia coli infections known as colibacillosis constitute a considerable challenge to poultry farmers worldwide, in terms of decreased animal welfare and production economy. Colibacillosis is caused by avian pathogenic E. coli (APEC). APEC strains are extraintestinal pathogenic E. coli and have in gener...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3415-6

    authors: Ronco T,Stegger M,Olsen RH,Sekse C,Nordstoga AB,Pohjanvirta T,Lilje B,Lyhs U,Andersen PS,Pedersen K

    更新日期:2017-01-03 00:00:00