Abstract:
BACKGROUND: High-throughput sequencing has opened up exciting possibilities in population and conservation genetics by enabling the assessment of genetic variation at genome-wide scales. One approach to reduce genome complexity, i.e. investigating only parts of the genome, is reduced-representation library (RRL) sequencing. Like similar approaches, RRL sequencing reduces ascertainment bias because single-nucleotide polymorphisms (SNPs) are discovered and genotyped simultaneously, and it does not require a reference genome. Yet, generating such datasets remains challenging due to laboratory and bioinformatic issues. In the laboratory, current protocols require improvement with regard to sequencing homologous fragments in order to reduce the number of missing genotypes. From the bioinformatic perspective, the reliance of most studies on a single SNP caller disregards the possibility that different algorithms may produce disparate SNP datasets.
RESULTS: We present an improved RRL (iRRL) protocol that maximizes the generation of homologous DNA sequences, thus achieving improved genotyping-by-sequencing efficiency. Our modifications facilitate generation of single-sample libraries, enabling individual genotype assignments instead of pooled-sample analysis. We sequenced ~1% of the orangutan genome with 41-fold median coverage in 31 wild-born individuals from two populations. SNPs and genotypes were called using three different algorithms. We obtained substantially different SNP datasets depending on the SNP caller. Genotype validations revealed that the Unified Genotyper of the Genome Analysis Toolkit and SAMtools performed significantly better than a caller from CLC Genomics Workbench (CLC). Of all conflicting genotype calls, CLC was correct in only 17% of cases. Furthermore, conflicting genotypes between two algorithms showed a systematic bias in that one caller almost exclusively assigned heterozygotes, while the other almost exclusively assigned homozygotes.
CONCLUSIONS: Our enhanced iRRL approach greatly facilitates genotyping-by-sequencing and thus direct estimates of allele frequencies. Our direct comparison of three commonly used SNP callers emphasizes the need to question the accuracy of SNP and genotype calling, as we obtained considerably different SNP datasets depending on caller algorithms, sequencing depths and filtering criteria. These differences affected scans for signatures of natural selection, and will also exert undue influence on demographic inferences. This study presents the first effort to generate a population genomic dataset for wild-born orangutans with known population provenance.
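The heterozygote-versus-homozygote bias reported in the RESULTS can be illustrated with a small tally of discordant calls between two callers. This is a minimal sketch under stated assumptions, not the authors' pipeline: the function names, the `(sample, site)` keys, the `A/G`-style genotype strings, and the toy calls are all hypothetical and are not taken from the study.

```python
from collections import Counter


def is_het(genotype):
    """True when a diploid genotype such as 'A/G' carries two different alleles."""
    first, second = genotype.split("/")
    return first != second


def tally_conflicts(calls_a, calls_b):
    """Count discordant genotypes between two callers by zygosity pattern.

    calls_a and calls_b map hypothetical (sample, site) keys to genotypes
    written like 'A/G'. Only keys genotyped by both callers are compared.
    """
    tally = Counter()
    for key in calls_a.keys() & calls_b.keys():
        ga, gb = calls_a[key], calls_b[key]
        if sorted(ga.split("/")) == sorted(gb.split("/")):
            continue  # concordant call, ignoring allele order
        pattern = (
            "het" if is_het(ga) else "hom",
            "het" if is_het(gb) else "hom",
        )
        tally[pattern] += 1
    return tally


# Made-up toy genotypes for illustration only, not data from the study.
caller_a = {("s1", 101): "A/G", ("s1", 202): "C/T", ("s2", 101): "A/A", ("s2", 303): "G/G"}
caller_b = {("s1", 101): "A/A", ("s1", 202): "C/C", ("s2", 101): "A/A", ("s2", 303): "G/T"}
conflicts = tally_conflicts(caller_a, caller_b)
```

A strongly lopsided tally, with most conflicts falling in one of the `("het", "hom")` or `("hom", "het")` cells, would correspond to the kind of systematic bias the abstract describes.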
journal_name: BMC Genomics
journal_title: BMC genomics
authors: Greminger MP, Stölting KN, Nater A, Goossens B, Arora N, Bruggmann R, Patrignani A, Nussberger B, Sharma R, Kraus RH, Ambu LN, Singleton I, Chikhi L, van Schaik CP, Krützen M
doi: 10.1186/1471-2164-15-16
subject: Has Abstract
pub_date: 2014-01-10 00:00:00
pages: 16
issn: 1471-2164
pii: 1471-2164-15-16
journal_volume: 15
pub_type: Journal article