Abstract:
BACKGROUND:Genomic studies such as genome-wide association and genomic selection require genome-wide genotype data. All existing technologies used to create these data result in missing genotypes, which are often then inferred using genotype imputation software. However, existing imputation methods most often make use only of genotypes that are successfully inferred after having passed a certain read depth threshold. Because of this, any read information for genotypes that did not pass the threshold, and were thus set to missing, is ignored. Most genomic studies also choose read depth thresholds and quality filters without investigating their effects on the size and quality of the resulting genotype data. Moreover, almost all genotype imputation methods require ordered markers and are therefore of limited utility in non-model organisms. RESULTS:Here we introduce LinkImputeR, a software program that exploits the read count information that is normally ignored, and makes use of all available DNA sequence information for the purposes of genotype calling and imputation. It is specifically designed for non-model organisms since it requires neither ordered markers nor a reference panel of genotypes. Using next-generation DNA sequence (NGS) data from apple, cannabis and grape, we quantify the effect of varying read count and missingness thresholds on the quantity and quality of genotypes generated from LinkImputeR. We demonstrate that LinkImputeR can increase the number of genotype calls by more than an order of magnitude, can improve genotyping accuracy by several percent and can thus improve the power of downstream analyses. Moreover, we show that the effects of quality and read depth filters can differ substantially between data sets and should therefore be investigated on a per-study basis. CONCLUSIONS:By exploiting DNA sequence data that is normally ignored during genotype calling and imputation, LinkImputeR can significantly improve both the quantity and quality of genotype data generated from NGS technologies. It enables the user to quickly and easily examine the effects of varying thresholds and filters on the number and quality of the resulting genotype calls. In this manner, users can decide on thresholds that are most suitable for their purposes. We show that LinkImputeR can significantly augment the value and utility of NGS data sets, especially in non-model organisms with poor genomic resources.
journal_name
BMC Genomicsjournal_title
BMC genomicsauthors
Money D,Migicovsky Z,Gardner K,Myles Sdoi
10.1186/s12864-017-3873-5subject
Has Abstractpub_date
2017-07-10 00:00:00pages
523issue
1issn
1471-2164pii
10.1186/s12864-017-3873-5journal_volume
18pub_type
杂志文章相关文献
BMC GENOMICS文献大全abstract:BACKGROUND:Anthocyanins are water-soluble colored flavonoids present in multiple organs of various plant species including flowers, fruits, leaves, stems and roots. DNA-binding R2R3-MYB transcription factors, basic helix-loop-helix (bHLH) transcription factors, and WD40 repeat proteins are known to form MYB-bHLH-WD rep...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-018-5135-6
更新日期:2018-11-08 00:00:00
abstract:BACKGROUND:Based on sequence similarity, the superfamily of G protein-coupled receptors (GPRs) can be subdivided into several subfamilies, the members of which often share similar ligands. The sequence data provided by the human genome project allows us to identify new GPRs by in silico homology screening, and to predi...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-3-17
更新日期:2002-07-05 00:00:00
abstract:BACKGROUND:The sacred lotus (Nelumbo nucifera) is widely cultivated in China for its edible rhizomes and seeds. Traditional plant breeding methods have been used to breed cultivars with increased yields and quality of rhizomes and seeds with limited success. Currently, the available genetic maps and molecular markers i...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-2781-4
更新日期:2016-06-17 00:00:00
abstract:BACKGROUND:The growth and development of the posterior silk gland and the biosynthesis of the silk core protein at the fifth larval instar stage of Bombyx mori are of paramount importance for silk production. RESULTS:Here, aided by next-generation sequencing and microarry assay, we profile 1,229 microRNAs (miRNAs), in...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-410
更新日期:2014-05-29 00:00:00
abstract:BACKGROUND:S. erythraea is a Gram-positive filamentous bacterium used for the industrial-scale production of erythromycin A which is of high clinical importance. In this work, we sequenced the whole genome of a high-producing strain (E3) obtained by random mutagenesis and screening from the wild-type strain NRRL23338, ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-14-523
更新日期:2013-07-31 00:00:00
abstract:BACKGROUND:Avian infectious bronchitis virus (IBV) is a gamma coronavirus that severely affects the poultry industry worldwide. Long non-coding RNAs (lncRNAs), a subset of non-coding RNAs with a length of more than 200 nucleotides, have been recently recognized as pivotal factors in the pathogenesis of viral infections...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-020-07359-3
更新日期:2021-01-20 00:00:00
abstract:BACKGROUND:A-to-I RNA-editing mediated by ADAR (adenosine deaminase acting on RNA) enzymes that converts adenosine to inosine in RNA sequence can generate mutations and alter gene regulation in metazoans. Previous studies have shown that A-to-I RNA-editing plays vital roles in mouse embryogenesis. However, the RNA-edit...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-3115-2
更新日期:2016-09-29 00:00:00
abstract:BACKGROUND:Transposable elements are selfish genetic sequences which only occasionally provide useful functions to their host species. In addition, models of mobile element evolution assume a second type of selfishness: elements of different families do not cooperate, but they independently fight for their survival in ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-9-219
更新日期:2008-05-14 00:00:00
abstract:BACKGROUND:Spinal cord injury leads to neurological dysfunctions affecting the motor, sensory as well as the autonomic systems. Increased excitability of motor neurons has been implicated in injury-induced spasticity, where the reappearance of self-sustained plateau potentials in the absence of modulatory inputs from t...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-365
更新日期:2010-06-09 00:00:00
abstract:BACKGROUND:Yarrowia lipolytica is an oleaginous ascomycete yeast that stores lipids in response to limitation of nitrogen. While the enzymatic pathways responsible for neutral lipid accumulation in Y. lipolytica are well characterized, regulation of these pathways has received little attention. We therefore sought to c...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-2471-2
更新日期:2016-02-25 00:00:00
abstract:BACKGROUND:Over the past 50,000 years, shifts in human-environmental or human-human interactions shaped genetic differences within and among human populations, including variants under positive selection. Shaped by environmental factors, such variants influence the genetics of modern health, disease, and treatment outc...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-16-S8-S8
更新日期:2015-01-01 00:00:00
abstract:BACKGROUND:Blood-born miRNA signatures have recently been reported for various tumor diseases. Here, we compared the miRNA signature in Wilms tumor patients prior and after preoperative chemotherapy according to SIOP protocol 2001. RESULTS:We did not find a significant difference between miRNA signature of both groups...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-379
更新日期:2012-08-07 00:00:00
abstract:BACKGROUND:Inadvertent sample swaps are a real threat to data quality in any medium to large scale omics studies. While matches between samples from the same individual can in principle be identified from a few well characterized single nucleotide polymorphisms (SNPs), omics data types often only provide low to moderat...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-019-6332-7
更新日期:2019-12-30 00:00:00
abstract:BACKGROUND:Although profiling of RNA in single cells has broadened our understanding of development, cancer biology and mechanisms of disease dissemination, it requires the development of reliable and flexible methods. Here we demonstrate that the EpiStem RNA-Amp™ methodology reproducibly generates microgram amounts of...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-1129
更新日期:2014-12-17 00:00:00
abstract:BACKGROUND:Aging is affected by genetic and environmental factors, and cigarette smoking is strongly associated with accumulation of senescent cells. In this study, we wanted to identify genes that may potentially be beneficial for cell survival in response to cigarette smoke and thereby may contribute to development o...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-018-5409-z
更新日期:2019-01-09 00:00:00
abstract:BACKGROUND:The cellular proteome and metabolome are underlying dynamic regulation allowing rapid adaptation to changes in the environment. System-wide analysis of these dynamics will provide novel insights into mechanisms of stress adaptation for higher photosynthetic organisms. We applied pulsed-SILAC labeling to a ph...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-215
更新日期:2012-05-31 00:00:00
abstract:BACKGROUND:Maternal undernutrition leads to an increased risk of metabolic disorders in offspring including obesity and insulin resistance, thought to be due to a programmed thrifty phenotype which is inappropriate for a subsequent richer nutritional environment. In a rat model, both male and female offspring of undern...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-49
更新日期:2014-01-21 00:00:00
abstract:BACKGROUND:Litchi (Litchi chinensis Sonn.) is one of the most important fruit trees cultivated in tropical and subtropical areas. However, a lack of transcriptomic and genomic information hinders our understanding of the molecular mechanisms underlying fruit set and fruit development in litchi. Shading during early fru...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-14-552
更新日期:2013-08-14 00:00:00
abstract:BACKGROUND:The genus Populus includes poplars, aspens and cottonwoods, which will be collectively referred to as poplars hereafter unless otherwise specified. Poplars are the dominant tree species in many forest ecosystems in the Northern Hemisphere and are of substantial economic value in plantation forestry. Poplar h...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-9-57
更新日期:2008-01-29 00:00:00
abstract:BACKGROUND:With the development of several new technologies using synthetic biology, it is possible to engineer genetically intractable organisms including Mycoplasma mycoides subspecies capri (Mmc), by cloning the intact bacterial genome in yeast, using the host yeast's genetic tools to modify the cloned genome, and s...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-1180
更新日期:2014-12-24 00:00:00
abstract:BACKGROUND:Transcription factors (TFs) play essential roles during plant development and response to environmental stresses. However, the relationships among transcription factors, cis-acting elements and target gene expression under endo- and exogenous stimuli have not been systematically characterized. RESULTS:Here,...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-018-4469-4
更新日期:2018-05-09 00:00:00
abstract:BACKGROUND:Xanthomonas citri pv. citri (Xcc) is a citrus canker causing Gram-negative bacteria. Currently, little is known about the biological and molecular responses of Xcc to low temperatures. RESULTS:Results depicted that low temperature significantly reduced growth and increased biofilm formation and unsaturated ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-019-6193-0
更新日期:2019-11-06 00:00:00
abstract:BACKGROUND:The transformation of a developing epithelium into an adult structure is a complex process, which often involves coordinated changes in cell proliferation, metabolism, adhesion, and shape. To identify genetic mechanisms that control epithelial differentiation, we analyzed the temporal patterns of gene expres...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-498
更新日期:2012-09-20 00:00:00
abstract:BACKGROUND:Due to the predominant usage of short-read sequencing to date, most bacterial genome sequences reported in the last years remain at the draft level. This precludes certain types of analyses, such as the in-depth analysis of genome plasticity. RESULTS:Here we report the finalized genome sequence of the envir...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-017-4301-6
更新日期:2018-01-05 00:00:00
abstract:BACKGROUND:Streptococcus agalactiae (group B Streptococcus; GBS) is a significant bacterial pathogen of neonates and an emerging pathogen of adults. Though transcriptional regulators are abundantly encoded on the GBS genome, their role in GBS pathogenesis is poorly understood. The mtaR gene encodes a putative LysR-type...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-9-607
更新日期:2008-12-16 00:00:00
abstract:BACKGROUND:Preterm birth (PTB), birth at <37 weeks of gestation, is a significant global public health problem. World-wide, about 15 million babies are born preterm each year resulting in more than a million deaths of children. Preterm neonates are more prone to problems and need intensive care hospitalization. Health ...
journal_title:BMC genomics
pub_type: 杂志文章,评审
doi:10.1186/s12864-016-3089-0
更新日期:2016-10-17 00:00:00
abstract:BACKGROUND:Infectious laryngotracheitis virus (ILTV; gallid herpesvirus 1) infection causes high mortality and huge economic losses in the poultry industry. To protect chickens against ILTV infection, chicken-embryo origin (CEO) and tissue-culture origin (TCO) vaccines have been used. However, the transmission of vacci...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-143
更新日期:2012-04-24 00:00:00
abstract:BACKGROUND:Arsenic is a carcinogen that is known to induce cell transformation and tumor formation. Although studies have been performed to examine the modulation of signaling molecules caused by arsenic exposure, the molecular mechanisms by which arsenic causes cancer are still unclear. We hypothesized that arsenic al...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-9-115
更新日期:2008-03-03 00:00:00
abstract:BACKGROUND:Characterizing large genomic variants is essential to expanding the research and clinical applications of genome sequencing. While multiple data types and methods are available to detect these structural variants (SVs), they remain less characterized than smaller variants because of SV diversity, complexity,...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-1479-3
更新日期:2015-04-11 00:00:00
abstract:BACKGROUND:Roche 454 pyrosequencing has become a method of choice for generating transcriptome data from non-model organisms. Once the tens to hundreds of thousands of short (250-450 base) reads have been produced, it is important to correctly assemble these to estimate the sequence of all the transcripts. Most transcr...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-571
更新日期:2010-10-16 00:00:00