LinkImputeR: user-guided genotype calling and imputation for non-model organisms.

Abstract:

BACKGROUND:Genomic studies such as genome-wide association and genomic selection require genome-wide genotype data. All existing technologies used to create these data result in missing genotypes, which are often then inferred using genotype imputation software. However, existing imputation methods most often make use only of genotypes that are successfully inferred after having passed a certain read depth threshold. Because of this, any read information for genotypes that did not pass the threshold, and were thus set to missing, is ignored. Most genomic studies also choose read depth thresholds and quality filters without investigating their effects on the size and quality of the resulting genotype data. Moreover, almost all genotype imputation methods require ordered markers and are therefore of limited utility in non-model organisms. RESULTS:Here we introduce LinkImputeR, a software program that exploits the read count information that is normally ignored, and makes use of all available DNA sequence information for the purposes of genotype calling and imputation. It is specifically designed for non-model organisms since it requires neither ordered markers nor a reference panel of genotypes. Using next-generation DNA sequence (NGS) data from apple, cannabis and grape, we quantify the effect of varying read count and missingness thresholds on the quantity and quality of genotypes generated from LinkImputeR. We demonstrate that LinkImputeR can increase the number of genotype calls by more than an order of magnitude, can improve genotyping accuracy by several percent and can thus improve the power of downstream analyses. Moreover, we show that the effects of quality and read depth filters can differ substantially between data sets and should therefore be investigated on a per-study basis. CONCLUSIONS:By exploiting DNA sequence data that is normally ignored during genotype calling and imputation, LinkImputeR can significantly improve both the quantity and quality of genotype data generated from NGS technologies. It enables the user to quickly and easily examine the effects of varying thresholds and filters on the number and quality of the resulting genotype calls. In this manner, users can decide on thresholds that are most suitable for their purposes. We show that LinkImputeR can significantly augment the value and utility of NGS data sets, especially in non-model organisms with poor genomic resources.

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Money D,Migicovsky Z,Gardner K,Myles S

doi

10.1186/s12864-017-3873-5

subject

Has Abstract

pub_date

2017-07-10 00:00:00

pages

523

issue

1

issn

1471-2164

pii

10.1186/s12864-017-3873-5

journal_volume

18

pub_type

杂志文章
  • Identification of transcription factor genes involved in anthocyanin biosynthesis in carrot (Daucus carota L.) using RNA-Seq.

    abstract:BACKGROUND:Anthocyanins are water-soluble colored flavonoids present in multiple organs of various plant species including flowers, fruits, leaves, stems and roots. DNA-binding R2R3-MYB transcription factors, basic helix-loop-helix (bHLH) transcription factors, and WD40 repeat proteins are known to form MYB-bHLH-WD rep...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-5135-6

    authors: Kodama M,Brinch-Pedersen H,Sharma S,Holme IB,Joernsgaard B,Dzhanfezova T,Amby DB,Vieira FG,Liu S,Gilbert MTP

    更新日期:2018-11-08 00:00:00

  • GPR99, a new G protein-coupled receptor with homology to a new subgroup of nucleotide receptors.

    abstract:BACKGROUND:Based on sequence similarity, the superfamily of G protein-coupled receptors (GPRs) can be subdivided into several subfamilies, the members of which often share similar ligands. The sequence data provided by the human genome project allows us to identify new GPRs by in silico homology screening, and to predi...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-3-17

    authors: Wittenberger T,Hellebrand S,Munck A,Kreienkamp HJ,Schaller HC,Hampe W

    更新日期:2002-07-05 00:00:00

  • Construction of a high-density, high-quality genetic map of cultivated lotus (Nelumbo nucifera) using next-generation sequencing.

    abstract:BACKGROUND:The sacred lotus (Nelumbo nucifera) is widely cultivated in China for its edible rhizomes and seeds. Traditional plant breeding methods have been used to breed cultivars with increased yields and quality of rhizomes and seeds with limited success. Currently, the available genetic maps and molecular markers i...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2781-4

    authors: Liu Z,Zhu H,Liu Y,Kuang J,Zhou K,Liang F,Liu Z,Wang D,Ke W

    更新日期:2016-06-17 00:00:00

  • MicroRNA expression profiling of the fifth-instar posterior silk gland of Bombyx mori.

    abstract:BACKGROUND:The growth and development of the posterior silk gland and the biosynthesis of the silk core protein at the fifth larval instar stage of Bombyx mori are of paramount importance for silk production. RESULTS:Here, aided by next-generation sequencing and microarry assay, we profile 1,229 microRNAs (miRNAs), in...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-410

    authors: Li J,Cai Y,Ye L,Wang S,Che J,You Z,Yu J,Zhong B

    更新日期:2014-05-29 00:00:00

  • Systems perspectives on erythromycin biosynthesis by comparative genomic and transcriptomic analyses of S. erythraea E3 and NRRL23338 strains.

    abstract:BACKGROUND:S. erythraea is a Gram-positive filamentous bacterium used for the industrial-scale production of erythromycin A which is of high clinical importance. In this work, we sequenced the whole genome of a high-producing strain (E3) obtained by random mutagenesis and screening from the wild-type strain NRRL23338, ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-523

    authors: Li YY,Chang X,Yu WB,Li H,Ye ZQ,Yu H,Liu BH,Zhang Y,Zhang SL,Ye BC,Li YX

    更新日期:2013-07-31 00:00:00

  • Identification and analysis of long non-coding RNAs and mRNAs in chicken macrophages infected with avian infectious bronchitis coronavirus.

    abstract:BACKGROUND:Avian infectious bronchitis virus (IBV) is a gamma coronavirus that severely affects the poultry industry worldwide. Long non-coding RNAs (lncRNAs), a subset of non-coding RNAs with a length of more than 200 nucleotides, have been recently recognized as pivotal factors in the pathogenesis of viral infections...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-07359-3

    authors: Li H,Cui P,Fu X,Zhang L,Yan W,Zhai Y,Lei C,Wang H,Yang X

    更新日期:2021-01-20 00:00:00

  • Single-cell RNA sequencing reveals dynamic changes in A-to-I RNA editome during early human embryogenesis.

    abstract:BACKGROUND:A-to-I RNA-editing mediated by ADAR (adenosine deaminase acting on RNA) enzymes that converts adenosine to inosine in RNA sequence can generate mutations and alter gene regulation in metazoans. Previous studies have shown that A-to-I RNA-editing plays vital roles in mouse embryogenesis. However, the RNA-edit...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3115-2

    authors: Qiu S,Li W,Xiong H,Liu D,Bai Y,Wu K,Zhang X,Yang H,Ma K,Hou Y,Li B

    更新日期:2016-09-29 00:00:00

  • How Athila retrotransposons survive in the Arabidopsis genome.

    abstract:BACKGROUND:Transposable elements are selfish genetic sequences which only occasionally provide useful functions to their host species. In addition, models of mobile element evolution assume a second type of selfishness: elements of different families do not cooperate, but they independently fight for their survival in ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-219

    authors: Marco A,Marín I

    更新日期:2008-05-14 00:00:00

  • Transcriptional regulation of gene expression clusters in motor neurons following spinal cord injury.

    abstract:BACKGROUND:Spinal cord injury leads to neurological dysfunctions affecting the motor, sensory as well as the autonomic systems. Increased excitability of motor neurons has been implicated in injury-induced spasticity, where the reappearance of self-sustained plateau potentials in the absence of modulatory inputs from t...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-365

    authors: Ryge J,Winther O,Wienecke J,Sandelin A,Westerdahl AC,Hultborn H,Kiehn O

    更新日期:2010-06-09 00:00:00

  • Multi-omics analysis reveals regulators of the response to nitrogen limitation in Yarrowia lipolytica.

    abstract:BACKGROUND:Yarrowia lipolytica is an oleaginous ascomycete yeast that stores lipids in response to limitation of nitrogen. While the enzymatic pathways responsible for neutral lipid accumulation in Y. lipolytica are well characterized, regulation of these pathways has received little attention. We therefore sought to c...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2471-2

    authors: Pomraning KR,Kim YM,Nicora CD,Chu RK,Bredeweg EL,Purvine SO,Hu D,Metz TO,Baker SE

    更新日期:2016-02-25 00:00:00

  • Conditional entropy in variation-adjusted windows detects selection signatures associated with expression quantitative trait loci (eQTLs).

    abstract:BACKGROUND:Over the past 50,000 years, shifts in human-environmental or human-human interactions shaped genetic differences within and among human populations, including variants under positive selection. Shaped by environmental factors, such variants influence the genetics of modern health, disease, and treatment outc...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-16-S8-S8

    authors: Handelman SK,Seweryn M,Smith RM,Hartmann K,Wang D,Pietrzak M,Johnson AD,Kloczkowski A,Sadee W

    更新日期:2015-01-01 00:00:00

  • Treatment-independent miRNA signature in blood of Wilms tumor patients.

    abstract:BACKGROUND:Blood-born miRNA signatures have recently been reported for various tumor diseases. Here, we compared the miRNA signature in Wilms tumor patients prior and after preoperative chemotherapy according to SIOP protocol 2001. RESULTS:We did not find a significant difference between miRNA signature of both groups...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-379

    authors: Schmitt J,Backes C,Nourkami-Tutdibi N,Leidinger P,Deutscher S,Beier M,Gessler M,Graf N,Lenhof HP,Keller A,Meese E

    更新日期:2012-08-07 00:00:00

  • SMaSH: Sample matching using SNPs in humans.

    abstract:BACKGROUND:Inadvertent sample swaps are a real threat to data quality in any medium to large scale omics studies. While matches between samples from the same individual can in principle be identified from a few well characterized single nucleotide polymorphisms (SNPs), omics data types often only provide low to moderat...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-6332-7

    authors: Westphal M,Frankhouser D,Sonzone C,Shields PG,Yan P,Bundschuh R

    更新日期:2019-12-30 00:00:00

  • Evaluation and validation of a robust single cell RNA-amplification protocol through transcriptional profiling of enriched lung cancer initiating cells.

    abstract:BACKGROUND:Although profiling of RNA in single cells has broadened our understanding of development, cancer biology and mechanisms of disease dissemination, it requires the development of reliable and flexible methods. Here we demonstrate that the EpiStem RNA-Amp™ methodology reproducibly generates microgram amounts of...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-1129

    authors: Rothwell DG,Li Y,Ayub M,Tate C,Newton G,Hey Y,Carter L,Faulkner S,Moro M,Pepper S,Miller C,Blackhall F,Bertolini G,Roz L,Dive C,Brady G

    更新日期:2014-12-17 00:00:00

  • RNA sequencing identifies common pathways between cigarette smoke exposure and replicative senescence in human airway epithelia.

    abstract:BACKGROUND:Aging is affected by genetic and environmental factors, and cigarette smoking is strongly associated with accumulation of senescent cells. In this study, we wanted to identify genes that may potentially be beneficial for cell survival in response to cigarette smoke and thereby may contribute to development o...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-5409-z

    authors: Voic H,Li X,Jang JH,Zou C,Sundd P,Alder J,Rojas M,Chandra D,Randell S,Mallampalli RK,Tesfaigzi Y,Ryba T,Nyunoya T

    更新日期:2019-01-09 00:00:00

  • Proteome dynamics and early salt stress response of the photosynthetic organism Chlamydomonas reinhardtii.

    abstract:BACKGROUND:The cellular proteome and metabolome are underlying dynamic regulation allowing rapid adaptation to changes in the environment. System-wide analysis of these dynamics will provide novel insights into mechanisms of stress adaptation for higher photosynthetic organisms. We applied pulsed-SILAC labeling to a ph...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-215

    authors: Mastrobuoni G,Irgang S,Pietzke M,Assmus HE,Wenzel M,Schulze WX,Kempa S

    更新日期:2012-05-31 00:00:00

  • Thrifty metabolic programming in rats is induced by both maternal undernutrition and postnatal leptin treatment, but masked in the presence of both: implications for models of developmental programming.

    abstract:BACKGROUND:Maternal undernutrition leads to an increased risk of metabolic disorders in offspring including obesity and insulin resistance, thought to be due to a programmed thrifty phenotype which is inappropriate for a subsequent richer nutritional environment. In a rat model, both male and female offspring of undern...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-49

    authors: Ellis PJ,Morris TJ,Skinner BM,Sargent CA,Vickers MH,Gluckman PD,Gilmour S,Affara NA

    更新日期:2014-01-21 00:00:00

  • De novo assembly and characterization of fruit transcriptome in Litchi chinensis Sonn and analysis of differentially regulated genes in fruit in response to shading.

    abstract:BACKGROUND:Litchi (Litchi chinensis Sonn.) is one of the most important fruit trees cultivated in tropical and subtropical areas. However, a lack of transcriptomic and genomic information hinders our understanding of the molecular mechanisms underlying fruit set and fruit development in litchi. Shading during early fru...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-552

    authors: Li C,Wang Y,Huang X,Li J,Wang H,Li J

    更新日期:2013-08-14 00:00:00

  • Analysis of 4,664 high-quality sequence-finished poplar full-length cDNA clones and their utility for the discovery of genes responding to insect feeding.

    abstract:BACKGROUND:The genus Populus includes poplars, aspens and cottonwoods, which will be collectively referred to as poplars hereafter unless otherwise specified. Poplars are the dominant tree species in many forest ecosystems in the Northern Hemisphere and are of substantial economic value in plantation forestry. Poplar h...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-57

    authors: Ralph SG,Chun HJ,Cooper D,Kirkpatrick R,Kolosova N,Gunter L,Tuskan GA,Douglas CJ,Holt RA,Jones SJ,Marra MA,Bohlmann J

    更新日期:2008-01-29 00:00:00

  • TREC-IN: gene knock-in genetic tool for genomes cloned in yeast.

    abstract:BACKGROUND:With the development of several new technologies using synthetic biology, it is possible to engineer genetically intractable organisms including Mycoplasma mycoides subspecies capri (Mmc), by cloning the intact bacterial genome in yeast, using the host yeast's genetic tools to modify the cloned genome, and s...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-1180

    authors: Chandran S,Noskov VN,Segall-Shapiro TH,Ma L,Whiteis C,Lartigue C,Jores J,Vashee S,Chuang RY

    更新日期:2014-12-24 00:00:00

  • Delineation of condition specific Cis- and Trans-acting elements in plant promoters under various Endo- and exogenous stimuli.

    abstract:BACKGROUND:Transcription factors (TFs) play essential roles during plant development and response to environmental stresses. However, the relationships among transcription factors, cis-acting elements and target gene expression under endo- and exogenous stimuli have not been systematically characterized. RESULTS:Here,...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-4469-4

    authors: Chow CN,Chiang-Hsieh YF,Chien CH,Zheng HQ,Lee TY,Wu NY,Tseng KC,Hou PF,Chang WC

    更新日期:2018-05-09 00:00:00

  • RNA-seq analysis provides insights into cold stress responses of Xanthomonas citri pv. citri.

    abstract:BACKGROUND:Xanthomonas citri pv. citri (Xcc) is a citrus canker causing Gram-negative bacteria. Currently, little is known about the biological and molecular responses of Xcc to low temperatures. RESULTS:Results depicted that low temperature significantly reduced growth and increased biofilm formation and unsaturated ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-6193-0

    authors: Liao JX,Li KH,Wang JP,Deng JR,Liu QG,Chang CQ

    更新日期:2019-11-06 00:00:00

  • Combinatorial control of temporal gene expression in the Drosophila wing by enhancers and core promoters.

    abstract:BACKGROUND:The transformation of a developing epithelium into an adult structure is a complex process, which often involves coordinated changes in cell proliferation, metabolism, adhesion, and shape. To identify genetic mechanisms that control epithelial differentiation, we analyzed the temporal patterns of gene expres...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-498

    authors: O'Keefe DD,Thomas SR,Bolin K,Griggs E,Edgar BA,Buttitta LA

    更新日期:2012-09-20 00:00:00

  • The complete and fully assembled genome sequence of Aeromonas salmonicida subsp. pectinolytica and its comparative analysis with other Aeromonas species: investigation of the mobilome in environmental and pathogenic strains.

    abstract:BACKGROUND:Due to the predominant usage of short-read sequencing to date, most bacterial genome sequences reported in the last years remain at the draft level. This precludes certain types of analyses, such as the in-depth analysis of genome plasticity. RESULTS:Here we report the finalized genome sequence of the envir...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-4301-6

    authors: Pfeiffer F,Zamora-Lagos MA,Blettinger M,Yeroslaviz A,Dahl A,Gruber S,Habermann BH

    更新日期:2018-01-05 00:00:00

  • Global transcriptional profiling reveals Streptococcus agalactiae genes controlled by the MtaR transcription factor.

    abstract:BACKGROUND:Streptococcus agalactiae (group B Streptococcus; GBS) is a significant bacterial pathogen of neonates and an emerging pathogen of adults. Though transcriptional regulators are abundantly encoded on the GBS genome, their role in GBS pathogenesis is poorly understood. The mtaR gene encodes a putative LysR-type...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-607

    authors: Bryan JD,Liles R,Cvek U,Trutschl M,Shelver D

    更新日期:2008-12-16 00:00:00

  • Spontaneous preterm birth and single nucleotide gene polymorphisms: a recent update.

    abstract:BACKGROUND:Preterm birth (PTB), birth at <37 weeks of gestation, is a significant global public health problem. World-wide, about 15 million babies are born preterm each year resulting in more than a million deaths of children. Preterm neonates are more prone to problems and need intensive care hospitalization. Health ...

    journal_title:BMC genomics

    pub_type: 杂志文章,评审

    doi:10.1186/s12864-016-3089-0

    authors: Sheikh IA,Ahmad E,Jamal MS,Rehan M,Assidi M,Tayubi IA,AlBasri SF,Bajouh OS,Turki RF,Abuzenadah AM,Damanhouri GA,Beg MA,Al-Qahtani M

    更新日期:2016-10-17 00:00:00

  • Genome-wide host responses against infectious laryngotracheitis virus vaccine infection in chicken embryo lung cells.

    abstract:BACKGROUND:Infectious laryngotracheitis virus (ILTV; gallid herpesvirus 1) infection causes high mortality and huge economic losses in the poultry industry. To protect chickens against ILTV infection, chicken-embryo origin (CEO) and tissue-culture origin (TCO) vaccines have been used. However, the transmission of vacci...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-143

    authors: Lee J,Bottje WG,Kong BW

    更新日期:2012-04-24 00:00:00

  • Arsenic-induced changes in the gene expression of lung epithelial L2 cells: implications in carcinogenesis.

    abstract:BACKGROUND:Arsenic is a carcinogen that is known to induce cell transformation and tumor formation. Although studies have been performed to examine the modulation of signaling molecules caused by arsenic exposure, the molecular mechanisms by which arsenic causes cancer are still unclear. We hypothesized that arsenic al...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-115

    authors: Posey T,Weng T,Chen Z,Chintagari NR,Wang P,Jin N,Stricker H,Liu L

    更新日期:2008-03-03 00:00:00

  • Assessing structural variation in a personal genome-towards a human reference diploid genome.

    abstract:BACKGROUND:Characterizing large genomic variants is essential to expanding the research and clinical applications of genome sequencing. While multiple data types and methods are available to detect these structural variants (SVs), they remain less characterized than smaller variants because of SV diversity, complexity,...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1479-3

    authors: English AC,Salerno WJ,Hampton OA,Gonzaga-Jauregui C,Ambreth S,Ritter DI,Beck CR,Davis CF,Dahdouli M,Ma S,Carroll A,Veeraraghavan N,Bruestle J,Drees B,Hastie A,Lam ET,White S,Mishra P,Wang M,Han Y,Zhang F,Stankie

    更新日期:2015-04-11 00:00:00

  • Comparing de novo assemblers for 454 transcriptome data.

    abstract:BACKGROUND:Roche 454 pyrosequencing has become a method of choice for generating transcriptome data from non-model organisms. Once the tens to hundreds of thousands of short (250-450 base) reads have been produced, it is important to correctly assemble these to estimate the sequence of all the transcripts. Most transcr...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-571

    authors: Kumar S,Blaxter ML

    更新日期:2010-10-16 00:00:00