Evaluation of variant identification methods for whole genome sequencing data in dairy cattle.

Abstract:

BACKGROUND:Advances in human genomics have allowed unprecedented productivity in terms of algorithms, software, and literature available for translating raw next-generation sequence data into high-quality information. The challenges of variant identification in organisms with lower quality reference genomes are less well documented. We explored the consequences of commonly recommended preparatory steps and the effects of single and multi sample variant identification methods using four publicly available software applications (Platypus, HaplotypeCaller, Samtools and UnifiedGenotyper) on whole genome sequence data of 65 key ancestors of Swiss dairy cattle populations. Accuracy of calling next-generation sequence variants was assessed by comparison to the same loci from medium and high-density single nucleotide variant (SNV) arrays. RESULTS:The total number of SNVs identified varied by software and method, with single (multi) sample results ranging from 17.7 to 22.0 (16.9 to 22.0) million variants. Computing time varied considerably between software. Preparatory realignment of insertions and deletions and subsequent base quality score recalibration had only minor effects on the number and quality of SNVs identified by different software, but increased computing time considerably. Average concordance for single (multi) sample results with high-density chip data was 58.3% (87.0%) and average genotype concordance in correctly identified SNVs was 99.2% (99.2%) across software. The average quality of SNVs identified, measured as the ratio of transitions to transversions, was higher using single sample methods than multi sample methods. A consensus approach using results of different software generally provided the highest variant quality in terms of transition/transversion ratio. CONCLUSIONS:Our findings serve as a reference for variant identification pipeline development in non-human organisms and help assess the implication of preparatory steps in next-generation sequencing pipelines for organisms with incomplete reference genomes (pipeline code is included). Benchmarking this information should prove particularly useful in processing next-generation sequencing data for use in genome-wide association studies and genomic selection.

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Baes CF,Dolezal MA,Koltes JE,Bapst B,Fritz-Waters E,Jansen S,Flury C,Signer-Hasler H,Stricker C,Fernando R,Fries R,Moll J,Garrick DJ,Reecy JM,Gredler B

doi

10.1186/1471-2164-15-948

subject

Has Abstract

pub_date

2014-11-01 00:00:00

pages

948

issn

1471-2164

pii

1471-2164-15-948

journal_volume

15

pub_type

杂志文章
  • Obesity-related known and candidate SNP markers can significantly change affinity of TATA-binding protein for human gene promoters.

    abstract:BACKGROUND:Obesity affects quality of life and life expectancy and is associated with cardiovascular disorders, cancer, diabetes, reproductive disorders in women, prostate diseases in men, and congenital anomalies in children. The use of single nucleotide polymorphism (SNP) markers of diseases and drug responses (i.e.,...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-16-S13-S5

    authors: Arkova OV,Ponomarenko MP,Rasskazov DA,Drachkova IA,Arshinova TV,Ponomarenko PM,Savinkova LK,Kolchanov NA

    更新日期:2015-01-01 00:00:00

  • Microcollinearity between autopolyploid sugarcane and diploid sorghum genomes.

    abstract:BACKGROUND:Sugarcane (Saccharum spp.) has become an increasingly important crop for its leading role in biofuel production. The high sugar content species S. officinarum is an octoploid without known diploid or tetraploid progenitors. Commercial sugarcane cultivars are hybrids between S. officinarum and wild species S....

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-261

    authors: Wang J,Roe B,Macmil S,Yu Q,Murray JE,Tang H,Chen C,Najar F,Wiley G,Bowers J,Van Sluys MA,Rokhsar DS,Hudson ME,Moose SP,Paterson AH,Ming R

    更新日期:2010-04-23 00:00:00

  • Transcriptional profiling of liver during the critical embryo-to-hatchling transition period in the chicken (Gallus gallus).

    abstract:BACKGROUND:Although hatching is perhaps the most abrupt and profound metabolic challenge that a chicken must undergo; there have been no attempts to functionally map the metabolic pathways induced in liver during the embryo-to-hatchling transition. Furthermore, we know very little about the metabolic and regulatory fac...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-5080-4

    authors: Cogburn LA,Trakooljul N,Chen C,Huang H,Wu CH,Carré W,Wang X,White HB 3rd

    更新日期:2018-09-21 00:00:00

  • Transcriptome sequencing reveals genome-wide variation in molecular evolutionary rate among ferns.

    abstract:BACKGROUND:Transcriptomics in non-model plant systems has recently reached a point where the examination of nuclear genome-wide patterns in understudied groups is an achievable reality. This progress is especially notable in evolutionary studies of ferns, for which molecular resources to date have been derived primaril...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3034-2

    authors: Grusz AL,Rothfels CJ,Schuettpelz E

    更新日期:2016-08-30 00:00:00

  • Urinary proteomic and non-prefractionation quantitative phosphoproteomic analysis during pregnancy and non-pregnancy.

    abstract:BACKGROUND:Progress in the fields of protein separation and identification technologies has accelerated research into biofluids proteomics for protein biomarker discovery. Urine has become an ideal and rich source of biomarkers in clinical proteomics. Here we performed a proteomic analysis of urine samples from pregnan...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-777

    authors: Zheng J,Liu L,Wang J,Jin Q

    更新日期:2013-11-11 00:00:00

  • Phylogenetic reconstruction from transpositions.

    abstract:BACKGROUND:Because of the advent of high-throughput sequencing and the consequent reduction in the cost of sequencing, many organisms have been completely sequenced and most of their genes identified. It thus has become possible to represent whole genomes as ordered lists of gene identifiers and to study the rearrangem...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-S2-S15

    authors: Yue F,Zhang M,Tang J

    更新日期:2008-09-16 00:00:00

  • A platform independent RNA-Seq protocol for the detection of transcriptome complexity.

    abstract:BACKGROUND:Recent studies have demonstrated an unexpected complexity of transcription in eukaryotes. The majority of the genome is transcribed and only a little fraction of these transcripts is annotated as protein coding genes and their splice variants. Indeed, most transcripts are the result of antisense, overlapping...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-855

    authors: Calabrese C,Mangiulli M,Manzari C,Paluscio AM,Caratozzolo MF,Marzano F,Kurelac I,D'Erchia AM,D'Elia D,Licciulli F,Liuni S,Picardi E,Attimonelli M,Gasparre G,Porcelli AM,Pesole G,Sbisà E,Tullo A

    更新日期:2013-12-05 00:00:00

  • Protein kinases of the human malaria parasite Plasmodium falciparum: the kinome of a divergent eukaryote.

    abstract:BACKGROUND:Malaria, caused by the parasitic protist Plasmodium falciparum, represents a major public health problem in the developing world. The P. falciparum genome has been sequenced, which provides new opportunities for the identification of novel drug targets. Eukaryotic protein kinases (ePKs) form a large family o...

    journal_title:BMC genomics

    pub_type: 杂志文章,评审

    doi:10.1186/1471-2164-5-79

    authors: Ward P,Equinet L,Packer J,Doerig C

    更新日期:2004-10-12 00:00:00

  • Gene expression changes during caste-specific neuronal development in the damp-wood termite Hodotermopsis sjostedti.

    abstract:BACKGROUND:One of the key characters of social insects is the division of labor, in which different tasks are allocated to various castes. In termites, one of the representative groups of social insects, morphological differences as well as behavioral differences can be recognized among castes. However, very little is ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-314

    authors: Ishikawa Y,Okada Y,Ishikawa A,Miyakawa H,Koshikawa S,Miura T

    更新日期:2010-05-20 00:00:00

  • Identification of transcription factor genes involved in anthocyanin biosynthesis in carrot (Daucus carota L.) using RNA-Seq.

    abstract:BACKGROUND:Anthocyanins are water-soluble colored flavonoids present in multiple organs of various plant species including flowers, fruits, leaves, stems and roots. DNA-binding R2R3-MYB transcription factors, basic helix-loop-helix (bHLH) transcription factors, and WD40 repeat proteins are known to form MYB-bHLH-WD rep...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-5135-6

    authors: Kodama M,Brinch-Pedersen H,Sharma S,Holme IB,Joernsgaard B,Dzhanfezova T,Amby DB,Vieira FG,Liu S,Gilbert MTP

    更新日期:2018-11-08 00:00:00

  • Early transcriptional responses of internalization defective Brucella abortus mutants in professional phagocytes, RAW 264.7.

    abstract:BACKGROUND:Brucella abortus is an intracellular zoonotic pathogen which causes undulant fever, endocarditis, arthritis and osteomyelitis in human and abortion and infertility in cattle. This bacterium is able to invade and replicate in host macrophage instead of getting removed by this defense mechanism. Therefore, und...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-426

    authors: Cha SB,Lee WJ,Shin MK,Jung MH,Shin SW,Yoo AN,Kim JW,Yoo HS

    更新日期:2013-06-27 00:00:00

  • Comparison of feature selection and classification for MALDI-MS data.

    abstract:INTRODUCTION:In the classification of Mass Spectrometry (MS) proteomics data, peak detection, feature selection, and learning classifiers are critical to classification accuracy. To better understand which methods are more accurate when classifying data, some publicly available peak detection algorithms for Matrix assi...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-10-S1-S3

    authors: Liu Q,Sung AH,Qiao M,Chen Z,Yang JY,Yang MQ,Huang X,Deng Y

    更新日期:2009-07-07 00:00:00

  • Changes in Bacillus anthracis CodY regulation under host-specific environmental factor deprived conditions.

    abstract:BACKGROUND:Host-specific environmental factors induce changes in Bacillus anthracis gene transcription during infection. A global transcription regulator, CodY, plays a pivotal role in regulating central metabolism, biosynthesis, and virulence in B. anthracis. In this study, we utilized RNA-sequencing to assess changes...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3004-8

    authors: Kim SK,Jung KH,Chai YG

    更新日期:2016-08-17 00:00:00

  • Transcriptomic analysis of flower induction for long-day pitaya by supplementary lighting in short-day winter season.

    abstract:BACKGROUND:Pitayas are currently attracting considerable interest as a tropical fruit with numerous health benefits. However, as a long-day plant, pitaya plants cannot flower in the winter season from November to April in Hainan, China. To harvest pitayas with high economic value in the winter season, it is necessary t...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-6726-6

    authors: Xiong R,Liu C,Xu M,Wei SS,Huang JQ,Tang H

    更新日期:2020-04-29 00:00:00

  • Extensive structural variations between mitochondrial genomes of CMS and normal peppers (Capsicum annuum L.) revealed by complete nucleotide sequencing.

    abstract:BACKGROUND:Cytoplasmic male sterility (CMS) is an inability to produce functional pollen that is caused by mutation of the mitochondrial genome. Comparative analyses of mitochondrial genomes of lines with and without CMS in several species have revealed structural differences between genomes, including extensive rearra...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-561

    authors: Jo YD,Choi Y,Kim DH,Kim BD,Kang BC

    更新日期:2014-07-04 00:00:00

  • Transcriptomic and proteomic analyses of a new cytoplasmic male sterile line with a wild Gossypium bickii genetic background.

    abstract:BACKGROUND:Cotton is an important fiber crop but has serious heterosis effects, and cytoplasmic male sterility (CMS) is the major cause of heterosis in plants. However, to the best of our knowledge, no studies have investigated CMS Yamian A in cotton with the genetic background of Australian wild Gossypium bickii. Conj...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-07261-y

    authors: Zhao H,Wang J,Qu Y,Peng R,Magwanga RO,Liu F,Huang J

    更新日期:2020-12-02 00:00:00

  • Comparison of different assembly and annotation tools on analysis of simulated viral metagenomic communities in the gut.

    abstract:BACKGROUND:The main limitations in the analysis of viral metagenomes are perhaps the high genetic variability and the lack of information in extant databases. To address these issues, several bioinformatic tools have been specifically designed or adapted for metagenomics by improving read assembly and creating more sen...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-37

    authors: Vázquez-Castellanos JF,García-López R,Pérez-Brocal V,Pignatelli M,Moya A

    更新日期:2014-01-18 00:00:00

  • The genomic architecture and association genetics of adaptive characters using a candidate SNP approach in boreal black spruce.

    abstract:BACKGROUND:The genomic architecture of adaptive traits remains poorly understood in non-model plants. Various approaches can be used to bridge this gap, including the mapping of quantitative trait loci (QTL) in pedigrees, and genetic association studies in non-structured populations. Here we present results on the geno...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-368

    authors: Prunier J,Pelgas B,Gagnon F,Desponts M,Isabel N,Beaulieu J,Bousquet J

    更新日期:2013-06-01 00:00:00

  • LPS-treatment of bovine endometrial epithelial cells causes differential DNA methylation of genes associated with inflammation and endometrial function.

    abstract:BACKGROUND:Lipopolysaccharide (LPS) endotoxin stimulates pro-inflammatory pathways and is a key player in the pathological mechanisms involved in the development of endometritis. This study aimed to investigate LPS-induced DNA methylation changes in bovine endometrial epithelial cells (bEECs), which may affect endometr...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-06777-7

    authors: Jhamat N,Niazi A,Guo Y,Chanrot M,Ivanova E,Kelsey G,Bongcam-Rudloff E,Andersson G,Humblot P

    更新日期:2020-06-03 00:00:00

  • RNA-seq analysis reveals genetic response and tolerance mechanisms to ozone exposure in soybean.

    abstract:BACKGROUND:Oxidative stress caused by ground level ozone is a contributor to yield loss in a number of important crop plants. Soybean (Glycine max) is considered to be ozone sensitive, and current research into its response to oxidative stress is limited. To better understand the genetic response in soybean to oxidativ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1637-7

    authors: Whaley A,Sheridan J,Safari S,Burton A,Burkey K,Schlueter J

    更新日期:2015-06-04 00:00:00

  • Dual transcriptional profiling of mice and Toxoplasma gondii during acute and chronic infection.

    abstract:BACKGROUND:The obligate intracellular parasite Toxoplasma gondii establishes a life-long chronic infection within any warm-blooded host. After ingestion of an encysted parasite, T. gondii disseminates throughout the body as a rapidly replicating form during acute infection. Over time and after stimulation of the host i...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-806

    authors: Pittman KJ,Aliota MT,Knoll LJ

    更新日期:2014-09-20 00:00:00

  • Gene expression profiles at different stages for formation of pearl sac and pearl in the pearl oyster Pinctada fucata.

    abstract:BACKGROUND:The most critical step in the pearl formation during aquaculture is issued to the proliferation and differentiation of outer epithelial cells of mantle graft into pearl sac. This pearl sac secretes various matrix proteins to produce pearls by a complex physiological process which has not been well-understood...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5579-3

    authors: Mariom,Take S,Igarashi Y,Yoshitake K,Asakawa S,Maeyama K,Nagai K,Watabe S,Kinoshita S

    更新日期:2019-03-25 00:00:00

  • Prioritizing disease candidate genes by a gene interconnectedness-based approach.

    abstract:BACKGROUND:Genome-wide disease-gene finding approaches may sometimes provide us with a long list of candidate genes. Since using pure experimental approaches to verify all candidates could be expensive, a number of network-based methods have been developed to prioritize candidates. Such tools usually have a set of para...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-S3-S25

    authors: Hsu CL,Huang YH,Hsu CT,Yang UC

    更新日期:2011-11-30 00:00:00

  • The complete and fully assembled genome sequence of Aeromonas salmonicida subsp. pectinolytica and its comparative analysis with other Aeromonas species: investigation of the mobilome in environmental and pathogenic strains.

    abstract:BACKGROUND:Due to the predominant usage of short-read sequencing to date, most bacterial genome sequences reported in the last years remain at the draft level. This precludes certain types of analyses, such as the in-depth analysis of genome plasticity. RESULTS:Here we report the finalized genome sequence of the envir...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-4301-6

    authors: Pfeiffer F,Zamora-Lagos MA,Blettinger M,Yeroslaviz A,Dahl A,Gruber S,Habermann BH

    更新日期:2018-01-05 00:00:00

  • Activation of metabolic and stress responses during subtoxic expression of the type I toxin hok in Erwinia amylovora.

    abstract:BACKGROUND:Toxin-antitoxin (TA) systems, abundant in prokaryotes, are composed of a toxin gene and its cognate antitoxin. Several toxins are implied to affect the physiological state and stress tolerance of bacteria in a population. We previously identified a chromosomally encoded hok-sok type I TA system in Erwinia am...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-021-07376-w

    authors: Peng J,Triplett LR,Sundin GW

    更新日期:2021-01-22 00:00:00

  • Novel Moraxella catarrhalis prophages display hyperconserved non-structural genes despite their genomic diversity.

    abstract:BACKGROUND:Moraxella catarrhalis is an important pathogen that often causes otitis media in children, a disease that is not currently vaccine preventable. Asymptomatic colonisation of the human upper respiratory tract is common and lack of clearance by the immune system is likely due to the emergence of seroresistant g...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-2104-1

    authors: Ariff A,Wise MJ,Kahler CM,Tay CY,Peters F,Perkins TT,Chang BJ

    更新日期:2015-10-24 00:00:00

  • Identification of sensory hair-cell transcripts by thiouracil-tagging in zebrafish.

    abstract:BACKGROUND:Sensory hair cells are exquisitely sensitive to mechanical stimuli and as such, are prone to damage and apoptosis during dissections or in vitro manipulations. Thiouracil (TU)-tagging is a noninvasive method to label cell type-specific transcripts in an intact organism, thereby meeting the challenge of how t...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-2072-5

    authors: Erickson T,Nicolson T

    更新日期:2015-10-23 00:00:00

  • Genomic arrangement of salinity tolerance QTLs in salmonids: a comparative analysis of Atlantic salmon (Salmo salar) with Arctic charr (Salvelinus alpinus) and rainbow trout (Oncorhynchus mykiss).

    abstract:BACKGROUND:Quantitative trait locus (QTL) studies show that variation in salinity tolerance in Arctic charr and rainbow trout has a genetic basis, even though both these species have low to moderate salinity tolerance capacities. QTL were observed to localize to homologous linkage group segments within putative chromos...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-420

    authors: Norman JD,Robinson M,Glebe B,Ferguson MM,Danzmann RG

    更新日期:2012-08-24 00:00:00

  • Nonlinear transcriptomic response to dietary fat intake in the small intestine of C57BL/6J mice.

    abstract:BACKGROUND:A high caloric diet, in conjunction with low levels of physical activity, promotes obesity. Many studies are available regarding the relation between dietary saturated fats and the etiology of obesity, but most focus on liver, muscle and white adipose tissue. Furthermore, the majority of transcriptomic studi...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2424-9

    authors: Nyima T,Müller M,Hooiveld GJ,Morine MJ,Scotti M

    更新日期:2016-02-09 00:00:00

  • Metabolic modeling and analysis of the metabolic switch in Streptomyces coelicolor.

    abstract:BACKGROUND:The transition from exponential to stationary phase in Streptomyces coelicolor is accompanied by a major metabolic switch and results in a strong activation of secondary metabolism. Here we have explored the underlying reorganization of the metabolome by combining computational predictions based on constrain...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-202

    authors: Alam MT,Merlo ME,STREAM Consortium.,Hodgson DA,Wellington EM,Takano E,Breitling R

    更新日期:2010-03-26 00:00:00