A hybrid expectation maximisation and MCMC sampling algorithm to implement Bayesian mixture model based genomic prediction and QTL mapping.

Abstract:

BACKGROUND:Bayesian mixture models in which the effects of SNP are assumed to come from normal distributions with different variances are attractive for simultaneous genomic prediction and QTL mapping. These models are usually implemented with Monte Carlo Markov Chain (MCMC) sampling, which requires long compute times with large genomic data sets. Here, we present an efficient approach (termed HyB_BR), which is a hybrid of an Expectation-Maximisation algorithm, followed by a limited number of MCMC without the requirement for burn-in. RESULTS:To test prediction accuracy from HyB_BR, dairy cattle and human disease trait data were used. In the dairy cattle data, there were four quantitative traits (milk volume, protein kg, fat% in milk and fertility) measured in 16,214 cattle from two breeds genotyped for 632,002 SNPs. Validation of genomic predictions was in a subset of cattle either from the reference set or in animals from a third breeds that were not in the reference set. In all cases, HyB_BR gave almost identical accuracies to Bayesian mixture models implemented with full MCMC, however computational time was reduced by up to 1/17 of that required by full MCMC. The SNPs with high posterior probability of a non-zero effect were also very similar between full MCMC and HyB_BR, with several known genes affecting milk production in this category, as well as some novel genes. HyB_BR was also applied to seven human diseases with 4890 individuals genotyped for around 300 K SNPs in a case/control design, from the Welcome Trust Case Control Consortium (WTCCC). In this data set, the results demonstrated again that HyB_BR performed as well as Bayesian mixture models with full MCMC for genomic predictions and genetic architecture inference while reducing the computational time from 45 h with full MCMC to 3 h with HyB_BR. CONCLUSIONS:The results for quantitative traits in cattle and disease in humans demonstrate that HyB_BR can perform equally well as Bayesian mixture models implemented with full MCMC in terms of prediction accuracy, but with up to 17 times faster than the full MCMC implementations. The HyB_BR algorithm makes simultaneous genomic prediction, QTL mapping and inference of genetic architecture feasible in large genomic data sets.

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Wang T,Chen YP,Bowman PJ,Goddard ME,Hayes BJ

doi

10.1186/s12864-016-3082-7

subject

Has Abstract

pub_date

2016-09-21 00:00:00

pages

744

issue

1

issn

1471-2164

pii

10.1186/s12864-016-3082-7

journal_volume

17

pub_type

杂志文章
  • Expression profiles of urbilaterian genes uniquely shared between honey bee and vertebrates.

    abstract:BACKGROUND:Large-scale comparison of metazoan genomes has revealed that a significant fraction of genes of the last common ancestor of Bilateria (Urbilateria) is lost in each animal lineage. This event could be one of the underlying mechanisms involved in generating metazoan diversity. However, the present functions of...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-10-17

    authors: Matsui T,Yamamoto T,Wyder S,Zdobnov EM,Kadowaki T

    更新日期:2009-01-12 00:00:00

  • Genome-wide expression profiling shows transcriptional reprogramming in Fusarium graminearum by Fusarium graminearum virus 1-DK21 infection.

    abstract:BACKGROUND:Fusarium graminearum virus 1 strain-DK21 (FgV1-DK21) is a mycovirus that confers hypovirulence to F. graminearum, which is the primary phytopathogenic fungus that causes Fusarium head blight (FHB) disease in many cereals. Understanding the interaction between mycoviruses and plant pathogenic fungi is necessa...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-173

    authors: Cho WK,Yu J,Lee KM,Son M,Min K,Lee YW,Kim KH

    更新日期:2012-05-06 00:00:00

  • A consensus linkage map of the grass carp (Ctenopharyngodon idella) based on microsatellites and SNPs.

    abstract:BACKGROUND:Grass carp (Ctenopharyngodon idella) belongs to the family Cyprinidae which includes more than 2000 fish species. It is one of the most important freshwater food fish species in world aquaculture. A linkage map is an essential framework for mapping traits of interest and is often the first step towards under...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-135

    authors: Xia JH,Liu F,Zhu ZY,Fu J,Feng J,Li J,Yue GH

    更新日期:2010-02-24 00:00:00

  • Identification of novel and differentially expressed MicroRNAs in goat enzootic nasal adenocarcinoma.

    abstract:BACKGROUND:MicroRNAs (miRNAs) post-transcriptionally regulate a variety of genes involved in eukaryotic cell growth, development, metabolism and other biological processes, and numerous miRNAs are implicated in the initiation and progression of cancer. Enzootic nasal adenocarcinoma (ENA), an epithelial tumor induced in...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3238-5

    authors: Wang B,Ye N,Cao SJ,Wen XT,Huang Y,Yan QG

    更新日期:2016-11-08 00:00:00

  • CEG: a database of essential gene clusters.

    abstract:BACKGROUND:Essential genes are indispensable for the survival of living entities. They are the cornerstones of synthetic biology, and are potential candidate targets for antimicrobial and vaccine design. DESCRIPTION:Here we describe the Cluster of Essential Genes (CEG) database, which contains clusters of orthologous ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-769

    authors: Ye YN,Hua ZG,Huang J,Rao N,Guo FB

    更新日期:2013-11-09 00:00:00

  • Partial depletion of yolk during zebrafish embryogenesis changes the dynamics of methionine cycle and metabolic genes.

    abstract:BACKGROUND:Limited nutrient availability during development is associated with metabolic diseases in adulthood. The molecular cause for these defects is unclear. Here, we investigate if transcriptional changes caused by developmental malnutrition reveal an early response that can be linked to metabolism and metabolic d...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1654-6

    authors: Huang Y,Linsen SE

    更新日期:2015-06-04 00:00:00

  • A framework for TRIM21-mediated protein depletion in early mouse embryos: recapitulation of Tead4 null phenotype over three days.

    abstract:BACKGROUND:While DNA and RNA methods are routine to disrupt the expression of specific genes, complete understanding of developmental processes requires also protein methods, because: oocytes and early embryos accumulate proteins and these are not directly affected by DNA and RNA methods. When proteins in the oocyte en...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-6106-2

    authors: Israel S,Casser E,Drexler HCA,Fuellen G,Boiani M

    更新日期:2019-10-21 00:00:00

  • Comparative genomic analysis revealed great plasticity and environmental adaptation of the genomes of Enterococcus faecium.

    abstract:BACKGROUND:As an important nosocomial pathogen, Enterococcus faecium has received increasing attention in recent years. However, a large number of studies have focused on the hospital-associated isolates and ignored isolates originated from the natural environments. RESULTS:In this study, comparative genomic analysis ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5975-8

    authors: Zhong Z,Kwok LY,Hou Q,Sun Y,Li W,Zhang H,Sun Z

    更新日期:2019-07-22 00:00:00

  • Comparative genomics of 84 Pectobacterium genomes reveals the variations related to a pathogenic lifestyle.

    abstract:BACKGROUND:Pectobacterium spp. are necrotrophic bacterial plant pathogens of the family Pectobacteriaceae, responsible for a wide spectrum of diseases of important crops and ornamental plants including soft rot, blackleg, and stem wilt. P. carotovorum is a genetically heterogeneous species consisting of three valid sub...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-5269-6

    authors: Li X,Ma Y,Liang S,Tian Y,Yin S,Xie S,Xie H

    更新日期:2018-12-07 00:00:00

  • QTLs associated with dry matter intake, metabolic mid-test weight, growth and feed efficiency have little overlap across 4 beef cattle studies.

    abstract:BACKGROUND:The identification of genetic markers associated with complex traits that are expensive to record such as feed intake or feed efficiency would allow these traits to be included in selection programs. To identify large-effect QTL, we performed a series of genome-wide association studies and functional analyse...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-1004

    authors: Saatchi M,Beever JE,Decker JE,Faulkner DB,Freetly HC,Hansen SL,Yampara-Iquise H,Johnson KA,Kachman SD,Kerley MS,Kim J,Loy DD,Marques E,Neibergs HL,Pollak EJ,Schnabel RD,Seabury CM,Shike DW,Snelling WM,Spangler ML,

    更新日期:2014-11-20 00:00:00

  • In silico and in situ characterization of the zebrafish (Danio rerio) gnrh3 (sGnRH) gene.

    abstract:BACKGROUND:Gonadotropin releasing hormone (GnRH) is responsible for stimulation of gonadotropic hormone (GtH) in the hypothalamus-pituitary-gonadal axis (HPG). The regulatory mechanisms responsible for brain specificity make the promoter attractive for in silico analysis and reporter gene studies in zebrafish (Danio re...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-3-25

    authors: Torgersen J,Nourizadeh-Lillabadi R,Husebye H,Aleström P

    更新日期:2002-08-21 00:00:00

  • Repeats and EST analysis for new organisms.

    abstract:BACKGROUND:Repeat masking is an important step in the EST analysis pipeline. For new species, genomic knowledge is scarce and good repeat libraries are typically unavailable. In these cases it is common practice to mask against known repeats from other species (i.e., model organisms). There are few studies that investi...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-23

    authors: Malde K,Jonassen I

    更新日期:2008-01-18 00:00:00

  • Global gene expression profiling of brown to white adipose tissue transformation in sheep reveals novel transcriptional components linked to adipose remodeling.

    abstract:BACKGROUND:Large mammals are capable of thermoregulation shortly after birth due to the presence of brown adipose tissue (BAT). The majority of BAT disappears after birth and is replaced by white adipose tissue (WAT). RESULTS:We analyzed the postnatal transformation of adipose in sheep with a time course study of the ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1405-8

    authors: Basse AL,Dixen K,Yadav R,Tygesen MP,Qvortrup K,Kristiansen K,Quistorff B,Gupta R,Wang J,Hansen JB

    更新日期:2015-03-19 00:00:00

  • Large-scale analysis of post-translational modifications in E. coli under glucose-limiting conditions.

    abstract:BACKGROUND:Post-translational modification (PTM) of proteins is central to many cellular processes across all domains of life, but despite decades of study and a wealth of genomic and proteomic data the biological function of many PTMs remains unknown. This is especially true for prokaryotic PTM systems, many of which ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-3676-8

    authors: Brown CW,Sridhara V,Boutz DR,Person MD,Marcotte EM,Barrick JE,Wilke CO

    更新日期:2017-04-17 00:00:00

  • How to evaluate performance of prediction methods? Measures and their interpretation in variation effect analysis.

    abstract:BACKGROUND:Prediction methods are increasingly used in biosciences to forecast diverse features and characteristics. Binary two-state classifiers are the most common applications. They are usually based on machine learning approaches. For the end user it is often problematic to evaluate the true performance and applica...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-S4-S2

    authors: Vihinen M

    更新日期:2012-06-18 00:00:00

  • Transversions have larger regulatory effects than transitions.

    abstract:BACKGROUND:Transversions (Tv's) are more likely to alter the amino acid sequence of proteins than transitions (Ts's), and local deviations in the Ts:Tv ratio are indicative of evolutionary selection on genes. Whether the two different types of mutations have different effects in non-protein-coding sequences remains unk...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-3785-4

    authors: Guo C,McDowell IC,Nodzenski M,Scholtens DM,Allen AS,Lowe WL,Reddy TE

    更新日期:2017-05-19 00:00:00

  • Comparative genomics of downy mildews reveals potential adaptations to biotrophy.

    abstract:BACKGROUND:Spinach downy mildew caused by the oomycete Peronospora effusa is a significant burden on the expanding spinach production industry, especially for organic farms where synthetic fungicides cannot be deployed to control the pathogen. P. effusa is highly variable and 15 new races have been recognized in the pa...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-5214-8

    authors: Fletcher K,Klosterman SJ,Derevnina L,Martin F,Bertier LD,Koike S,Reyes-Chin-Wo S,Mou B,Michelmore R

    更新日期:2018-11-29 00:00:00

  • Culture-independent genomic characterisation of Candidatus Chlamydia sanzinia, a novel uncultivated bacterium infecting snakes.

    abstract:BACKGROUND:Recent molecular studies have revealed considerably more diversity in the phylum Chlamydiae than was previously thought. Evidence is growing that many of these novel chlamydiae may be important pathogens in humans and animals. A significant barrier to characterising these novel chlamydiae is the requirement ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3055-x

    authors: Taylor-Brown A,Bachmann NL,Borel N,Polkinghorne A

    更新日期:2016-09-05 00:00:00

  • Comprehensive analysis of CCCH-type zinc finger family genes facilitates functional gene discovery and reflects recent allopolyploidization event in tetraploid switchgrass.

    abstract:BACKGROUND:In recent years, dozens of Arabidopsis and rice CCCH-type zinc finger genes have been functionally studied, many of which confer important traits, such as abiotic and biotic stress tolerance, delayed leaf senescence and improved plant architecture. Switchgrass (Panicum virgatum) is an important bioenergy cro...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1328-4

    authors: Yuan S,Xu B,Zhang J,Xie Z,Cheng Q,Yang Z,Cai Q,Huang B

    更新日期:2015-02-25 00:00:00

  • Genomic differences between cultivated soybean, G. max and its wild relative G. soja.

    abstract:BACKGROUND:Glycine max is an economically important crop and many different varieties of soybean exist around the world. The first draft sequences and gene models of G. max (domesticated soybean) as well as G. soja (wild soybean), both became available in 2010. This opened the door for comprehensive comparative genomic...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-S1-S5

    authors: Joshi T,Valliyodan B,Wu JH,Lee SH,Xu D,Nguyen HT

    更新日期:2013-01-01 00:00:00

  • Transcriptome profile analysis of flowering molecular processes of early flowering trifoliate orange mutant and the wild-type [Poncirus trifoliata (L.) Raf.] by massively parallel signature sequencing.

    abstract:BACKGROUND:After several years in the juvenile phase, trees undergo flowering transition to become mature (florally competent) trees. This transition depends on the balanced expression of a complex network of genes that is regulated by both endogenous and environmental factors. However, relatively little is known about...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-63

    authors: Zhang JZ,Ai XY,Sun LM,Zhang DL,Guo WW,Deng XX,Hu CG

    更新日期:2011-01-26 00:00:00

  • The sulfur/sulfonates transport systems in Xanthomonas citri pv. citri.

    abstract:BACKGROUND:The Xanthomonas citri pv. citri (X. citri) is a phytopathogenic bacterium that infects different species of citrus plants where it causes canker disease. The adaptation to different habitats is related to the ability of the cells to metabolize and to assimilate diverse compounds, including sulfur, an essenti...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1736-5

    authors: Pereira CT,Moutran A,Fessel M,Balan A

    更新日期:2015-07-14 00:00:00

  • Complex sense-antisense architecture of TNFAIP1/POLDIP2 on 17q11.2 represents a novel transcriptional structural-functional gene module involved in breast cancer progression.

    abstract:BACKGROUND:A sense-antisense gene pair (SAGP) is a gene pair where two oppositely transcribed genes share a common nucleotide sequence region. In eukaryotic genomes, SAGPs can be organized in complex sense-antisense architectures (CSAGAs) in which at least one sense gene shares loci with two or more antisense partners....

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-S1-S9

    authors: Grinchuk OV,Motakis E,Kuznetsov VA

    更新日期:2010-02-10 00:00:00

  • Generation of a de novo transcriptome from equine lamellar tissue.

    abstract:BACKGROUND:Laminitis, the structural failure of interdigitated tissue that suspends the distal skeleton within the hoof capsule, is a devastating disease that is the second leading cause of both lameness and euthanasia in the horse. Current transcriptomic research focuses on the expression of known genes. However, as t...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1948-8

    authors: Holl HM,Gao S,Fei Z,Andrews C,Brooks SA

    更新日期:2015-10-03 00:00:00

  • Methods for high-throughput MethylCap-Seq data analysis.

    abstract:BACKGROUND:Advances in whole genome profiling have revolutionized the cancer research field, but at the same time have raised new bioinformatics challenges. For next generation sequencing (NGS), these include data storage, computational costs, sequence processing and alignment, delineating appropriate statistical measu...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-S6-S14

    authors: Rodriguez BA,Frankhouser D,Murphy M,Trimarchi M,Tam HH,Curfman J,Huang R,Chan MW,Lai HC,Parikh D,Ball B,Schwind S,Blum W,Marcucci G,Yan P,Bundschuh R

    更新日期:2012-01-01 00:00:00

  • De novo sequencing of tree peony (Paeonia suffruticosa) transcriptome to identify critical genes involved in flowering and floral organ development.

    abstract:BACKGROUND:Tree peony (Paeonia suffruticosa Andrews) is a globally famous ornamental flower, with large and colorful flowers and abundant flower types. However, a relatively short and uniform flowering period hinders the applications and production of ornamental tree peony. Unfortunately, the molecular mechanism of reg...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5857-0

    authors: Wang S,Gao J,Xue J,Xue Y,Li D,Guan Y,Zhang X

    更新日期:2019-07-11 00:00:00

  • Temporal transcriptome changes induced by MDV in Marek's disease-resistant and -susceptible inbred chickens.

    abstract:BACKGROUND:Marek's disease (MD) is a lymphoproliferative disease in chickens caused by Marek's disease virus (MDV) and characterized by T cell lymphoma and infiltration of lymphoid cells into various organs such as liver, spleen, peripheral nerves and muscle. Resistance to MD and disease risk have long been thought to ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-501

    authors: Yu Y,Luo J,Mitra A,Chang S,Tian F,Zhang H,Yuan P,Zhou H,Song J

    更新日期:2011-10-12 00:00:00

  • CLARK: fast and accurate classification of metagenomic and genomic sequences using discriminative k-mers.

    abstract:BACKGROUND:The problem of supervised DNA sequence classification arises in several fields of computational molecular biology. Although this problem has been extensively studied, it is still computationally challenging due to size of the datasets that modern sequencing technologies can produce. RESULTS:We introduce CLA...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1419-2

    authors: Ounit R,Wanamaker S,Close TJ,Lonardi S

    更新日期:2015-03-25 00:00:00

  • Expression profile of genes regulated by activity of the Na-H exchanger NHE1.

    abstract:BACKGROUND:In mammalian cells changes in intracellular pH (pHi), which are predominantly controlled by activity of plasma membrane ion exchangers, regulate a diverse range of normal and pathological cellular processes. How changes in pHi affect distinct cellular processes has primarily been determined by evaluating pro...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-5-46

    authors: Putney LK,Barber DL

    更新日期:2004-07-16 00:00:00

  • Time course of the response to ACTH in pig: biological and transcriptomic study.

    abstract:BACKGROUND:HPA axis plays a major role in physiological homeostasis. It is also involved in stress and adaptive response to the environment. In farm animals in general and specifically in pigs, breeding strategies have highly favored production traits such as lean growth rate, feed efficiency and prolificacy at the cos...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-2118-8

    authors: Sautron V,Terenina E,Gress L,Lippi Y,Billon Y,Larzul C,Liaubet L,Villa-Vialaneix N,Mormède P

    更新日期:2015-11-17 00:00:00