Conditional entropy in variation-adjusted windows detects selection signatures associated with expression quantitative trait loci (eQTLs).

Abstract:

BACKGROUND: Over the past 50,000 years, shifts in human-environmental or human-human interactions shaped genetic differences within and among human populations, including variants under positive selection. Shaped by environmental factors, such variants influence the genetics of modern health, disease, and treatment outcome. Because evolutionary processes tend to act on gene regulation, we test whether regulatory variants are under positive selection. We introduce a new approach to enhance detection of genetic markers undergoing positive selection, using conditional entropy to capture recent local selection signals. RESULTS: We use conditional logistic regression to compare our Adjusted Haplotype Conditional Entropy (H|H) measure of positive selection to existing positive selection measures. H|H and existing measures were applied to published regulatory variants acting in cis (cis-eQTLs), with conditional logistic regression testing whether regulatory variants undergo stronger positive selection than the surrounding gene. These cis-eQTLs were drawn from six independent studies of genotype and RNA expression. The conditional logistic regression shows that, overall, H|H is substantially more powerful than existing positive-selection methods in distinguishing cis-eQTLs from other single nucleotide polymorphisms (SNPs) in the same genes. When broken down by Gene Ontology (GO), H|H predictions are particularly strong in some biological process categories, where regulatory variants are under strong positive selection compared to the bulk of the gene; these categories are distinct from the GO categories under overall positive selection. However, cis-eQTLs in a second group of genes lack positive selection signatures detectable by H|H, consistent with ancient short haplotypes compared to the surrounding gene (for example, in innate immunity GO:0042742); under such other modes of selection, H|H would not be expected to be a strong predictor. These conditional logistic regression models are adjusted for minor allele frequency (MAF); otherwise, ascertainment bias is a major factor in all eQTL data sets. Relationships between Gene Ontology categories, positive selection and eQTL specificity were replicated with H|H in a single larger data set. Our measure, Adjusted Haplotype Conditional Entropy (H|H), was essential in generating all of the results above because it: 1) is a stronger overall predictor for eQTLs than comparable existing approaches, and 2) shows low sequential auto-correlation, overcoming problems with convergence of these conditional regression statistical models. CONCLUSIONS: Our new method, H|H, provides a consistently more robust signal associated with cis-eQTLs compared to existing methods. We interpret this to indicate that some cis-eQTLs are under positive selection compared to their surrounding genes. Conditional entropy indicative of a selective sweep is an especially strong predictor of eQTLs for genes in several biological processes of medical interest. Where conditional entropy is a weak or negative predictor of eQTLs, such as in innate immune genes, this would be consistent with balancing selection acting on such eQTLs over long time periods. Different measures of selection may be needed for variant prioritization under other modes of evolutionary selection.
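
As a rough illustration of the quantity named in the abstract, the sketch below computes a plain conditional entropy H(right | left) = H(left, right) - H(left) between haplotypes observed in two adjacent windows. It is not the authors' Adjusted Haplotype Conditional Entropy: the abstract does not give the window definition or the variation adjustment, so the function names, the string encoding of haplotypes, and the two-window layout are all assumptions made for illustration only.

```python
from collections import Counter
from math import log2


def entropy(items):
    """Shannon entropy (in bits) of the empirical distribution of `items`."""
    counts = Counter(items)
    total = sum(counts.values())
    return -sum((c / total) * log2(c / total) for c in counts.values())


def conditional_entropy(left, right):
    """Plain conditional entropy H(right | left) = H(left, right) - H(left).

    `left` and `right` are equal-length lists; element i of each list is the
    haplotype carried by chromosome i in the left and right window.
    Hypothetical helper: no variation adjustment is applied here.
    """
    if len(left) != len(right):
        raise ValueError("both windows must cover the same chromosomes")
    joint = list(zip(left, right))
    return entropy(joint) - entropy(left)


# Toy data: four chromosomes, haplotypes encoded as allele strings.
linked = conditional_entropy(["AC", "AC", "GT", "GT"], ["TT", "TT", "CA", "CA"])
unlinked = conditional_entropy(["AC", "AC", "AC", "AC"], ["TT", "TA", "CT", "CA"])
```

In this toy case `linked` is low because the left-window haplotype fully determines the right-window haplotype, as would be expected when a long haplotype has been carried along by a sweep, while `unlinked` is high because the right window varies independently of the left.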

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Handelman SK,Seweryn M,Smith RM,Hartmann K,Wang D,Pietrzak M,Johnson AD,Kloczkowski A,Sadee W

doi

10.1186/1471-2164-16-S8-S8

subject

Has Abstract

pub_date

2015-01-01 00:00:00

pages

S8

issn

1471-2164

pii

1471-2164-16-S8-S8

journal_volume

16 Suppl 8

pub_type

Journal Article
  • Comparing de novo assemblers for 454 transcriptome data.

    abstract:BACKGROUND:Roche 454 pyrosequencing has become a method of choice for generating transcriptome data from non-model organisms. Once the tens to hundreds of thousands of short (250-450 base) reads have been produced, it is important to correctly assemble these to estimate the sequence of all the transcripts. Most transcr...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-571

    authors: Kumar S,Blaxter ML

    Update date: 2010-10-16 00:00:00

  • Changes in Bacillus anthracis CodY regulation under host-specific environmental factor deprived conditions.

    abstract:BACKGROUND:Host-specific environmental factors induce changes in Bacillus anthracis gene transcription during infection. A global transcription regulator, CodY, plays a pivotal role in regulating central metabolism, biosynthesis, and virulence in B. anthracis. In this study, we utilized RNA-sequencing to assess changes...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-3004-8

    authors: Kim SK,Jung KH,Chai YG

    Update date: 2016-08-17 00:00:00

  • Cell periphery-related proteins as major genomic targets behind the adaptive evolution of an industrial Saccharomyces cerevisiae strain to combined heat and hydrolysate stress.

    abstract:BACKGROUND:Laboratory evolution is an important tool for developing robust yeast strains for bioethanol production since the biological basis behind combined tolerance requires complex alterations whose proper regulation is difficult to achieve by rational metabolic engineering. Previously, we reported on the evolved i...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1737-4

    authors: Wallace-Salinas V,Brink DP,Ahrén D,Gorwa-Grauslund MF

    Update date: 2015-07-09 00:00:00

  • A multispecies comparison of the metazoan 3'-processing downstream elements and the CstF-64 RNA recognition motif.

    abstract:BACKGROUND:The Cleavage Stimulation Factor (CstF) is a required protein complex for eukaryotic mRNA 3'-processing. CstF interacts with 3'-processing downstream elements (DSEs) through its 64-kDa subunit, CstF-64; however, the exact nature of this interaction has remained unclear. We used EST-to-genome alignments to ide...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-7-55

    authors: Salisbury J,Hutchison KW,Graber JH

    Update date: 2006-03-16 00:00:00

  • Genome-wide mapping of Hif-1α binding sites in zebrafish.

    abstract:BACKGROUND:Hypoxia Inducible Factor (HIF) regulates a cascade of transcriptional events in response to decreased oxygenation, acting from the cellular to the physiological level. This response is evolutionarily conserved, allowing the use of zebrafish (Danio rerio) as a model for studying the hypoxic response. Activati...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-2169-x

    authors: Greenald D,Jeyakani J,Pelster B,Sealy I,Mathavan S,van Eeden FJ

    Update date: 2015-11-11 00:00:00

  • Gene expression analyses in Atlantic salmon challenged with infectious salmon anemia virus reveal differences between individuals with early, intermediate and late mortality.

    abstract:BACKGROUND:Infectious salmon anemia virus (ISAV) causes a multisystemic disease responsible for severe losses in salmon aquaculture. Better understanding of factors that explain variations in resistance between individuals and families is essential for development of strategies for disease control. To approach this, we...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-179

    authors: Jørgensen SM,Afanasyev S,Krasnov A

    Update date: 2008-04-18 00:00:00

  • Gene silencing pathways found in the green alga Volvox carteri reveal insights into evolution and origins of small RNA systems in plants.

    abstract:BACKGROUND:Volvox carteri (V. carteri) is a multicellular green alga used as model system for the evolution of multicellularity. So far, the contribution of small RNA pathways to these phenomena is not understood. Thus, we have sequenced V. carteri Argonaute 3 (VcAGO3)-associated small RNAs from different developmental...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-3202-4

    authors: Dueck A,Evers M,Henz SR,Unger K,Eichner N,Merkl R,Berezikov E,Engelmann JC,Weigel D,Wenzl S,Meister G

    Update date: 2016-11-02 00:00:00

  • A transcript profiling approach reveals the zinc finger transcription factor ZNF191 is a pleiotropic factor.

    abstract:BACKGROUND:The human zinc finger protein 191 (ZNF191) is a member of the SCAN domain family of Krüppel-like zinc finger transcription factors. ZNF191 shows 94% identity to its mouse homologue zinc finger protein 191(Zfp191), which is the most highly conserved among the human-mouse SCAN family member orthologues pairs. ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-241

    authors: Li J,Chen X,Gong X,Liu Y,Feng H,Qiu L,Hu Z,Zhang J

    Update date: 2009-05-22 00:00:00

  • Development of the first marmoset-specific DNA microarray (EUMAMA): a new genetic tool for large-scale expression profiling in a non-human primate.

    abstract:BACKGROUND:The common marmoset monkey (Callithrix jacchus), a small non-endangered New World primate native to eastern Brazil, is becoming increasingly used as a non-human primate model in biomedical research, drug development and safety assessment. In contrast to the growing interest for the marmoset as an animal mode...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-8-190

    authors: Datson NA,Morsink MC,Atanasova S,Armstrong VW,Zischler H,Schlumbohm C,Dutilh BE,Huynen MA,Waegele B,Ruepp A,de Kloet ER,Fuchs E

    Update date: 2007-06-25 00:00:00

  • The metabolome as a link in the genotype-phenotype map for peroxide resistance in the fruit fly, Drosophila melanogaster.

    abstract:BACKGROUND:Genetic association studies that seek to explain the inheritance of complex traits typically fail to explain a majority of the heritability of the trait under study. Thus, we are left with a gap in the map from genotype to phenotype. Several approaches have been used to fill this gap, including those that at...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-6739-1

    authors: Harrison BR,Wang L,Gajda E,Hoffman EV,Chung BY,Pletcher SD,Raftery D,Promislow DEL

    Update date: 2020-05-04 00:00:00

  • Revealing common disease mechanisms shared by tumors of different tissues of origin through semantic representation of genomic alterations and topic modeling.

    abstract:BACKGROUND:Cancer is a complex disease driven by somatic genomic alterations (SGAs) that perturb signaling pathways and consequently cellular function. Identifying patterns of pathway perturbations would provide insights into common disease mechanisms shared among tumors, which is important for guiding treatment and pr...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-3494-z

    authors: Chen V,Paisley J,Lu X

    Update date: 2017-03-14 00:00:00

  • scReQTL: an approach to correlate SNVs to gene expression from individual scRNA-seq datasets.

    abstract:BACKGROUND:Recently, pioneering expression quantitative trait loci (eQTL) studies on single cell RNA sequencing (scRNA-seq) data have revealed new and cell-specific regulatory single nucleotide variants (SNVs). Here, we present an alternative QTL-related approach applicable to transcribed SNV loci from scRNA-seq data: ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-07334-y

    authors: Liu H,Prashant NM,Spurr LF,Bousounis P,Alomran N,Ibeawuchi H,Sein J,Słowiński P,Tsaneva-Atanasova K,Horvath A

    Update date: 2021-01-08 00:00:00

  • Simultaneous gene expression profiling in human macrophages infected with Leishmania major parasites using SAGE.

    abstract:BACKGROUND:Leishmania (L) are intracellular protozoan parasites that are able to survive and replicate within the harsh and potentially hostile phagolysosomal environment of mammalian mononuclear phagocytes. A complex interplay then takes place between the macrophage (MPhi) striving to eliminate the pathogen and the pa...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-238

    authors: Guerfali FZ,Laouini D,Guizani-Tabbane L,Ottones F,Ben-Aissa K,Benkahla A,Manchon L,Piquemal D,Smandi S,Mghirbi O,Commes T,Marti J,Dellagi K

    Update date: 2008-05-21 00:00:00

  • Anomaly detection in gene expression via stochastic models of gene regulatory networks.

    abstract:BACKGROUND:The steady-state behaviour of gene regulatory networks (GRNs) can provide crucial evidence for detecting disease-causing genes. However, monitoring the dynamics of GRNs is particularly difficult because biological data only reflects a snapshot of the dynamical behaviour of the living organism. Also most GRN ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-S3-S26

    authors: Kim H,Gelenbe E

    Update date: 2009-12-03 00:00:00

  • De novo transcriptome sequencing in a songbird, the dark-eyed junco (Junco hyemalis): genomic tools for an ecological model system.

    abstract:BACKGROUND:Though genomic-level data are becoming widely available, many of the metazoan species sequenced are laboratory systems whose natural history is not well documented. In contrast, the wide array of species with very well-characterized natural history have, until recently, lacked genomics tools. It is now possi...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-305

    authors: Peterson MP,Whittaker DJ,Ambreth S,Sureshchandra S,Buechlein A,Podicheti R,Choi JH,Lai Z,Mockatis K,Colbourne J,Tang H,Ketterson ED

    Update date: 2012-07-09 00:00:00

  • De novo assembly of middle-sized genome using MinION and Illumina sequencers.

    abstract:BACKGROUND:The plastid acquisition by secondary endosymbiosis is a driving force for the algal evolution, and the comparative genomics was required to examine the genomic change of symbiont. Therefore, we established a pipeline of a de novo assembly of middle-sized genomes at a low cost and with high quality using long...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-5067-1

    authors: Minei R,Hoshina R,Ogura A

    Update date: 2018-09-24 00:00:00

  • An ancestry informative marker panel design for individual ancestry estimation of Hispanic population using whole exome sequencing data.

    abstract:BACKGROUND:Europeans and American Indians were major genetic ancestry of Hispanics in the U.S. These ancestral groups have markedly different incidence rates and outcomes in many types of cancers. Therefore, the genetic admixture may cause biased genetic association study with cancer susceptibility variants specificall...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-6333-6

    authors: Wang LJ,Zhang CW,Su SC,Chen HH,Chiu YC,Lai Z,Bouamar H,Ramirez AG,Cigarroa FG,Sun LZ,Chen Y

    Update date: 2019-12-30 00:00:00

  • Accumulation of interspersed and sex-specific repeats in the non-recombining region of papaya sex chromosomes.

    abstract:BACKGROUND:The papaya Y chromosome has undergone a degenerative expansion from its ancestral autosome, as a consequence of recombination suppression in the sex determining region of the sex chromosomes. The non-recombining feature led to the accumulation of repetitive sequences in the male- or hermaphrodite-specific re...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-335

    authors: Na JK,Wang J,Ming R

    Update date: 2014-05-04 00:00:00

  • Directional RNA-seq reveals highly complex condition-dependent transcriptomes in E. coli K12 through accurate full-length transcripts assembling.

    abstract:BACKGROUND:Although prokaryotic gene transcription has been studied over decades, many aspects of the process remain poorly understood. Particularly, recent studies have revealed that transcriptomes in many prokaryotes are far more complex than previously thought. Genes in an operon are often alternatively and dynamica...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-520

    authors: Li S,Dong X,Su Z

    Update date: 2013-07-30 00:00:00

  • Identification of a strawberry flavor gene candidate using an integrated genetic-genomic-analytical chemistry approach.

    abstract:BACKGROUND:There is interest in improving the flavor of commercial strawberry (Fragaria × ananassa) varieties. Fruit flavor is shaped by combinations of sugars, acids and volatile compounds. Many efforts seek to use genomics-based strategies to identify genes controlling flavor, and then designing durable molecular mar...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-217

    authors: Chambers AH,Pillet J,Plotto A,Bai J,Whitaker VM,Folta KM

    Update date: 2014-04-17 00:00:00

  • Impact of analytic provenance in genome analysis.

    abstract:BACKGROUND:Many computational methods are available for assembly and annotation of newly sequenced microbial genomes. However, when new genomes are reported in the literature, there is frequently very little critical analysis of choices made during the sequence assembly and gene annotation stages. These choices have a ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-S8-S1

    authors: Morrison SS,Pyzh R,Jeon MS,Amaro C,Roig FJ,Baker-Austin C,Oliver JD,Gibas CJ

    Update date: 2014-01-01 00:00:00

  • Omics profiles used to evaluate the gene expression of Exiguobacterium antarcticum B7 during cold adaptation.

    abstract:BACKGROUND:Exiguobacterium antarcticum strain B7 is a Gram-positive psychrotrophic bacterial species isolated in Antarctica. Although this bacteria has been poorly studied, its genome has already been sequenced. Therefore, it is an appropriate model for the study of thermal adaptation. In the present study, we analyzed...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-986

    authors: Dall'Agnol HP,Baraúna RA,de Sá PH,Ramos RT,Nóbrega F,Nunes CI,das Graças DA,Carneiro AR,Santos DM,Pimenta AM,Carepo MS,Azevedo V,Pellizari VH,Schneider MP,Silva A

    Update date: 2014-11-18 00:00:00

  • A comprehensive analysis of Helicobacter pylori plasticity zones reveals that they are integrating conjugative elements with intermediate integration specificity.

    abstract:BACKGROUND:The human gastric pathogen Helicobacter pylori is a paradigm for chronic bacterial infections. Its persistence in the stomach mucosa is facilitated by several mechanisms of immune evasion and immune modulation, but also by an unusual genetic variability which might account for the capability to adapt to chan...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-310

    authors: Fischer W,Breithaupt U,Kern B,Smith SI,Spicher C,Haas R

    Update date: 2014-04-27 00:00:00

  • Widespread promoter methylation of synaptic plasticity genes in long-term potentiation in the adult brain in vivo.

    abstract:BACKGROUND:DNA methylation is a key modulator of gene expression in mammalian development and cellular differentiation, including neurons. To date, the role of DNA modifications in long-term potentiation (LTP) has not been explored. RESULTS:To investigate the occurrence of DNA methylation changes in LTP, we undertook ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-3621-x

    authors: Maag JL,Kaczorowski DC,Panja D,Peters TJ,Bramham CR,Wibrand K,Dinger ME

    Update date: 2017-03-23 00:00:00

  • The Zygosaccharomyces bailii transcription factor Haa1 is required for acetic acid and copper stress responses suggesting subfunctionalization of the ancestral bifunctional protein Haa1/Cup2.

    abstract:BACKGROUND:The food spoilage yeast species Zygosaccharomyces bailii exhibits an extraordinary capacity to tolerate weak acids, in particular acetic acid. In Saccharomyces cerevisiae, the transcription factor Haa1 (ScHaa1) is considered the main player in genomic expression reprogramming in response to acetic acid stres...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-3443-2

    authors: Palma M,Dias PJ,Roque FC,Luzia L,Guerreiro JF,Sá-Correia I

    Update date: 2017-01-13 00:00:00

  • MRCNN: a deep learning model for regression of genome-wide DNA methylation.

    abstract:BACKGROUND:Determination of genome-wide DNA methylation is significant for both basic research and drug development. As a key epigenetic modification, this biochemical process can modulate gene expression to influence the cell differentiation which can possibly lead to cancer. Due to the involuted biochemical mechanism...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5488-5

    authors: Tian Q,Zou J,Tang J,Fang Y,Yu Z,Fan S

    Update date: 2019-04-04 00:00:00

  • Comparison of gene expression microarray data with count-based RNA measurements informs microarray interpretation.

    abstract:BACKGROUND:Although numerous investigations have compared gene expression microarray platforms, preprocessing methods and batch correction algorithms using constructed spike-in or dilution datasets, there remains a paucity of studies examining the properties of microarray data using diverse biological samples. Most mic...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-649

    authors: Richard AC,Lyons PA,Peters JE,Biasci D,Flint SM,Lee JC,McKinney EF,Siegel RM,Smith KG

    Update date: 2014-08-04 00:00:00

  • An improved approach for the segmentation of starch granules in microscopic images.

    abstract:BACKGROUND:Starches are the main storage polysaccharides in plants and are distributed widely throughout plants including seeds, roots, tubers, leaves, stems and so on. Currently, microscopic observation is one of the most important ways to investigate and analyze the structure of starches. The position, shape, and siz...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-S2-S13

    authors: Guo S,Tang J,Deng Y,Xia Q

    Update date: 2010-11-02 00:00:00

  • Bioinformatics and DNA-extraction strategies to reliably detect genetic variants from FFPE breast tissue samples.

    abstract:BACKGROUND:Archived formalin fixed paraffin embedded (FFPE) samples are valuable clinical resources to examine clinically relevant morphology features and also to study genetic changes. However, DNA quality and quantity of FFPE samples are often sub-optimal, and resulting NGS-based genetics variant detections are prone...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-6056-8

    authors: Bhagwate AV,Liu Y,Winham SJ,McDonough SJ,Stallings-Mann ML,Heinzen EP,Davila JI,Vierkant RA,Hoskin TL,Frost M,Carter JM,Radisky DC,Cunningham JM,Degnim AC,Wang C

    Update date: 2019-09-02 00:00:00

  • Gene expression patterns that predict sensitivity to epidermal growth factor receptor tyrosine kinase inhibitors in lung cancer cell lines and human lung tumors.

    abstract:BACKGROUND:Increased focus surrounds identifying patients with advanced non-small cell lung cancer (NSCLC) who will benefit from treatment with epidermal growth factor receptor (EGFR) tyrosine kinase inhibitors (TKI). EGFR mutation, gene copy number, coexpression of ErbB proteins and ligands, and epithelial to mesenchy...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-7-289

    authors: Balko JM,Potti A,Saunders C,Stromberg A,Haura EB,Black EP

    Update date: 2006-11-10 00:00:00