Transversions have larger regulatory effects than transitions.

Abstract:

BACKGROUND: Transversions (Tv's) are more likely to alter the amino acid sequence of proteins than transitions (Ts's), and local deviations in the Ts:Tv ratio are indicative of evolutionary selection on genes. Whether the two different types of mutations have different effects in non-protein-coding sequences remains unknown. Genetic variants primarily impact gene expression by disrupting the binding of transcription factors (TFs) and other DNA-binding proteins. Because Tv's cause larger changes in the shape of a DNA backbone, we hypothesized that Tv's would have larger impacts on TF binding and gene expression.

RESULTS: Here, we provide multiple lines of evidence demonstrating that Tv's have larger impacts on regulatory DNA including analyses of TF binding motifs and allele-specific TF binding. In these analyses, we observed a depletion of Tv's within TF binding motifs and TF binding sites. Using massively parallel population-scale reporter assays, we also provided empirical evidence that Tv's have larger effects than Ts's on the activity of human gene regulatory elements.

CONCLUSIONS: Tv's are more likely to disrupt TF binding, resulting in larger changes in gene expression. Although the observed differences are small, these findings represent a novel, fundamental property of regulatory variation. Understanding the features of functional non-coding variation could be valuable for revealing the genetic underpinnings of complex traits and diseases in future studies.
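For readers unfamiliar with the terminology, the distinction the abstract relies on can be sketched in a few lines of Python. A transition swaps a purine for a purine (A/G) or a pyrimidine for a pyrimidine (C/T); a transversion swaps a purine for a pyrimidine or the reverse. The function names `classify_substitution` and `ts_tv_ratio` are hypothetical and chosen for illustration only; this is not the authors' analysis code, just a minimal sketch of the standard definitions.

```python
# Standard base classes: purines (A, G) and pyrimidines (C, T).
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_substitution(ref: str, alt: str) -> str:
    """Label a single-nucleotide substitution as 'transition' or 'transversion'."""
    ref, alt = ref.upper(), alt.upper()
    valid = PURINES | PYRIMIDINES
    if ref not in valid or alt not in valid or ref == alt:
        raise ValueError(f"not a single-nucleotide substitution: {ref}>{alt}")
    # Same chemical class on both sides means a transition; otherwise a transversion.
    same_class = (ref in PURINES) == (alt in PURINES)
    return "transition" if same_class else "transversion"


def ts_tv_ratio(variants) -> float:
    """Compute the Ts:Tv ratio for an iterable of (ref, alt) pairs."""
    counts = {"transition": 0, "transversion": 0}
    for ref, alt in variants:
        counts[classify_substitution(ref, alt)] += 1
    if counts["transversion"] == 0:
        raise ValueError("no transversions observed; ratio is undefined")
    return counts["transition"] / counts["transversion"]
```

As a usage illustration, `ts_tv_ratio([("A", "G"), ("C", "T"), ("A", "T")])` counts two transitions and one transversion.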

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Guo C,McDowell IC,Nodzenski M,Scholtens DM,Allen AS,Lowe WL,Reddy TE

doi

10.1186/s12864-017-3785-4

subject

Has Abstract

pub_date

2017-05-19 00:00:00

pages

394

issue

1

issn

1471-2164

pii

10.1186/s12864-017-3785-4

journal_volume

18

pub_type

Journal Article
  • De novo assembly, characterization and functional annotation of Senegalese sole (Solea senegalensis) and common sole (Solea solea) transcriptomes: integration in a database and design of a microarray.

    abstract:BACKGROUND:Senegalese sole (Solea senegalensis) and common sole (S. solea) are two economically and evolutionary important flatfish species both in fisheries and aquaculture. Although some genomic resources and tools were recently described in these species, further sequencing efforts are required to establish a comple...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-952

    authors: Benzekri H,Armesto P,Cousin X,Rovira M,Crespo D,Merlo MA,Mazurais D,Bautista R,Guerrero-Fernández D,Fernandez-Pozo N,Ponce M,Infante C,Zambonino JL,Nidelet S,Gut M,Rebordinos L,Planas JV,Bégout ML,Claros MG,Manchado

    update_date:2014-11-03 00:00:00

  • Comparative analyses of genotype dependent expressed sequence tags and stress-responsive transcriptome of chickpea wilt illustrate predicted and unexpected genes and novel regulators of plant immunity.

    abstract:BACKGROUND:The ultimate phenome of any organism is modulated by regulated transcription of many genes. Characterization of genetic makeup is thus crucial for understanding the molecular basis of phenotypic diversity, evolution and response to intra- and extra-cellular stimuli. Chickpea is the world's third most importa...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-415

    authors: Ashraf N,Ghai D,Barman P,Basu S,Gangisetty N,Mandal MK,Chakraborty N,Datta A,Chakraborty S

    update_date:2009-09-05 00:00:00

  • MRCNN: a deep learning model for regression of genome-wide DNA methylation.

    abstract:BACKGROUND:Determination of genome-wide DNA methylation is significant for both basic research and drug development. As a key epigenetic modification, this biochemical process can modulate gene expression to influence the cell differentiation which can possibly lead to cancer. Due to the involuted biochemical mechanism...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5488-5

    authors: Tian Q,Zou J,Tang J,Fang Y,Yu Z,Fan S

    update_date:2019-04-04 00:00:00

  • A hybrid expectation maximisation and MCMC sampling algorithm to implement Bayesian mixture model based genomic prediction and QTL mapping.

    abstract:BACKGROUND:Bayesian mixture models in which the effects of SNP are assumed to come from normal distributions with different variances are attractive for simultaneous genomic prediction and QTL mapping. These models are usually implemented with Monte Carlo Markov Chain (MCMC) sampling, which requires long compute times ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-3082-7

    authors: Wang T,Chen YP,Bowman PJ,Goddard ME,Hayes BJ

    update_date:2016-09-21 00:00:00

  • MicroRNA modulate alveolar epithelial response to cyclic stretch.

    abstract:BACKGROUND:MicroRNAs (miRNAs) are post-transcriptional regulators of gene expression implicated in multiple cellular processes. Cyclic stretch of alveoli is characteristic of mechanical ventilation, and is postulated to be partly responsible for the lung injury and inflammation in ventilator-induced lung injury. We pro...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-154

    authors: Yehya N,Yerrapureddy A,Tobias J,Margulies SS

    update_date:2012-04-26 00:00:00

  • Accumulation of CTCF-binding sites drives expression divergence between tandemly duplicated genes in humans.

    abstract:BACKGROUND:During eukaryotic genome evolution, tandem gene duplication is the most frequent event giving rise to clustered gene families. However, how expression divergence between tandemly duplicated genes has emerged and maintained remain unclear. In particular, it is unknown if epigenetic regulators have been involv...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-S1-S8

    authors: Liao BY,Chang A

    update_date:2014-01-01 00:00:00

  • Endometrial gene expression profiling in pregnant Meishan and Yorkshire pigs on day 12 of gestation.

    abstract:BACKGROUND:Litter size in pigs is a major factor affecting the profitability in the pig industry. The peri-implantation window in pigs is characterized by the coordinated interactions between the maternal uterine endometrium and the rapidly elongating conceptuses and represents a period of time during which a large per...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-156

    authors: Gu T,Zhu MJ,Schroyen M,Qu L,Nettleton D,Kuhar D,Lunney JK,Ross JW,Zhao SH,Tuggle CK

    update_date:2014-02-24 00:00:00

  • Transcriptional response of Mexican axolotls to Ambystoma tigrinum virus (ATV) infection.

    abstract:BACKGROUND:Very little is known about the immunological responses of amphibians to pathogens that are causing global population declines. We used a custom microarray gene chip to characterize gene expression responses of axolotls (Ambystoma mexicanum) to an emerging viral pathogen, Ambystoma tigrinum virus (ATV). RESU...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-493

    authors: Cotter JD,Storfer A,Page RB,Beachy CK,Voss SR

    update_date:2008-10-20 00:00:00

  • Expression of immune-response genes in lepidopteran host is suppressed by venom from an endoparasitoid, Pteromalus puparum.

    abstract:BACKGROUND:The relationships between parasitoids and their insect hosts have attracted attention at two levels. First, the basic biology of host-parasitoid interactions is of fundamental interest. Second, parasitoids are widely used as biological control agents in sustainable agricultural programs. Females of the grega...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-484

    authors: Fang Q,Wang L,Zhu J,Li Y,Song Q,Stanley DW,Akhtar ZR,Ye G

    update_date:2010-09-02 00:00:00

  • Investigation of transmembrane proteins using a computational approach.

    abstract:BACKGROUND:An important subfamily of membrane proteins are the transmembrane alpha-helical proteins, in which the membrane-spanning regions are made up of alpha-helices. Given the obvious biological and medical significance of these proteins, it is of tremendous practical importance to identify the location of transmem...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-S1-S7

    authors: Yang JY,Yang MQ,Dunker AK,Deng Y,Huang X

    update_date:2008-01-01 00:00:00

  • Flux of transcript patterns during soybean seed development.

    abstract:BACKGROUND:To understand gene expression networks leading to functional properties of the soybean seed, we have undertaken a detailed examination of soybean seed development during the stages of major accumulation of oils, proteins, and starches, as well as the desiccating and mature stages, using microarrays consistin...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-136

    authors: Jones SI,Gonzalez DO,Vodkin LO

    update_date:2010-02-24 00:00:00

  • Systems genomics evaluation of the SH-SY5Y neuroblastoma cell line as a model for Parkinson's disease.

    abstract:BACKGROUND:The human neuroblastoma cell line, SH-SY5Y, is a commonly used cell line in studies related to neurotoxicity, oxidative stress, and neurodegenerative diseases. Although this cell line is often used as a cellular model for Parkinson's disease, the relevance of this cellular model in the context of Parkinson's...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-1154

    authors: Krishna A,Biryukov M,Trefois C,Antony PM,Hussong R,Lin J,Heinäniemi M,Glusman G,Köglsberger S,Boyd O,van den Berg BH,Linke D,Huang D,Wang K,Hood L,Tholey A,Schneider R,Galas DJ,Balling R,May P

    update_date:2014-12-20 00:00:00

  • Genome-wide survey of two-component signal transduction systems in the plant growth-promoting bacterium Azospirillum.

    abstract:BACKGROUND:Two-component systems (TCS) play critical roles in sensing and responding to environmental cues. Azospirillum is a plant growth-promoting rhizobacterium living in the rhizosphere of many important crops. Despite numerous studies about its plant beneficial properties, little is known about how the bacterium s...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1962-x

    authors: Borland S,Oudart A,Prigent-Combaret C,Brochier-Armanet C,Wisniewski-Dyé F

    update_date:2015-10-22 00:00:00

  • "Integrative genomic analysis of the bioprospection of regulators and accessory enzymes associated with cellulose degradation in a filamentous fungus (Trichoderma harzianum)".

    abstract:BACKGROUND:Unveiling fungal genome structure and function reveals the potential biotechnological use of fungi. Trichoderma harzianum is a powerful CAZyme-producing fungus. We studied the genomic regions in T. harzianum IOC3844 containing CAZyme genes, transcription factors and transporters. RESULTS:We used bioinformat...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-07158-w

    authors: Ferreira Filho JA,Horta MAC,Dos Santos CA,Almeida DA,Murad NF,Mendes JS,Sforça DA,Silva CBC,Crucello A,de Souza AP

    update_date:2020-11-02 00:00:00

  • Functional elucidation of the non-coding RNAs of Kluyveromyces marxianus in the exponential growth phase.

    abstract:BACKGROUND:Non-coding RNAs (ncRNAs), which perform diverse regulatory roles, have been found in organisms from all superkingdoms of life. However, there have been limited numbers of studies on the functions of ncRNAs, especially in nonmodel organisms such as Kluyveromyces marxianus that is widely used in the field of i...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2474-z

    authors: Cho YB,Lee EJ,Cho S,Kim TY,Park JH,Cho BK

    update_date:2016-02-29 00:00:00

  • Unlocking the mystery of the hard-to-sequence phage genome: PaP1 methylome and bacterial immunity.

    abstract:BACKGROUND:Whole-genome sequencing is an important method to understand the genetic information, gene function, biological characteristics and survival mechanisms of organisms. Sequencing large genomes is very simple at present. However, we encountered a hard-to-sequence genome of Pseudomonas aeruginosa phage PaP1. Sho...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-803

    authors: Lu S,Le S,Tan Y,Li M,Liu C,Zhang K,Huang J,Chen H,Rao X,Zhu J,Zou L,Ni Q,Li S,Wang J,Jin X,Hu Q,Yao X,Zhao X,Zhang L,Huang G,Hu F

    update_date:2014-09-19 00:00:00

  • Transcriptome analysis revealed the drought-responsive genes in Tibetan hulless barley.

    abstract:BACKGROUND:Hulless barley, also called naked barley, is an important cereal crop worldwide, serving as a healthy food both for human consumption and animal feed. Nevertheless, it often suffered from drought stress during its growth and development, resulting in a drastic reduction in barley yields. Therefore, study on ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2685-3

    authors: Zeng X,Bai L,Wei Z,Yuan H,Wang Y,Xu Q,Tang Y,Nyima T

    update_date:2016-05-20 00:00:00

  • Transcriptomic analysis of the lesser spotted catshark (Scyliorhinus canicula) pancreas, liver and brain reveals molecular level conservation of vertebrate pancreas function.

    abstract:BACKGROUND:Understanding the evolution of the vertebrate pancreas is key to understanding its functions. The chondrichthyes (cartilaginous fish such as sharks and rays) have often been suggested to possess the most ancient example of a distinct pancreas with both hormonal (endocrine) and digestive (exocrine) roles. The...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-1074

    authors: Mulley JF,Hargreaves AD,Hegarty MJ,Heller RS,Swain MT

    update_date:2014-12-06 00:00:00

  • MS2CNN: predicting MS/MS spectrum based on protein sequence using deep convolutional neural networks.

    abstract:BACKGROUND:Tandem mass spectrometry allows biologists to identify and quantify protein samples in the form of digested peptide sequences. When performing peptide identification, spectral library search is more sensitive than traditional database search but is limited to peptides that have been previously identified. An...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-6297-6

    authors: Lin YM,Chen CT,Chang JM

    update_date:2019-12-24 00:00:00

  • Differences in DNA curvature-related sequence periodicity between prokaryotic chromosomes and phages, and relationship to chromosomal prophage content.

    abstract:BACKGROUND:Periodic spacing of A-tracts (short runs of A or T) with the DNA helical period of ~10-11 bp is characteristic of intrinsically bent DNA. In eukaryotes, the DNA bending is related to chromatin structure and nucleosome positioning. However, the physiological role of strong sequence periodicity detected in man...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-188

    authors: Abel J,Mrázek J

    update_date:2012-05-15 00:00:00

  • Patterned sequence in the transcriptome of vascular plants.

    abstract:BACKGROUND:Microsatellites (repeated subsequences based on motifs of one to six nucleotides) are widely used as codominant genetic markers because of their frequent polymorphism and relative selective neutrality. Minisatellites are repeats of motifs having seven or more nucleotides. The large number of EST sequences no...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-8-173

    authors: Crane CF

    update_date:2007-06-15 00:00:00

  • High-density 80 K SNP array is a powerful tool for genotyping G. hirsutum accessions and genome analysis.

    abstract:BACKGROUND:High-throughput genotyping platforms play important roles in plant genomic studies. Cotton (Gossypium spp.) is the world's important natural textile fiber and oil crop. Upland cotton accounts for more than 90% of the world's cotton production, however, modern upland cotton cultivars have narrow genetic diver...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-4062-2

    authors: Cai C,Zhu G,Zhang T,Guo W

    update_date:2017-08-23 00:00:00

  • Gene2vec: distributed representation of genes based on co-expression.

    abstract:BACKGROUND:Existing functional description of genes are categorical, discrete, and mostly through manual process. In this work, we explore the idea of gene embedding, distributed representation of genes, in the spirit of word embedding. RESULTS:From a pure data-driven fashion, we trained a 200-dimension vector represe...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-5370-x

    authors: Du J,Jia P,Dai Y,Tao C,Zhao Z,Zhi D

    update_date:2019-02-04 00:00:00

  • The role of retinoic acid in hepatic lipid homeostasis defined by genomic binding and transcriptome profiling.

    abstract:BACKGROUND:The eyes and skin are obvious retinoid target organs. Vitamin A deficiency causes night blindness and retinoids are widely used to treat acne and psoriasis. However, more than 90% of total body retinol is stored in liver stellate cells. In addition, hepatocytes produce the largest amount of retinol binding p...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-575

    authors: He Y,Gong L,Fang Y,Zhan Q,Liu HX,Lu Y,Guo GL,Lehman-McKeeman L,Fang J,Wan YJ

    update_date:2013-08-28 00:00:00

  • Phylogenetic patterns of emergence of new genes support a model of frequent de novo evolution.

    abstract:BACKGROUND:New gene emergence is so far assumed to be mostly driven by duplication and divergence of existing genes. The possibility that entirely new genes could emerge out of the non-coding genomic background was long thought to be almost negligible. With the increasing availability of fully sequenced genomes across ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-117

    authors: Neme R,Tautz D

    update_date:2013-02-21 00:00:00

  • Transcriptomic time-series analysis of early development in olive from germinated embryos to juvenile tree.

    abstract:BACKGROUND:Despite its relevance, almost no studies account for the genetic control in the early stages of tree development, i.e. from germination on. This study seeks to make a quite complete transcriptome for olive development and to elucidate the dynamic regulation of the transcriptomic response during the early-juv...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-5232-6

    authors: Jiménez-Ruiz J,de la O Leyva-Pérez M,Vidoy-Mercado I,Barceló A,Luque F

    update_date:2018-11-19 00:00:00

  • Nonlinear transcriptomic response to dietary fat intake in the small intestine of C57BL/6J mice.

    abstract:BACKGROUND:A high caloric diet, in conjunction with low levels of physical activity, promotes obesity. Many studies are available regarding the relation between dietary saturated fats and the etiology of obesity, but most focus on liver, muscle and white adipose tissue. Furthermore, the majority of transcriptomic studi...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2424-9

    authors: Nyima T,Müller M,Hooiveld GJ,Morine MJ,Scotti M

    update_date:2016-02-09 00:00:00

  • CAMBer: an approach to support comparative analysis of multiple bacterial strains.

    abstract:BACKGROUND:There is a large amount of inconsistency in gene structure annotations of bacterial strains. This inconsistency is a frustrating impedance to effective comparative genomic analysis of bacterial strains in promising applications such as gaining insights into bacterial drug resistance. RESULTS:Here, we propos...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-S2-S6

    authors: Wozniak M,Wong L,Tiuryn J

    update_date:2011-01-01 00:00:00

  • A systematic model of the LC-MS proteomics pipeline.

    abstract:MOTIVATION:Mass spectrometry is a complex technique used for large-scale protein profiling with clinical and pharmaceutical applications. While individual components in the system have been studied extensively, little work has been done to integrate various modules and evaluate them from a systems point of view. RESUL...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-S6-S2

    authors: Sun Y,Braga-Neto U,Dougherty ER

    update_date:2012-01-01 00:00:00

  • A statistical framework for consolidating "sibling" probe sets for Affymetrix GeneChip data.

    abstract:BACKGROUND:Affymetrix GeneChip typically contains multiple probe sets per gene, defined as sibling probe sets in this study. These probe sets may or may not behave similar across treatments. The most appropriate way of consolidating sibling probe sets suitable for analysis is an open problem. We propose the Analysis of...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-188

    authors: Li H,Zhu D,Cook M

    update_date:2008-04-24 00:00:00