Identification of protein features encoded by alternative exons using Exon Ontology.

Abstract:

:Transcriptomic genome-wide analyses demonstrate massive variation of alternative splicing in many physiological and pathological situations. One major challenge is now to establish the biological contribution of alternative splicing variation in physiological- or pathological-associated cellular phenotypes. Toward this end, we developed a computational approach, named "Exon Ontology," based on terms corresponding to well-characterized protein features organized in an ontology tree. Exon Ontology is conceptually similar to Gene Ontology-based approaches but focuses on exon-encoded protein features instead of gene level functional annotations. Exon Ontology describes the protein features encoded by a selected list of exons and looks for potential Exon Ontology term enrichment. By applying this strategy to exons that are differentially spliced between epithelial and mesenchymal cells and after extensive experimental validation, we demonstrate that Exon Ontology provides support to discover specific protein features regulated by alternative splicing. We also show that Exon Ontology helps to unravel biological processes that depend on suites of coregulated alternative exons, as we uncovered a role of epithelial cell-enriched splicing factors in the AKT signaling pathway and of mesenchymal cell-enriched splicing factors in driving splicing events impacting on autophagy. Freely available on the web, Exon Ontology is the first computational resource that allows getting a quick insight into the protein features encoded by alternative exons and investigating whether coregulated exons contain the same biological information.

journal_name

Genome Res

journal_title

Genome research

authors

Tranchevent LC,Aubé F,Dulaurier L,Benoit-Pilven C,Rey A,Poret A,Chautard E,Mortada H,Desmet FO,Chakrama FZ,Moreno-Garcia MA,Goillot E,Janczarski S,Mortreux F,Bourgeois CF,Auboeuf D

doi

10.1101/gr.212696.116

subject

Has Abstract

pub_date

2017-06-01 00:00:00

pages

1087-1097

issue

6

eissn

1088-9051

issn

1549-5469

pii

gr.212696.116

journal_volume

27

pub_type

杂志文章
  • Abundance and length of simple repeats in vertebrate genomes are determined by their structural properties.

    abstract::Microsatellites are abundant in vertebrate genomes, but their sequence representation and length distributions vary greatly within each family of repeats (e.g., tetranucleotides). Biophysical studies of 82 synthetic single-stranded oligonucleotides comprising all tetra- and trinucleotide repeats revealed an inverse co...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.078303.108

    authors: Bacolla A,Larson JE,Collins JR,Li J,Milosavljevic A,Stenson PD,Cooper DN,Wells RD

    更新日期:2008-10-01 00:00:00

  • Sister chromatid telomere fusions, but not NHEJ-mediated inter-chromosomal telomere fusions, occur independently of DNA ligases 3 and 4.

    abstract::Telomeres shorten with each cell division and can ultimately become substrates for nonhomologous end-joining repair, leading to large-scale genomic rearrangements of the kind frequently observed in human cancers. We have characterized more than 1400 telomere fusion events at the single-molecule level, using a combinat...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.200840.115

    authors: Liddiard K,Ruis B,Takasugi T,Harvey A,Ashelford KE,Hendrickson EA,Baird DM

    更新日期:2016-05-01 00:00:00

  • The landscape of histone modifications across 1% of the human genome in five human cell lines.

    abstract::We generated high-resolution maps of histone H3 lysine 9/14 acetylation (H3ac), histone H4 lysine 5/8/12/16 acetylation (H4ac), and histone H3 at lysine 4 mono-, di-, and trimethylation (H3K4me1, H3K4me2, H3K4me3, respectively) across the ENCODE regions. Studying each modification in five human cell lines including th...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.5704207

    authors: Koch CM,Andrews RM,Flicek P,Dillon SC,Karaöz U,Clelland GK,Wilcox S,Beare DM,Fowler JC,Couttet P,James KD,Lefebvre GC,Bruce AW,Dovey OM,Ellis PD,Dhami P,Langford CF,Weng Z,Birney E,Carter NP,Vetrie D,Dunham I

    更新日期:2007-06-01 00:00:00

  • Arboretum: reconstruction and analysis of the evolutionary history of condition-specific transcriptional modules.

    abstract::Comparative functional genomics studies the evolution of biological processes by analyzing functional data, such as gene expression profiles, across species. A major challenge is to compare profiles collected in a complex phylogeny. Here, we present Arboretum, a novel scalable computational algorithm that integrates e...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.146233.112

    authors: Roy S,Wapinski I,Pfiffner J,French C,Socha A,Konieczka J,Habib N,Kellis M,Thompson D,Regev A

    更新日期:2013-06-01 00:00:00

  • Genomic evolution, patterns of global dissemination, and interspecies transmission of human and simian T-cell leukemia/lymphotropic viruses.

    abstract::Using both env and long terminal repeat (LTR) sequences, with maximal representation of genetic diversity within primate strains, we revise and expand the unique evolutionary history of human and simian T-cell leukemia/lymphotropic viruses (HTLV/STLV). Based on the robust application of three different phylogenetic al...

    journal_title:Genome research

    pub_type: 杂志文章,评审

    doi:

    authors: Slattery JP,Franchini G,Gessain A

    更新日期:1999-06-01 00:00:00

  • Natural genetic variation in yeast longevity.

    abstract::The genetics of aging in the yeast Saccharomyces cerevisiae has involved the manipulation of individual genes in laboratory strains. We have instituted a quantitative genetic analysis of the yeast replicative lifespan by sampling the natural genetic variation in a wild yeast isolate. Haploid segregants from a cross be...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.136549.111

    authors: Stumpferl SW,Brand SE,Jiang JC,Korona B,Tiwari A,Dai J,Seo JG,Jazwinski SM

    更新日期:2012-10-01 00:00:00

  • A transposon-based strategy for sequencing repetitive DNA in eukaryotic genomes.

    abstract::Repetitive DNA is a significant component of eukaryotic genomes. We have developed a strategy to efficiently and accurately sequence repetitive DNA in the nematode Caenorhabditis elegans using integrated artificial transposons and automated fluorescent sequencing. Mapping and assembly tools represent important compone...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.7.5.551

    authors: Devine SE,Chissoe SL,Eby Y,Wilson RK,Boeke JD

    更新日期:1997-05-01 00:00:00

  • The repetitive landscape of the chicken genome.

    abstract::Cot-based cloning and sequencing (CBCS) is a powerful tool for isolating and characterizing the various repetitive components of any genome, combining the established principles of DNA reassociation kinetics with high-throughput sequencing. CBCS was used to generate sequence libraries representing the high, middle, an...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.2438004

    authors: Wicker T,Robertson JS,Schulze SR,Feltus FA,Magrini V,Morrison JA,Mardis ER,Wilson RK,Peterson DG,Paterson AH,Ivarie R

    更新日期:2005-01-01 00:00:00

  • Properties of overlapping genes are conserved across microbial genomes.

    abstract::There are numerous examples from the genomes of viruses, mitochondria, and chromosomes that adjacent genes can overlap, sharing at least one nucleotide. Overlaps have been hypothesized to be involved in genome size minimization and as a regulatory mechanism of gene expression. Here we show that overlapping genes are a...

    journal_title:Genome research

    pub_type: 信件

    doi:10.1101/gr.2433104

    authors: Johnson ZI,Chisholm SW

    更新日期:2004-11-01 00:00:00

  • Diversification of transcriptional modulation: large-scale identification and characterization of putative alternative promoters of human genes.

    abstract::By analyzing 1,780,295 5'-end sequences of human full-length cDNAs derived from 164 kinds of oligo-cap cDNA libraries, we identified 269,774 independent positions of transcriptional start sites (TSSs) for 14,628 human RefSeq genes. These TSSs were clustered into 30,964 clusters that were separated from each other by m...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.4039406

    authors: Kimura K,Wakamatsu A,Suzuki Y,Ota T,Nishikawa T,Yamashita R,Yamamoto J,Sekine M,Tsuritani K,Wakaguri H,Ishii S,Sugiyama T,Saito K,Isono Y,Irie R,Kushida N,Yoneyama T,Otsuka R,Kanda K,Yokoi T,Kondo H,Wagatsuma M

    更新日期:2006-01-01 00:00:00

  • A systematic model to predict transcriptional regulatory mechanisms based on overrepresentation of transcription factor binding profiles.

    abstract::An important aspect of understanding a biological pathway is to delineate the transcriptional regulatory mechanisms of the genes involved. Two important tasks are often encountered when studying transcription regulation, i.e., (1) the identification of common transcriptional regulators of a set of coexpressed genes; (...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.4303406

    authors: Chang LW,Nagarajan R,Magee JA,Milbrandt J,Stormo GD

    更新日期:2006-03-01 00:00:00

  • Analysis of 5' junctions of human LINE-1 and Alu retrotransposons suggests an alternative model for 5'-end attachment requiring microhomology-mediated end-joining.

    abstract::Insertion of the human non-LTR retrotransposon LINE-1 (L1) into chromosomal DNA is thought to be initiated by a mechanism called target-primed reverse transcription (TPRT). This mechanism readily accounts for the attachment of the 3'-end of an L1 copy to the genomic target, but the subsequent integration steps leading...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.3421505

    authors: Zingler N,Willhoeft U,Brose HP,Schoder V,Jahns T,Hanschmann KM,Morrish TA,Löwer J,Schumann GG

    更新日期:2005-06-01 00:00:00

  • Deterministic protein inference for shotgun proteomics data provides new insights into Arabidopsis pollen development and function.

    abstract::Pollen, the male gametophyte of flowering plants, represents an ideal biological system to study developmental processes, such as cell polarity, tip growth, and morphogenesis. Upon hydration, the metabolically quiescent pollen rapidly switches to an active state, exhibiting extremely fast growth. This rapid switch req...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.089060.108

    authors: Grobei MA,Qeli E,Brunner E,Rehrauer H,Zhang R,Roschitzki B,Basler K,Ahrens CH,Grossniklaus U

    更新日期:2009-10-01 00:00:00

  • Comparative analysis of mammalian Y chromosomes illuminates ancestral structure and lineage-specific evolution.

    abstract::Although more than thirty mammalian genomes have been sequenced to draft quality, very few of these include the Y chromosome. This has limited our understanding of the evolutionary dynamics of gene persistence and loss, our ability to identify conserved regulatory elements, as well our knowledge of the extent to which...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.154286.112

    authors: Li G,Davis BW,Raudsepp T,Pearks Wilkerson AJ,Mason VC,Ferguson-Smith M,O'Brien PC,Waters PD,Murphy WJ

    更新日期:2013-09-01 00:00:00

  • Multiple waves of recent DNA transposon activity in the bat, Myotis lucifugus.

    abstract::DNA transposons, or class 2 transposable elements, have successfully propagated in a wide variety of genomes. However, it is widely believed that DNA transposon activity has ceased in mammalian genomes for at least the last 40 million years. We recently reported evidence for the relatively recent activity of hAT and H...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.071886.107

    authors: Ray DA,Feschotte C,Pagan HJ,Smith JD,Pritham EJ,Arensburger P,Atkinson PW,Craig NL

    更新日期:2008-05-01 00:00:00

  • A periodic pattern of SNPs in the human genome.

    abstract::By surveying a filtered, high-quality set of SNPs in the human genome, we have found that SNPs positioned 1, 2, 4, 6, or 8 bp apart are more frequent than SNPs positioned 3, 5, 7, or 9 bp apart. The observed pattern is not restricted to genomic regions that are known to cause sequencing or alignment errors, for exampl...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.6223207

    authors: Madsen BE,Villesen P,Wiuf C

    更新日期:2007-10-01 00:00:00

  • Coding and noncoding variants in HFM1, MLH3, MSH4, MSH5, RNF212, and RNF212B affect recombination rate in cattle.

    abstract::We herein study genetic recombination in three cattle populations from France, New Zealand, and the Netherlands. We identify 2,395,177 crossover (CO) events in 94,516 male gametes, and 579,996 CO events in 25,332 female gametes. The average number of COs was found to be larger in males (23.3) than in females (21.4). T...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.204214.116

    authors: Kadri NK,Harland C,Faux P,Cambisano N,Karim L,Coppieters W,Fritz S,Mullaart E,Baurain D,Boichard D,Spelman R,Charlier C,Georges M,Druet T

    更新日期:2016-10-01 00:00:00

  • Screening of gene-associated polymorphisms by use of in-gel competitive reassociation and EST (cDNA) array hybridization.

    abstract::In-gel competitive reassociation (IGCR) is a method of differential subtraction to enrich polymorphic DNA restriction fragments between two DNA samples without probes or specific sequence information. Here, we show that by combining IGCR and expressed sequence tags (EST) array hybridization, polymorphic DNA fragments ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.434103

    authors: Gotoh K,Oishi M

    更新日期:2003-03-01 00:00:00

  • Genome-wide A-to-I RNA editing in fungi independent of ADAR enzymes.

    abstract::Yeasts and filamentous fungi do not have adenosine deaminase acting on RNA (ADAR) orthologs and are believed to lack A-to-I RNA editing, which is the most prevalent editing of mRNA in animals. However, during this study with the PUK1(FGRRES_01058) pseudokinase gene important for sexual reproduction in Fusarium gramine...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.199877.115

    authors: Liu H,Wang Q,He Y,Chen L,Hao C,Jiang C,Li Y,Dai Y,Kang Z,Xu JR

    更新日期:2016-04-01 00:00:00

  • Identification and analysis of internal promoters in Caenorhabditis elegans operons.

    abstract::The current Caenorhabditis elegans genomic annotation has many genes organized in operons. Using directionally stitched promoterGFP methodology, we have conducted the largest survey to date on the regulatory regions of annotated C. elegans operons and identified 65, over 25% of those studied, with internal promoters. ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.6824707

    authors: Huang P,Pleasance ED,Maydan JS,Hunt-Newbury R,O'Neil NJ,Mah A,Baillie DL,Marra MA,Moerman DG,Jones SJ

    更新日期:2007-10-01 00:00:00

  • The first five years of single-cell cancer genomics and beyond.

    abstract::Single-cell sequencing (SCS) is a powerful new tool for investigating evolution and diversity in cancer and understanding the role of rare cells in tumor progression. These methods have begun to unravel key questions in cancer biology that have been difficult to address with bulk tumor measurements. Over the past five...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.191098.115

    authors: Navin NE

    更新日期:2015-10-01 00:00:00

  • Unique DNA methylome profiles in CpG island methylator phenotype colon cancers.

    abstract::A subset of colorectal cancers was postulated to have the CpG island methylator phenotype (CIMP), a higher propensity for CpG island DNA methylation. The validity of CIMP, its molecular basis, and its prognostic value remain highly controversial. Using MBD-isolated genome sequencing, we mapped and compared genome-wide...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.122788.111

    authors: Xu Y,Hu B,Choi AJ,Gopalan B,Lee BH,Kalady MF,Church JM,Ting AH

    更新日期:2012-02-01 00:00:00

  • Dynamic effects of interacting genes underlying rice flowering-time phenotypic plasticity and global adaptation.

    abstract::The phenotypic variation of living organisms is shaped by genetics, environment, and their interaction. Understanding phenotypic plasticity under natural conditions is hindered by the apparently complex environment and the interacting genes and pathways. Herein, we report findings from the dissection of rice flowering...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.255703.119

    authors: Guo T,Mu Q,Wang J,Vanous AE,Onogi A,Iwata H,Li X,Yu J

    更新日期:2020-05-01 00:00:00

  • Transcript assembly improves expression quantification of transposable elements in single-cell RNA-seq data.

    abstract::Transposable elements (TEs) are an integral part of the host transcriptome. TE-containing noncoding RNAs (ncRNAs) show considerable tissue specificity and play important roles during development, including stem cell maintenance and cell differentiation. Recent advances in single-cell RNA-seq (scRNA-seq) revolutionized...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.265173.120

    authors: Shao W,Wang T

    更新日期:2021-01-01 00:00:00

  • rVista for comparative sequence-based discovery of functional transcription factor binding sites.

    abstract::Identifying transcriptional regulatory elements represents a significant challenge in annotating the genomes of higher vertebrates. We have developed a computational tool, rVista, for high-throughput discovery of cis-regulatory elements that combines clustering of predicted transcription factor binding sites (TFBSs) a...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.225502

    authors: Loots GG,Ovcharenko I,Pachter L,Dubchak I,Rubin EM

    更新日期:2002-05-01 00:00:00

  • Molecular genetic maps in wild emmer wheat, Triticum dicoccoides: genome-wide coverage, massive negative interference, and putative quasi-linkage.

    abstract::The main objectives of the study reported here were to construct a molecular map of wild emmer wheat, Triticum dicoccoides, to characterize the marker-related anatomy of the genome, and to evaluate segregation and recombination patterns upon crossing T. dicoccoides with its domesticated descendant Triticum durum (cult...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.150300

    authors: Peng J,Korol AB,Fahima T,Röder MS,Ronin YI,Li YC,Nevo E

    更新日期:2000-10-01 00:00:00

  • Estimating coarse gene network structure from large-scale gene perturbation data.

    abstract::Large scale gene perturbation experiments generate information about the number of genes whose activity is directly or indirectly affected by a gene perturbation. From this information, one can numerically estimate coarse structural network features such as the total number of direct regulatory interactions and the nu...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.193902

    authors: Wagner A

    更新日期:2002-02-01 00:00:00

  • The effect of translocation-induced nuclear reorganization on gene expression.

    abstract::Translocations are known to affect the expression of genes at the breakpoints and, in the case of unbalanced translocations, alter the gene copy number. However, a comprehensive understanding of the functional impact of this class of variation is lacking. Here, we have studied the effect of balanced chromosomal rearra...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.103622.109

    authors: Harewood L,Schütz F,Boyle S,Perry P,Delorenzi M,Bickmore WA,Reymond A

    更新日期:2010-05-01 00:00:00

  • Parallel radiation hybrid mapping: a powerful tool for high-resolution genomic comparison.

    abstract::Comparative gene mapping in mammals typically involves identification of segments of conserved synteny in diverse genomes. The development of maps that permit comparison of gene order within conserved synteny has not advanced beyond the mouse map that takes advantage of linkage analysis in interspecific backcrosses. R...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.8.7.731

    authors: Yang YP,Womack JE

    更新日期:1998-07-01 00:00:00

  • Computational identification of operons in microbial genomes.

    abstract::By applying graph representations to biochemical pathways, a new computational pipeline is proposed to find potential operons in microbial genomes. The algorithm relies on the fact that enzyme genes in operons tend to catalyze successive reactions in metabolic pathways. We applied this algorithm to 42 microbial genome...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.200602

    authors: Zheng Y,Szustakowski JD,Fortnow L,Roberts RJ,Kasif S

    更新日期:2002-08-01 00:00:00