Proteome-scale analysis of phase-separated proteins in immunofluorescence images.

Abstract:

:Phase separation is an important mechanism that mediates the spatial distribution of proteins in different cellular compartments. While phase-separated proteins share certain sequence characteristics, including intrinsically disordered regions (IDRs) and prion-like domains, such characteristics are insufficient for making accurate predictions; thus, a proteome-wide understanding of phase separation is currently lacking. Here, we define phase-separated proteomes based on the systematic analysis of immunofluorescence images of 12 073 proteins in the Human Protein Atlas. The analysis of these proteins reveals that phase-separated candidate proteins exhibit higher IDR contents, higher mean net charge and lower hydropathy and prefer to bind to RNA. Kinases and transcription factors are also enriched among these candidate proteins. Strikingly, both phase-separated kinases and phase-separated transcription factors display significantly reduced substrate specificity. Our work provides the first global view of the phase-separated proteome and suggests that the spatial proximity resulting from phase separation reduces the requirement for motif specificity and expands the repertoire of substrates. The source code and data are available at https://github.com/cheneyyu/deepphase.

journal_name

Brief Bioinform

authors

Yu C,Shen B,You K,Huang Q,Shi M,Wu C,Chen Y,Zhang C,Li T

doi

10.1093/bib/bbaa187

subject

Has Abstract

pub_date

2020-09-02 00:00:00

eissn

1467-5463

issn

1477-4054

pii

5900570

pub_type

杂志文章
  • Exploring the function of genetic variants in the non-coding genomic regions: approaches for identifying human regulatory variants affecting gene expression.

    abstract::Understanding the genetic basis of human traits/diseases and the underlying mechanisms of how these traits/diseases are affected by genetic variations is critical for public health. Current genome-wide functional genomics data uncovered a large number of functional elements in the noncoding regions of human genome, pr...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bbu018

    authors: Li MJ,Yan B,Sham PC,Wang J

    更新日期:2015-05-01 00:00:00

  • Toward more realistic drug-target interaction predictions.

    abstract::A number of supervised machine learning models have recently been introduced for the prediction of drug-target interactions based on chemical structure and genomic sequence information. Although these models could offer improved means for many network pharmacology applications, such as repositioning of drugs for new t...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbu010

    authors: Pahikkala T,Airola A,Pietilä S,Shakyawar S,Szwajda A,Tang J,Aittokallio T

    更新日期:2015-03-01 00:00:00

  • Fighting against uncertainty: an essential issue in bioinformatics.

    abstract::Many bioinformatics problems, such as sequence alignment, gene prediction, phylogenetic tree estimation and RNA secondary structure prediction, are often affected by the 'uncertainty' of a solution, that is, the probability of the solution is extremely small. This situation arises for estimation problems on high-dimen...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbt038

    authors: Hamada M

    更新日期:2014-09-01 00:00:00

  • Optimization of cell lines as tumour models by integrating multi-omics data.

    abstract::Cell lines are widely used as in vitro models of tumorigenesis. However, an increasing number of researchers have found that cell lines differ from their sourced tumour samples after long-term cell culture. The application of unsuitable cell lines in experiments will affect the experimental accuracy and the treatment ...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bbw082

    authors: Zhao N,Liu Y,Wei Y,Yan Z,Zhang Q,Wu C,Chang Z,Xu Y

    更新日期:2017-05-01 00:00:00

  • A survey of software tools for microRNA discovery and characterization using RNA-seq.

    abstract::Since the small RNA-sequencing (sRNA-seq) technology became available, it allowed the discovery of thousands new microRNAs (miRNAs) in humans and many other species, providing new data on these small RNAs (sRNAs) of high biological and translational relevance. MiRNA discovery has not yet reached saturation, even in th...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbx148

    authors: Bortolomeazzi M,Gaffo E,Bortoluzzi S

    更新日期:2019-05-21 00:00:00

  • Strategies for calibrating models of biology.

    abstract::Computational and mathematical modelling has become a valuable tool for investigating biological systems. Modelling enables prediction of how biological components interact to deliver system-level properties and extrapolation of biological system performance to contexts and experimental conditions where this is unknow...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bby092

    authors: Read MN,Alden K,Timmis J,Andrews PS

    更新日期:2018-09-18 00:00:00

  • Common introns within orthologous genes: software and application to plants.

    abstract::The residence of spliceosomal introns within protein-coding genes can fluctuate over time, with genes gaining, losing or conserving introns in a complex process that is not entirely understood. One approach for studying intron evolution is to compare introns with respect to position and type within closely related gen...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbp051

    authors: Wilkerson MD,Ru Y,Brendel VP

    更新日期:2009-11-01 00:00:00

  • Evaluation of gene-drug common module identification methods using pharmacogenomics data.

    abstract::Accurately identifying the interactions between genomic factors and the response of cancer drugs plays important roles in drug discovery, drug repositioning and cancer treatment. A number of studies revealed that interactions between genes and drugs were 'many-genes-to-many drugs' interactions, i.e. common modules, op...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa087

    authors: Huang J,Chen J,Zhang B,Zhu L,Cai H

    更新日期:2020-06-26 00:00:00

  • Identifying mutual exclusivity across cancer genomes: computational approaches to discover genetic interaction and reveal tumor vulnerability.

    abstract::Systematic sequencing of cancer genomes has revealed prevalent heterogeneity, with patients harboring various combinatorial patterns of genetic alteration. In particular, a phenomenon that a group of genes exhibits mutually exclusive patterns has been widespread across cancers, covering a broad spectrum of crucial can...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbx109

    authors: Deng Y,Luo S,Deng C,Luo T,Yin W,Zhang H,Zhang Y,Zhang X,Lan Y,Ping Y,Xiao Y,Li X

    更新日期:2019-01-18 00:00:00

  • Current development of integrated web servers for preclinical safety and pharmacokinetics assessments in drug development.

    abstract::In drug development, preclinical safety and pharmacokinetics assessments of candidate drugs to ensure the safety profile are a must. While in vivo and in vitro tests are traditionally used, experimental determinations have disadvantages, as they are usually time-consuming and costly. In silico predictions of these pre...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa160

    authors: Hsiao Y,Su BH,Tseng YJ

    更新日期:2020-08-07 00:00:00

  • The computational challenges of applying comparative-based computational methods to whole genomes.

    abstract::The explosion in genomic sequence available in public databases has resulted in an unprecedented opportunity for computational whole genome analyses. A number of promising comparative-based approaches have been developed for gene finding, regulatory element discovery and other purposes, and it is clear that these tool...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/3.1.18

    authors: Dubchak I,Pachter L

    更新日期:2002-03-01 00:00:00

  • The virtual cell--a candidate co-ordinator for 'middle-out' modelling of biological systems.

    abstract::Understanding the functioning of biological systems depends on tackling complexity spanning spatial scales from genome to organ to whole organism. The basic unit of life, the cell, acts to co-ordinate information received across these scales and processes the myriad of signals to produce an integrated cellular respons...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bbp010

    authors: Walker DC,Southgate J

    更新日期:2009-07-01 00:00:00

  • Sequencing technologies and tools for short tandem repeat variation detection.

    abstract::Short tandem repeats are highly polymorphic and associated with a wide range of phenotypic variation, some of which cause neurodegenerative disease in humans. With advances in high-throughput sequencing technologies, there are novel opportunities to study genetic variation. While available sequencing technologies and ...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bbu001

    authors: Cao MD,Balasubramanian S,Bodén M

    更新日期:2015-03-01 00:00:00

  • HITS-PR-HHblits: protein remote homology detection by combining PageRank and Hyperlink-Induced Topic Search.

    abstract::As one of the most important fundamental problems in protein sequence analysis, protein remote homology detection is critical for both theoretical research (protein structure and function studies) and real world applications (drug design). Although several computational predictors have been proposed, their detection p...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bby104

    authors: Liu B,Jiang S,Zou Q

    更新日期:2018-11-07 00:00:00

  • Statistical detection of differentially expressed genes based on RNA-seq: from biological to phylogenetic replicates.

    abstract::RNA-seq has been an increasingly popular high-throughput platform to identify differentially expressed (DE) genes, which is much more reproducible and accurate than the previous microarray technology. Yet, a number of statistical issues remain to be resolved in data analysis, largely due to the high-throughput data vo...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bbv035

    authors: Gu X

    更新日期:2016-03-01 00:00:00

  • Comparison of haplotype-based tests for detecting gene-environment interactions with rare variants.

    abstract::Dissecting the genetic mechanism underlying a complex disease hinges on discovering gene-environment interactions (GXE). However, detecting GXE is a challenging problem especially when the genetic variants under study are rare. Haplotype-based tests have several advantages over the so-called collapsing tests for detec...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbz031

    authors: Papachristou C,Biswas S

    更新日期:2020-05-21 00:00:00

  • Comparative genome assembly.

    abstract::One of the most complex and computationally intensive tasks of genome sequence analysis is genome assembly. Even today, few centres have the resources, in both software and hardware, to assemble a genome from the thousands or millions of individual sequences generated in a whole-genome shotgun sequencing project. With...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/5.3.237

    authors: Pop M,Phillippy A,Delcher AL,Salzberg SL

    更新日期:2004-09-01 00:00:00

  • Comprehensive characterization of tissue-specific circular RNAs in the human and mouse genomes.

    abstract::Circular RNA (circRNA) is a group of RNA family generated by RNA circularization, which was discovered ubiquitously across different species and tissues. However, there is no global view of tissue specificity for circRNAs to date. Here we performed the comprehensive analysis to characterize the features of human and m...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbw081

    authors: Xia S,Feng J,Lei L,Hu J,Xia L,Wang J,Xiang Y,Liu L,Zhong S,Han L,He C

    更新日期:2017-11-01 00:00:00

  • Comparative study of computational methods to detect the correlated reaction sets in biochemical networks.

    abstract::Correlated reaction sets (Co-Sets) are mathematically defined modules in biochemical reaction networks which facilitate the study of biological processes by decomposing complex reaction networks into conceptually simple units. According to the degree of association, Co-Sets can be classified into three types: perfect,...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbp068

    authors: Xi Y,Chen YP,Qian C,Wang F

    更新日期:2011-03-01 00:00:00

  • Identification and comprehensive characterization of lncRNAs with copy number variations and their driving transcriptional perturbed subpathways reveal functional significance for cancer.

    abstract::Numerous studies have shown that copy number variation (CNV) in lncRNA regions play critical roles in the initiation and progression of cancer. However, our knowledge about their functionalities is still limited. Here, we firstly provided a computational method to identify lncRNAs with copy number variation (lncRNAs-C...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbz113

    authors: Xu Y,Wu T,Li F,Dong Q,Wang J,Shang D,Xu Y,Zhang C,Dou Y,Hu C,Yang H,Zheng X,Zhang Y,Wang L,Li X

    更新日期:2020-12-01 00:00:00

  • Comparing enrichment analysis and machine learning for identifying gene properties that discriminate between gene classes.

    abstract::Biologists very often use enrichment methods based on statistical hypothesis tests to identify gene properties that are significantly over-represented in a given set of genes of interest, by comparison with a 'background' set of genes. These enrichment methods, although based on rigorous statistical foundations, are n...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbz028

    authors: Fabris F,Palmer D,de Magalhães JP,Freitas AA

    更新日期:2020-05-21 00:00:00

  • Capacity building for whole genome sequencing of Mycobacterium tuberculosis and bioinformatics in high TB burden countries.

    abstract:BACKGROUND:Whole genome sequencing (WGS) is increasingly used for Mycobacterium tuberculosis (Mtb) research. Countries with the highest tuberculosis (TB) burden face important challenges to integrate WGS into surveillance and research. METHODS:We assessed the global status of Mtb WGS and developed a 3-week training co...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa246

    authors: Rivière E,Heupink TH,Ismail N,Dippenaar A,Clarke C,Abebe G,Heusden P,Warren R,Meehan CJ,Van Rie A

    更新日期:2020-10-03 00:00:00

  • Opportunities for community awareness platforms in personal genomics and bioinformatics education.

    abstract::Precision and personalized medicine will be increasingly based on the integration of various type of information, particularly electronic health records and genome sequences. The availability of cheap genome sequencing services and the information interoperability will increase the role of online bioinformatics analys...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbw078

    authors: Bianchi L,Liò P

    更新日期:2017-11-01 00:00:00

  • Bioinformatics approaches for genomics and post genomics applications of next-generation sequencing.

    abstract::Technical advances such as the development of molecular cloning, Sanger sequencing, PCR and oligonucleotide microarrays are key to our current capacity to sequence, annotate and study complete organismal genomes. Recent years have seen the development of a variety of so-called 'next-generation' sequencing platforms, w...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bbp046

    authors: Horner DS,Pavesi G,Castrignanò T,De Meo PD,Liuni S,Sammeth M,Picardi E,Pesole G

    更新日期:2010-03-01 00:00:00

  • New developments of alignment-free sequence comparison: measures, statistics and next-generation sequencing.

    abstract::With the development of next-generation sequencing (NGS) technologies, a large amount of short read data has been generated. Assembly of these short reads can be challenging for genomes and metagenomes without template sequences, making alignment-based genome sequence comparison difficult. In addition, sequence reads ...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bbt067

    authors: Song K,Ren J,Reinert G,Deng M,Waterman MS,Sun F

    更新日期:2014-05-01 00:00:00

  • Data-driven rational biosynthesis design: from molecules to cell factories.

    abstract::A proliferation of chemical, reaction and enzyme databases, new computational methods and software tools for data-driven rational biosynthesis design have emerged in recent years. With the coming of the era of big data, particularly in the bio-medical field, data-driven rational biosynthesis design could potentially b...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbz065

    authors: Chen F,Yuan L,Ding S,Tian Y,Hu QN

    更新日期:2020-07-15 00:00:00

  • Computational prediction of species-specific yeast DNA replication origin via iterative feature representation.

    abstract::Deoxyribonucleic acid replication is one of the most crucial tasks taking place in the cell, and it has to be precisely regulated. This process is initiated in the replication origins (ORIs), and thus it is essential to identify such sites for a deeper understanding of the cellular processes and functions related to t...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa304

    authors: Manavalan B,Basith S,Shin TH,Lee G

    更新日期:2020-11-25 00:00:00

  • Comparison and integration of computational methods for deleterious synonymous mutation prediction.

    abstract::Synonymous mutations do not change the encoded amino acids but may alter the structure or function of an mRNA in ways that impact gene function. Advances in next generation sequencing technologies have detected numerous synonymous mutations in the human genome. Several computational models have been proposed to predic...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbz047

    authors: Cheng N,Li M,Zhao L,Zhang B,Yang Y,Zheng CH,Xia J

    更新日期:2020-05-21 00:00:00

  • A feature-based approach to predict hot spots in protein-DNA binding interfaces.

    abstract::DNA-binding hot spot residues of proteins are dominant and fundamental interface residues that contribute most of the binding free energy of protein-DNA interfaces. As experimental methods for identifying hot spots are expensive and time consuming, computational approaches are urgently required in predicting hot spots...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbz037

    authors: Zhang S,Zhao L,Zheng CH,Xia J

    更新日期:2020-05-21 00:00:00

  • A review of bioinformatics education in the UK.

    abstract::If the completion of the first draft of the human genome represents the coming of age of bioinformatics, then the emergence of bioinformatics as a university degree subject represents its establishment. In this paper bioinformatics as a subject for formal study is discussed, rather than as a subject for research, and ...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/4.1.7

    authors: Counsell D

    更新日期:2003-03-01 00:00:00