Predicting eukaryotic protein subcellular location by fusing optimized evidence-theoretic K-Nearest Neighbor classifiers.

Abstract:

Facing the explosion of newly generated protein sequences in the post-genomic era, we are challenged to develop an automated method for annotating their subcellular locations quickly and reliably. Knowledge of the subcellular locations of proteins can provide useful hints for revealing their functions and understanding how they interact with each other in cellular networks. Unfortunately, it is both expensive and time-consuming to determine the localization of an uncharacterized protein in a living cell purely by experiment. To tackle this challenge, a novel hybrid classifier was developed by fusing many basic individual classifiers through a voting system. The "engine" of these basic classifiers was the OET-KNN (Optimized Evidence-Theoretic K-Nearest Neighbor) rule. As a demonstration, predictions were performed with the fusion classifier for proteins among the following 16 localizations: (1) cell wall, (2) centriole, (3) chloroplast, (4) cyanelle, (5) cytoplasm, (6) cytoskeleton, (7) endoplasmic reticulum, (8) extracell, (9) Golgi apparatus, (10) lysosome, (11) mitochondria, (12) nucleus, (13) peroxisome, (14) plasma membrane, (15) plastid, and (16) vacuole. To remove redundancy and homology bias, none of the proteins investigated here had ≥25% sequence identity to any other in the same subcellular location. The overall success rates obtained via the jackknife cross-validation test and the independent dataset test were 81.6% and 83.7%, respectively, which were 46-63% higher than those achieved by other existing methods on the same benchmark datasets.
Also, it is clearly elucidated that the overwhelmingly high success rates obtained by the fusion classifier are by no means a trivial utilization of the GO annotations, as is prone to be misinterpreted, because a huge number of proteins have accession numbers and corresponding GO numbers while their subcellular locations are still unknown, and because the percentage of proteins with GO annotations indicating their subcellular components is even smaller than the percentage of proteins with known subcellular location annotation in the Swiss-Prot database. It is anticipated that the powerful fusion classifier may also become a very useful high-throughput tool for characterizing other attributes of proteins according to their sequences, such as enzyme class, membrane protein type, and nuclear receptor subfamily, among many others. A web server called "Euk-OET-PLoc" has been set up at http://202.120.37.186/bioinf/euk-oet for the public to predict subcellular locations of eukaryotic proteins with the fusion OET-KNN classifier.
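
The fusion-by-voting idea and the jackknife test described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation. It assumes a plain (non-optimized) evidence-theoretic k-NN rule with fixed parameters alpha and gamma, Euclidean distance on made-up two-dimensional feature vectors, and basic classifiers that differ only in k. The abstract does not say how the basic classifiers differ or how proteins are encoded, so the function names, parameter values, and toy data below are all hypothetical.

```python
import math
from collections import Counter


def et_knn_predict(train, query, k, alpha=0.95, gamma=1.0):
    """Evidence-theoretic k-NN (simplified): each neighbour supports its own
    class with mass alpha * exp(-gamma * d^2); masses are pooled with
    Dempster's rule. alpha and gamma are fixed here, not optimized."""
    # k nearest training points by squared Euclidean distance
    neighbours = sorted(
        (sum((a - b) ** 2 for a, b in zip(x, query)), label)
        for x, label in train
    )[:k]
    # Per class: product over its neighbours of (1 - mass), i.e. the mass
    # that class leaves on "unknown"
    doubt = {}
    for d2, label in neighbours:
        mass = alpha * math.exp(-gamma * d2)
        doubt[label] = doubt.get(label, 1.0) * (1.0 - mass)
    # Combined belief per class, left unnormalized because only the argmax matters
    belief = {}
    for label in doubt:
        others = math.prod(v for other, v in doubt.items() if other != label)
        belief[label] = (1.0 - doubt[label]) * others
    return max(belief, key=belief.get)


def fused_predict(train, query, ks=(1, 3, 5)):
    """Fuse several basic classifiers (here: different k) by majority vote."""
    votes = Counter(et_knn_predict(train, query, k) for k in ks)
    return votes.most_common(1)[0][0]


def jackknife_accuracy(data, ks=(1, 3, 5)):
    """Leave-one-out test: predict each sample from all the others."""
    hits = 0
    for i, (x, label) in enumerate(data):
        rest = data[:i] + data[i + 1:]
        hits += fused_predict(rest, x, ks) == label
    return hits / len(data)


# Hypothetical toy data: (feature vector, subcellular location)
toy_proteins = [
    ((0.0, 0.1), "nucleus"), ((0.2, 0.0), "nucleus"),
    ((0.1, 0.3), "nucleus"), ((0.3, 0.2), "nucleus"),
    ((2.0, 2.1), "cytoplasm"), ((2.2, 1.9), "cytoplasm"),
    ((1.9, 2.2), "cytoplasm"), ((2.1, 2.0), "cytoplasm"),
]

print(fused_predict(toy_proteins, (0.1, 0.1)))
print(jackknife_accuracy(toy_proteins))
```

In the jackknife loop, each protein is removed in turn and predicted from the remaining ones, which is how the abstract's 81.6% figure is defined. A real application would replace the toy vectors with sequence-derived features and tune the parameters rather than fixing them.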

journal_name

J Proteome Res

authors

Chou KC,Shen HB

doi

10.1021/pr060167c

subject

Has Abstract

pub_date

2006-08-01 00:00:00

pages

1888-97

issue

8

eissn

1535-3893

issn

1535-3907

journal_volume

5

pub_type

Journal Article
  • Quantitative analysis of brain nuclear phosphoproteins identifies developmentally regulated phosphorylation events.

    abstract::Protein phosphorylation is a globally adopted and tightly controlled post-translational modification, and represents one of the most important molecular switching mechanisms that govern the entire spectrum of biological processes. In the central nervous system, it has been demonstrated that phosphorylation of key prot...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr8003198

    authors: Liao L,McClatchy DB,Park SK,Xu T,Lu B,Yates JR 3rd

    update_date:2008-11-01 00:00:00

  • In-gel isoelectric focusing of peptides as a tool for improved protein identification.

    abstract::In the analysis of proteins in complex samples, pre-fractionation is imperative to obtain the necessary depth in the number of reliable protein identifications by mass spectrometry. Here we explore isoelectric focusing of peptides (peptide IEF) as an effective fractionation step that at the same time provides the adde...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr0601180

    authors: Krijgsveld J,Gauci S,Dormeyer W,Heck AJ

    update_date:2006-07-01 00:00:00

  • Unbiased False Discovery Rate Estimation for Shotgun Proteomics Based on the Target-Decoy Approach.

    abstract::Target-decoy approach (TDA) is the dominant strategy for false discovery rate (FDR) estimation in mass-spectrometry-based proteomics. One of its main applications is direct FDR estimation based on counting of decoy matches above a certain score threshold. The corresponding equations are widely employed for filtering o...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.6b00144

    authors: Levitsky LI,Ivanov MV,Lobas AA,Gorshkov MV

    update_date:2017-02-03 00:00:00

  • Special Enrichment Strategies Greatly Increase the Efficiency of Missing Proteins Identification from Regular Proteome Samples.

    abstract::As part of the Chromosome-Centric Human Proteome Project (C-HPP) mission, laboratories all over the world have tried to map the entire missing proteins (MPs) since 2012. On the basis of the first and second Chinese Chromosome Proteome Database (CCPD 1.0 and 2.0) studies, we developed systematic enrichment strategies t...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.5b00481

    authors: Su N,Zhang C,Zhang Y,Wang Z,Fan F,Zhao M,Wu F,Gao Y,Li Y,Chen L,Tian M,Zhang T,Wen B,Sensang N,Xiong Z,Wu S,Liu S,Yang P,Zhen B,Zhu Y,He F,Xu P

    update_date:2015-09-04 00:00:00

  • Metabolic flux analysis and visualization.

    abstract::One of the ultimate goals of systems biology research is to obtain a comprehensive understanding of the control mechanisms of complex cellular metabolisms. Metabolic Flux Analysis (MFA) is an important method for the quantitative estimation of intracellular metabolic flows through metabolic pathways and the elucidation...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr2002885

    authors: Toya Y,Kono N,Arakawa K,Tomita M

    update_date:2011-08-05 00:00:00

  • Quantitative mass spectrometry-based proteomics reveals the dynamic range of primary mouse astrocyte protein secretion.

    abstract::Growing appreciation for astrocytes as active participants in nervous system development, neurovascular metabolic coupling, and neurological disease progression has stimulated recent investigation into specific astrocyte-secreted proteins that may mediate these functions. The current work utilized SILAC-generated isot...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr100134n

    authors: Greco TM,Seeholzer SH,Mak A,Spruce L,Ischiropoulos H

    update_date:2010-05-07 00:00:00

  • Enhanced Validation of Antibodies Enables the Discovery of Missing Proteins.

    abstract::The localization of proteins at a tissue- or cell-type-specific level is tightly linked to the protein function. To better understand each protein's role in cellular systems, spatial information constitutes an important complement to quantitative data. The standard methods for determining the spatial distribution of p...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.0c00486

    authors: Sivertsson Å,Lindström E,Oksvold P,Katona B,Hikmet F,Vuu J,Gustavsson J,Sjöstedt E,von Feilitzen K,Kampf C,Schwenk JM,Uhlén M,Lindskog C

    update_date:2020-12-04 00:00:00

  • Metabolomic Profiling of the Aqueous Humor in Patients with Wet Age-Related Macular Degeneration Using UHPLC-MS/MS.

    abstract::Assessing metabolomic alterations in age-related macular degeneration (AMD) can provide insights into its pathogenesis. We compared the metabolomic profiles of the aqueous humor between wet AMD patients (n = 26) and age- and sex-matched patients undergoing cataract surgery without AMD as controls (n = 20). A global un...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.0c00036

    authors: Han G,Wei P,He M,Teng H,Chu Y

    update_date:2020-06-05 00:00:00

  • Label-Free LC-MS/MS Proteomic Analysis of Cerebrospinal Fluid Identifies Protein/Pathway Alterations and Candidate Biomarkers for Amyotrophic Lateral Sclerosis.

    abstract::Analysis of the cerebrospinal fluid (CSF) proteome has proven valuable to the study of neurodegenerative disorders. To identify new protein/pathway alterations and candidate biomarkers for amyotrophic lateral sclerosis (ALS), we performed comparative proteomic profiling of CSF from sporadic ALS (sALS), healthy control...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.5b00804

    authors: Collins MA,An J,Hood BL,Conrads TP,Bowser RP

    update_date:2015-11-06 00:00:00

  • Characterization of human tear proteome using multiple proteomic analysis techniques.

    abstract::Tear proteome profiling may generate useful information for the understanding of the interaction between an eye and its contacting objects, such as a contact lens or a lens implant. This is important for designing improved eye-care devices and maintaining the health of an eye. Proteome profiles of tear fluids may also...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr0501970

    authors: Li N,Wang N,Zheng J,Liu XM,Lever OW,Erickson PM,Li L

    update_date:2005-11-01 00:00:00

  • Effects of Histidine Supplementation on Global Serum and Urine 1H NMR-based Metabolomics and Serum Amino Acid Profiles in Obese Women from a Randomized Controlled Study.

    abstract::The aim of the current study was to investigate the metabolic changes associated with histidine supplementation in serum and urine metabolic signatures and serum amino acid (AA) profiles. Serum and urine 1H NMR-based metabolomics and serum AA profiles were employed in 32 and 37 obese women with metabolic syndrome (MetS) i...

    journal_title:Journal of proteome research

    pub_type: Journal Article, Randomized Controlled Trial

    doi:10.1021/acs.jproteome.7b00030

    authors: Du S,Sun S,Liu L,Zhang Q,Guo F,Li C,Feng R,Sun C

    update_date:2017-06-02 00:00:00

  • Quantitative analysis of protein complex constituents and their phosphorylation states on a LTQ-Orbitrap instrument.

    abstract::Cellular functions are largely carried out by noncovalent protein complexes that may exist within the cell as stable modules or as assemblies of dynamically changing composition, whose formation and decomposition are triggered in response to extracellular stimuli. The protein constituents of complexes often exhibit po...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr1003888

    authors: Przybylski C,Jünger MA,Aubertin J,Radvanyi F,Aebersold R,Pflieger D

    update_date:2010-10-01 00:00:00

  • Solid Digestion of Demineralized Bone as a Method To Access Potentially Insoluble Proteins and Post-Translational Modifications.

    abstract::Bone proteomics is an expanding field for understanding protein changes associated with disease as well as characterizing and detecting proteins preserved in fossil bone. Most previous studies have utilized a protocol with demineralization and extraction approach to isolate and characterize proteins from bone. Through...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.7b00670

    authors: Cleland TP

    update_date:2018-01-05 00:00:00

  • Liquid- and gas-phase nitration of bovine serum albumin studied by LC-MS and LC-MS/MS using monolithic columns.

    abstract::Post-translational nitration of proteins was analyzed by capillary reversed-phase high-performance liquid chromatography (RP-HPLC) on-line interfaced to electrospray ionization mass spectrometry (ESI-MS) or tandem mass spectrometry (ESI-MS/MS). Both methods were compared using a tryptic digest of bovine serum albumi...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr034034s

    authors: Walcher W,Franze T,Weller MG,Pöschl U,Huber CG

    update_date:2003-09-01 00:00:00

  • Assessing protein patterns in disease using imaging mass spectrometry.

    abstract::Direct tissue profiling and imaging mass spectrometry (MS) provides a detailed assessment of the complex protein pattern within a tissue sample. MALDI MS analysis of thin tissue sections results in over 500 individual protein signals in the mass range of 2 to 70 kDa that directly correlate with protein composition ...

    journal_title:Journal of proteome research

    pub_type: Journal Article, Review

    doi:10.1021/pr0341282

    authors: Chaurand P,Schwartz SA,Caprioli RM

    update_date:2004-03-01 00:00:00

  • Identification of kalirin-7 as a potential post-synaptic density signaling hub.

    abstract::Kalirin-7 (Kal7), a multifunctional Rho GDP/GTP exchange factor (GEF) for Rac1 and RhoG, is embedded in the postsynaptic density at excitatory synapses, where it participates in the formation and maintenance of dendritic spines. Kal7 has been implicated in long-term potentiation, fear memories, and addiction-like beha...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr200088w

    authors: Kiraly DD,Stone KL,Colangelo CM,Abbott T,Wang Y,Mains RE,Eipper BA

    update_date:2011-06-03 00:00:00

  • Differential proteomic shotgun analysis elucidates involvement of water channel aquaporin 8 in presence of α-amylase in the colon.

    abstract::The aquaporin (AQP) family plays a pivotal role in fluid secretion and absorption, especially in the digestive system and secretory glands. Within this family, AQP8 was reported to be widely expressed in the epithelia of the digestive tract, liver, and pancreas. In two parallel experimental platforms with different analyt...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr100789v

    authors: Magdeldin S,Li H,Yoshida Y,Satokata I,Maeda Y,Yokoyama M,Enany S,Zhang Y,Xu B,Fujinaka H,Yaoita E,Yamamoto T

    update_date:2010-12-03 00:00:00

  • Use of lysozyme as a standard for evaluating the effectiveness of a proteomics process.

    abstract::Automated sequencing of unknowns in bottom-up proteomics makes the data produced susceptible to process control errors, which can be propagated into mistakes in analyte identification. Inclusion of an unintrusive internal standard, such as lysozyme, allows monitoring all phases of the proteomics process including samp...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr049819s

    authors: Riter LS,Hodge BD,Gooding KM,Julian RK Jr

    update_date:2005-01-01 00:00:00

  • An Adaptive Pipeline To Maximize Isobaric Tagging Data in Large-Scale MS-Based Proteomics.

    abstract::Isobaric tagging is the method of choice in mass-spectrometry-based proteomics for comparing several conditions at a time. Despite its multiplexing capabilities, some drawbacks appear when multiple experiments are merged for comparison in large sample-size studies due to the presence of missing values, which result fr...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.8b00110

    authors: Corthésy J,Theofilatos K,Mavroudi S,Macron C,Cominetti O,Remlawi M,Ferraro F,Núñez Galindo A,Kussmann M,Likothanassis S,Dayon L

    update_date:2018-06-01 00:00:00

  • Identification of novel proteins from the venom of a cryptic snake Drysdalia coronoides by a combined transcriptomics and proteomics approach.

    abstract::We have investigated the transcriptome and proteome of the venom of a cryptic Australian elapid snake Drysdalia coronoides. To probe into the transcriptome, we constructed a partial cDNA library from the venom gland of D. coronoides. The proteome of the venom of D. coronoides was explored by tryptic digestion of the c...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr1008916

    authors: Chatrath ST,Chapeaurouge A,Lin Q,Lim TK,Dunstan N,Mirtschin P,Kumar PP,Kini RM

    update_date:2011-02-04 00:00:00

  • The cardiovascular risk of healthy individuals studied by NMR metabonomics of plasma samples.

    abstract::The identification and the present wide acceptance of cardiovascular risk factors such as age, sex, hypertension, hyperlipidemia, smoking, obesity, diabetes, and physical inactivity have led to dramatic reductions in cardiovascular morbidity and mortality. However, novel risk predictors present opportunities to identi...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr200452j

    authors: Bernini P,Bertini I,Luchinat C,Tenori L,Tognaccini A

    update_date:2011-11-04 00:00:00

  • Thioridazine Alters the Cell-Envelope Permeability of Mycobacterium tuberculosis.

    abstract::The increasing occurrence of multidrug resistant tuberculosis exerts a major burden on treatment of this infectious disease. Thioridazine, previously used as a neuroleptic, is active against extensively drug resistant tuberculosis when added to other second- and third-line antibiotics. By quantitatively studying the p...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.5b01037

    authors: de Keijzer J,Mulder A,de Haas PE,de Ru AH,Heerkens EM,Amaral L,van Soolingen D,van Veelen PA

    update_date:2016-06-03 00:00:00

  • The effects of rosiglitazone and high glucose on protein expression in endothelial cells.

    abstract::Rosiglitazone is a thiazolidinedione used to treat insulin resistance in diabetes. Although thiazolidinediones may also exert cardiovascular effects, contrasting results were reported. Favorable effects were shown for pioglitazone, whereas adverse reactions were suspected for rosiglitazone. Therefore, a reassessment o...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr900435z

    authors: Millioni R,Puricelli L,Iori E,Arrigoni G,Tessari P

    update_date:2010-01-01 00:00:00

  • Proteomic study of pilocytic astrocytoma pediatric brain tumor intracystic fluid.

    abstract::Liquid chromatography coupled with high-resolution ESI-LTQ-Orbitrap mass spectrometry was applied for a proteomic study of pediatric pilocytic astrocytoma brain tumor intracystic fluid by an integrated top-down/bottom-up platform. The two proteomic strategies proved complementary and supported each other in co...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr500806k

    authors: Inserra I,Iavarone F,Martelli C,D'Angelo L,Delfino D,Rossetti DV,Tamburrini G,Massimi L,Caldarelli M,Di Rocco C,Messana I,Castagnola M,Desiderio C

    update_date:2014-11-07 00:00:00

  • Protein profiling of human pancreatic islets by two-dimensional gel electrophoresis and mass spectrometry.

    abstract::Completion of the human genome sequence has provided scientists with powerful resources with which to explore the molecular events associated with disease states such as diabetes. Understanding the relative levels of expression of gene products, especially of proteins, and their post-translational modifications will b...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr050024a

    authors: Ahmed M,Forsberg J,Bergsten P

    update_date:2005-05-01 00:00:00

  • Proteomic analyses of Caenorhabditis elegans dauer larvae and long-lived daf-2 mutants implicates a shared detoxification system in longevity assurance.

    abstract::The insulin/insulin-like growth factor-1 (IGF-1) signaling system is a public regulator of aging in the model animals Caenorhabditis elegans, Drosophila melanogaster, and Mus musculus. For the first time, proteomic analyses of the environmentally resistant and 'nonaging' C. elegans dauer stage and long-lived daf-2 mut...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr9009639

    authors: Jones LM,Staffa K,Perally S,LaCourse EJ,Brophy PM,Hamilton JV

    update_date:2010-06-04 00:00:00

  • Workflow for Integrated Processing of Multicohort Untargeted 1H NMR Metabolomics Data in Large-Scale Metabolic Epidemiology.

    abstract::Large-scale metabolomics studies involving thousands of samples present multiple challenges in data analysis, particularly when an untargeted platform is used. Studies with multiple cohorts and analysis platforms exacerbate existing problems such as peak alignment and normalization. Therefore, there is a need for robu...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.6b00125

    authors: Karaman I,Ferreira DL,Boulangé CL,Kaluarachchi MR,Herrington D,Dona AC,Castagné R,Moayyeri A,Lehne B,Loh M,de Vries PS,Dehghan A,Franco OH,Hofman A,Evangelou E,Tzoulaki I,Elliott P,Lindon JC,Ebbels TM

    update_date:2016-12-02 00:00:00

  • Pancreatic cancer serum detection using a lectin/glyco-antibody array method.

    abstract::Pancreatic cancer is a formidable disease and early detection biomarkers are needed to make inroads into improving the outcomes in these patients. In this work, lectin antibody microarrays were utilized to detect unique glycosylation patterns of proteins from serum. Antibodies to four potential glycoprotein markers th...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr8007013

    authors: Li C,Simeone DM,Brenner DE,Anderson MA,Shedden KA,Ruffin MT,Lubman DM

    update_date:2009-02-01 00:00:00

  • BioNSi: A Discrete Biological Network Simulator Tool.

    abstract::Modeling and simulation of biological networks is an effective and widely used research methodology. The Biological Network Simulator (BioNSi) is a tool for modeling biological networks and simulating their discrete-time dynamics, implemented as a Cytoscape App. BioNSi includes a visual representation of the network t...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.6b00278

    authors: Rubinstein A,Bracha N,Rudner L,Zucker N,Sloin HE,Chor B

    update_date:2016-08-05 00:00:00

  • Comparison of protein immunoprecipitation-multiple reaction monitoring with ELISA for assay of biomarker candidates in plasma.

    abstract::Quantitative analysis of protein biomarkers in plasma is typically done by ELISA, but this method is limited by the availability of high-quality antibodies. An alternative approach is protein immunoprecipitation combined with multiple reaction monitoring mass spectrometry (IP-MRM). We compared IP-MRM to ELISA for the ...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr400877e

    authors: Lin D,Alborn WE,Slebos RJ,Liebler DC

    update_date:2013-12-06 00:00:00