Statistical validation of peptide identifications in large-scale proteomics using the target-decoy database search strategy and flexible mixture modeling.

Abstract:

Reliable statistical validation of peptide and protein identifications is a top priority in large-scale mass spectrometry-based proteomics. PeptideProphet is one of the computational tools commonly used for assessing the statistical confidence in peptide assignments to tandem mass spectra obtained using database search programs such as SEQUEST, MASCOT, or X! TANDEM. We present two flexible methods, the variable component mixture model and the semiparametric mixture model, that remove the restrictive parametric assumptions in the mixture modeling approach of PeptideProphet. Using a control protein mixture data set generated on a linear ion trap Fourier transform (LTQ-FT) mass spectrometer, we demonstrate that both methods improve on parametric models in terms of the accuracy of probability estimates and the power to detect correct identifications while controlling the false discovery rate to the same degree. The statistical approaches presented here require that the data set contain a sufficient number of decoy (known to be incorrect) peptide identifications, which can be obtained using the target-decoy database search strategy.
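To illustrate why decoy identifications are useful, the sketch below shows the simple decoy-counting estimate of the false discovery rate at a score threshold. It is not the variable component or semiparametric mixture model described in the abstract; it only shows how decoy hits can stand in for incorrect target hits. The function name `decoy_fdr`, the example scores, and the assumption that target and decoy databases are of equal size are all illustrative.

```python
from typing import Sequence


def decoy_fdr(scores: Sequence[float],
              is_decoy: Sequence[bool],
              threshold: float) -> float:
    """Estimate the FDR among target hits scoring at or above `threshold`.

    Assumption: target and decoy databases are the same size, so the number
    of decoy hits above the threshold approximates the number of incorrect
    target hits above it.
    """
    n_decoy = sum(1 for s, d in zip(scores, is_decoy) if d and s >= threshold)
    n_target = sum(1 for s, d in zip(scores, is_decoy) if not d and s >= threshold)
    if n_target == 0:
        # No target hits pass the threshold, so there is nothing to mislabel.
        return 0.0
    # Cap at 1.0 because a rate above 100% is not meaningful.
    return min(1.0, n_decoy / n_target)


# Hypothetical search scores; True marks a decoy peptide match.
example_scores = [5.0, 4.0, 3.0, 2.0, 1.0, 0.5]
example_is_decoy = [False, False, True, False, True, True]

fdr_at_2 = decoy_fdr(example_scores, example_is_decoy, 2.0)
```

Lowering the threshold admits more target hits but also more decoys, so the estimated rate rises. Mixture-model approaches such as those in the abstract go further by assigning a probability to each individual identification.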

journal_name

J Proteome Res

authors

Choi H,Ghosh D,Nesvizhskii AI

doi

10.1021/pr7006818

subject

Has Abstract

pub_date

2008-01-01 00:00:00

pages

286-92

issue

1

eissn

1535-3907

issn

1535-3893

journal_volume

7

pub_type

Journal Article
  • Epsilon-Q: An Automated Analyzer Interface for Mass Spectral Library Search and Label-Free Protein Quantification.

    abstract::Mass spectrometry (MS) is a widely used proteome analysis tool for biomedical science. In an MS-based bottom-up proteomic approach to protein identification, sequence database (DB) searching has been routinely used because of its simplicity and convenience. However, searching a sequence DB with multiple variable modif...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.6b01019

    authors: Cho JY,Lee HJ,Jeong SK,Paik YK

    updated:2017-12-01 00:00:00

  • Evolutionary conservation of mammalian sperm proteins associates with overall, not tyrosine, phosphorylation in human spermatozoa.

    abstract::We investigated possible associations between sequence evolution of mammalian sperm proteins and their phosphorylation status in humans. As a reference, spermatozoa from three normozoospermic men were analyzed combining two-dimensional gel electrophoresis, immunoblotting, and mass spectrometry. We identified 99 sperm ...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr400228c

    authors: Schumacher J,Ramljak S,Asif AR,Schaffrath M,Zischler H,Herlyn H

    updated:2013-12-06 00:00:00

  • "Differential Visual Proteomics": Enabling the Proteome-Wide Comparison of Protein Structures of Single-Cells.

    abstract::Proteins are involved in all tasks of life, and their characterization is essential to understand the underlying mechanisms of biological processes. We present a method called "differential visual proteomics" geared to study proteome-wide structural changes of proteins and protein-complexes between a disturbed and an ...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.9b00447

    authors: Syntychaki A,Rima L,Schmidli C,Stohler T,Bieri A,Sütterlin R,Stahlberg H,Castaño-Díez D,Braun T

    updated:2019-09-06 00:00:00

  • New structural proteins of Halobacterium salinarum gas vesicle revealed by comparative proteomics analysis.

    abstract::The Halobacterium salinarum gas vesicle (GV) is an extremely stable intracellular organelle with air trapped inside a proteinaceous membrane. Reported here is a comparative proteomics analysis of GV and GV depleted lysate (GVD) to reveal the membrane structural proteins. Ten proteins encoded by gvp-1 (gvpMLKJIHGFED-1 ...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr1009383

    authors: Chu LJ,Chen MC,Setter J,Tsai YS,Yang H,Fang X,Ting YS,Shaffer SA,Taylor GK,von Haller PD,Goodlett DR,Ng WV

    updated:2011-03-04 00:00:00

  • The COVID-19 Pandemic from a Human Genetic Perspective.

    abstract::The coronavirus disease 2019 (COVID-19), caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has impacted a large portion of the world population. From a virus genetic perspective, a recent study described what genomic data revealed about the origin and emergence of SARS-CoV-2, proposing stronger a...

    journal_title:Journal of proteome research

    pub_type: Journal Article, Review

    doi:10.1021/acs.jproteome.0c00671

    authors: Zhang YM,Wang L,Liu XZ,Zhang H

    updated:2020-11-06 00:00:00

  • Improved Integrated Whole Proteomic and Phosphoproteomic Profiles of Severe Acute Pancreatitis.

    abstract::Severe acute pancreatitis (SAP) is caused by complicated biological factors, and revealing its complex pathogenesis by single-target analysis is difficult. Systematic studies have developed slowly because extraction of degradable pancreatic proteins exposed to multiple proteases is challenging. We present integrated w...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.0c00229

    authors: Wang C,Zhang Y,Tan J,Chen B,Sun L

    updated:2020-06-05 00:00:00

  • Characterization of amyloid beta peptides in cerebrospinal fluid by an automated immunoprecipitation procedure followed by mass spectrometry.

    abstract::Pathogenic events in Alzheimer's disease are believed to involve an imbalance between the production and clearance of the neurotoxic 42 amino acid form of the beta-amyloid peptide (Abeta1-42). Although much is known about the production of Abeta1-42, many questions remain about its degradation. Here, we describe an op...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr0703627

    authors: Portelius E,Tran AJ,Andreasson U,Persson R,Brinkmalm G,Zetterberg H,Blennow K,Westman-Brinkmalm A

    updated:2007-11-01 00:00:00

  • Elucidating Escherichia coli Proteoform Families Using Intact-Mass Proteomics and a Global PTM Discovery Database.

    abstract::A proteoform family is a group of related molecular forms of a protein (proteoforms) derived from the same gene. We have previously described a strategy to identify proteoforms and elucidate proteoform families in complex mixtures of intact proteins. The strategy is based upon measurements of two properties for each p...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.7b00516

    authors: Dai Y,Shortreed MR,Scalf M,Frey BL,Cesnik AJ,Solntsev S,Schaffer LV,Smith LM

    updated:2017-11-03 00:00:00

  • Site-specific IGFBP-1 hyper-phosphorylation in fetal growth restriction: clinical and functional relevance.

    abstract::Phosphorylation enhances IGFBP-1 binding to IGF-I, thereby limiting the bioavailability of IGF-I that may be important in fetal growth. Our goal in this study was to determine whether changes in site-specific IGFBP-1 phosphorylation were unique to fetal growth restriction. To establish a link, we compared IGFBP-1 phos...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr900987n

    authors: Abu Shehab M,Khosravi J,Han VK,Shilton BH,Gupta MB

    updated:2010-04-05 00:00:00

  • Hepatocystin is not secreted in cyst fluid of hepatocystin mutant polycystic liver patients.

    abstract::Autosomal dominant polycystic liver disease (PCLD) is characterized by multiple liver cysts and is caused by mutations in PRKCSH (hepatocystin). Mechanisms of cystogenesis are unknown, but previous studies have shown that hepatocystin is secreted in vitro. The goal of this study was to determine the fate of hepatocyst...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr8000282

    authors: Waanders E,Lameris AL,Op den Camp HJ,Pluk W,Gloerich J,Strijk SP,Drenth JP

    updated:2008-06-01 00:00:00

  • Visualizing and Clustering Protein Similarity Networks: Sequences, Structures, and Functions.

    abstract::Research in the recent decade has demonstrated the usefulness of protein network knowledge in furthering the study of molecular evolution of proteins, understanding the robustness of cells to perturbation, and annotating new protein functions. In this study, we aimed to provide a general clustering approach to visuali...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.5b01031

    authors: Mai TL,Hu GM,Chen CM

    updated:2016-07-01 00:00:00

  • ProXL (Protein Cross-Linking Database): A Platform for Analysis, Visualization, and Sharing of Protein Cross-Linking Mass Spectrometry Data.

    abstract::ProXL is a Web application and accompanying database designed for sharing, visualizing, and analyzing bottom-up protein cross-linking mass spectrometry data with an emphasis on structural analysis and quality control. ProXL is designed to be independent of any particular software pipeline. The import process is simpli...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.6b00274

    authors: Riffle M,Jaschob D,Zelter A,Davis TN

    updated:2016-08-05 00:00:00

  • Proteomics analysis of rice lesion mimic mutant (spl1) reveals tightly localized probenazole-induced protein (PBZ1) in cells undergoing programmed cell death.

    abstract::Numerous reports have predicted/hypothesized a role for probenazole-induced protein (PBZ1) as a molecular marker in rice self-defense mechanism. However, the precise function of PBZ1 remains unknown. In the present study, we examined PBZ1 as a putative cell death marker in rice. For this, we focused our attention on a...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr700878t

    authors: Kim ST,Kim SG,Kang YH,Wang Y,Kim JY,Yi N,Kim JK,Rakwal R,Koh HJ,Kang KY

    updated:2008-04-01 00:00:00

  • Genome-Wide Functional Annotation of Human Protein-Coding Splice Variants Using Multiple Instance Learning.

    abstract::The vast majority of human multiexon genes undergo alternative splicing and produce a variety of splice variant transcripts and proteins, which can perform different functions. These protein-coding splice variants (PCSVs) greatly increase the functional diversity of proteins. Most functional annotation algorithms have...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.5b00883

    authors: Panwar B,Menon R,Eksi R,Li HD,Omenn GS,Guan Y

    updated:2016-06-03 00:00:00

  • Proteomic Characterization of Epithelial-Like Extracellular Vesicles in Advanced Endometrial Cancer.

    abstract::Endometrial cancer (EC) is the most frequent gynecological cancer. Tumor dissemination affecting ∼20% of EC patients is characterized at the primary carcinoma by epithelial-to-mesenchymal transition (EMT) associated with myometrial infiltration. At distant sites, the interaction of circulating tumor cells (CTCs) with ...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.8b00750

    authors: Mariscal J,Fernandez-Puente P,Calamia V,Abalo A,Santacana M,Matias-Guiu X,Lopez-Lopez R,Gil-Moreno A,Alonso-Alconada L,Abal M

    updated:2019-03-01 00:00:00

  • Introduction of the disulfide proteome: application of a technique for the analysis of plant storage proteins as well as allergens.

    abstract::Redox regulation plays an important role across a broad spectrum of biology. Accumulating evidence suggests that thioredoxin, a widely distributed redox enzyme, participates in the redox control of numerous target proteins, thus, playing a key role as a signaling intermediate that senses the metabolic state or environ...

    journal_title:Journal of proteome research

    pub_type: Journal Article, Review

    doi:10.1021/pr8003453

    authors: Yano H,Kuroda S

    updated:2008-08-01 00:00:00

  • Empirical Bayesian random censoring threshold model improves detection of differentially abundant proteins.

    abstract::A challenge in proteomics is that many observations are missing with the probability of missingness increasing as abundance decreases. Adjusting for this informative missingness is required to assess accurately which proteins are differentially abundant. We propose an empirical Bayesian random censoring threshold (EBR...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr500171u

    authors: Koopmans F,Cornelisse LN,Heskes T,Dijkstra TM

    updated:2014-09-05 00:00:00

  • METATRYP v 2.0: Metaproteomic Least Common Ancestor Analysis for Taxonomic Inference Using Specialized Sequence Assemblies-Standalone Software and Web Servers for Marine Microorganisms and Coronaviruses.

    abstract::We present METATRYP version 2 software that identifies shared peptides across the predicted proteomes of organisms within environmental metaproteomics studies to enable accurate taxonomic attribution of peptides during protein inference. Improvements include ingestion of complex sequence assembly data categories (meta...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.0c00385

    authors: Saunders JK,Gaylord DA,Held NA,Symmonds N,Dupont CL,Shepherd A,Kinkade DB,Saito MA

    updated:2020-11-06 00:00:00

  • Transgenomic metabolic interactions in a mouse disease model: interactions of Trichinella spiralis infection with dietary Lactobacillus paracasei supplementation.

    abstract::Irritable Bowel Syndrome (IBS) is a common multifactorial intestinal disorder for which the aetiology remains largely undefined. Here, we have used a Trichinella spiralis (T. spiralis)-induced model of post-infective IBS, and the effects of probiotic bacteria on gut dysfunction have been investigated using a metabonom...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr060157b

    authors: Martin FP,Verdu EF,Wang Y,Dumas ME,Yap IK,Cloarec O,Bergonzelli GE,Corthesy-Theulaz I,Kochhar S,Holmes E,Lindon JC,Collins SM,Nicholson JK

    updated:2006-09-01 00:00:00

  • Identification of novel proteins from the venom of a cryptic snake Drysdalia coronoides by a combined transcriptomics and proteomics approach.

    abstract::We have investigated the transcriptome and proteome of the venom of a cryptic Australian elapid snake Drysdalia coronoides. To probe into the transcriptome, we constructed a partial cDNA library from the venom gland of D. coronoides. The proteome of the venom of D. coronoides was explored by tryptic digestion of the c...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr1008916

    authors: Chatrath ST,Chapeaurouge A,Lin Q,Lim TK,Dunstan N,Mirtschin P,Kumar PP,Kini RM

    updated:2011-02-04 00:00:00

  • Compositional Proteomics: Effects of Spatial Constraints on Protein Quantification Utilizing Isobaric Tags.

    abstract::Mass spectrometry (MS) has become an accessible tool for whole proteome quantitation with the ability to characterize protein expression across thousands of proteins within a single experiment. A subset of MS quantification methods (e.g., SILAC and label-free) monitor the relative intensity of intact peptides, where t...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.7b00699

    authors: O'Brien JJ,O'Connell JD,Paulo JA,Thakurta S,Rose CM,Weekes MP,Huttlin EL,Gygi SP

    updated:2018-01-05 00:00:00

  • Global Analysis of Protein Lysine Succinylation Profiles and Their Overlap with Lysine Acetylation in the Marine Bacterium Vibrio parahemolyticus.

    abstract::Protein lysine acylation, including acetylation and succinylation, has been found to be a major post-translational modification (PTM) and is associated with the regulation of cellular processes that are widespread in bacteria. Vibrio parahemolyticus is a model marine bacterium that causes seafood-borne illness in huma...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.5b00485

    authors: Pan J,Chen R,Li C,Li W,Ye Z

    updated:2015-10-02 00:00:00

  • Serum metabolic signatures of fulminant type 1 diabetes.

    abstract::Fulminant type 1 diabetes (FT1DM) is a relatively new clinical entity featured by acute destruction of pancreatic beta cells. Clinical consequences of FT1DM could be fatal when timely medications are not provided, suggesting the particular importance of rapid and accurate diagnosis. Here we report a serum metabonomics...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr300523x

    authors: Lu J,Zhou J,Bao Y,Chen T,Zhang Y,Zhao A,Qiu Y,Xie G,Wang C,Jia W,Jia W

    updated:2012-09-07 00:00:00

  • abFASP-MS: affinity-based filter-aided sample preparation mass spectrometry for quantitative analysis of chemically labeled protein complexes.

    abstract::Affinity purification coupled to 1-D gel-free liquid chromatography mass spectrometry (LC-MS) is a well-established and widespread approach for the analyses of noncovalently interacting protein complexes. In this study, two proteins conjugated to a streptavidin-binding peptide and hemagglutinin double tag were express...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr4009892

    authors: Huber ML,Sacco R,Parapatics K,Skucha A,Khamina K,Müller AC,Rudashevskaya EL,Bennett KL

    updated:2014-02-07 00:00:00

  • Mass spectrometry characterization of species-specific peptides from arginine kinase for the identification of commercially relevant shrimp species.

    abstract::The identification of commercial shrimp species is a relevant issue to ensure correct labeling, maintain consumer confidence and enhance the knowledge of the captured species, benefiting both, fisheries and manufacturers. A proteomic approach, based on 2DE, tryptic in-gel digestion, MALDI-TOF MS, and ESI-MS/MS analyse...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr900663d

    authors: Ortea I,Cañas B,Gallardo JM

    updated:2009-11-01 00:00:00

  • Prediction of Protein Lysine Acylation by Integrating Primary Sequence Information with Multiple Functional Features.

    abstract::Liquid chromatography-tandem mass spectrometry (LC-MS/MS)-based proteomic methods have been widely used to identify lysine acylation proteins. However, these experimental approaches often fail to detect proteins that are in low abundance or absent in specific biological samples. To circumvent these problems, we develo...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.6b00240

    authors: Du Y,Zhai Z,Li Y,Lu M,Cai T,Zhou B,Huang L,Wei T,Li T

    updated:2016-12-02 00:00:00

  • Integrative Omics Analysis Reveals Post-Transcriptionally Enhanced Protective Host Response in Colorectal Cancers with Microsatellite Instability.

    abstract::Microsatellite instability (MSI) is a frequent and clinically relevant molecular phenotype in colorectal cancer. MSI cancers have favorable survival compared with microsatellite stable cancers (MSS), possibly due to the pronounced tumor-infiltrating lymphocytes observed in MSI cancers. Consistent with the strong immun...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.5b00847

    authors: Liu Q,Zhang B

    updated:2016-03-04 00:00:00

  • Unbiased False Discovery Rate Estimation for Shotgun Proteomics Based on the Target-Decoy Approach.

    abstract::Target-decoy approach (TDA) is the dominant strategy for false discovery rate (FDR) estimation in mass-spectrometry-based proteomics. One of its main applications is direct FDR estimation based on counting of decoy matches above a certain score threshold. The corresponding equations are widely employed for filtering o...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/acs.jproteome.6b00144

    authors: Levitsky LI,Ivanov MV,Lobas AA,Gorshkov MV

    updated:2017-02-03 00:00:00

  • Early events in plastid protein degradation in stay-green Arabidopsis reveal differential regulation beyond the retention of LHCII and chlorophyll.

    abstract::An individually darkened leaf model was used to study protein changes in the Arabidopsis mutant stay-green1 (sgr1) to partially mimic the process of leaf covering senescence that occurs naturally in the shaded rosettes of Arabidopsis plants. Utilizing this controlled and predictable induced senescence model has allowe...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr300691k

    authors: Grassl J,Pružinská A,Hörtensteiner S,Taylor NL,Millar AH

    updated:2012-11-02 00:00:00

  • Effect of amphotericin B on the metabolic profiles of Candida albicans.

    abstract::Amphotericin B (AmB) is a polyene antifungal drug widely used for systemic fungal infections. In this study, a metabonomic method using gas chromatography-mass spectrometry (GC/MS) was developed to characterize the metabolic profiles of Candida albicans cells exposed to AmB. Thirty-one differentially produced metaboli...

    journal_title:Journal of proteome research

    pub_type: Journal Article

    doi:10.1021/pr4002178

    authors: Cao Y,Zhu Z,Chen X,Yao X,Zhao L,Wang H,Yan L,Wu H,Chai Y,Jiang Y

    updated:2013-06-07 00:00:00