Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning.

Abstract:

:In mass-spectrometry-based proteomics, the identification and quantification of peptides and proteins heavily rely on sequence database searching or spectral library matching. The lack of accurate predictive models for fragment ion intensities impairs the realization of the full potential of these approaches. Here, we extended the ProteomeTools synthetic peptide library to 550,000 tryptic peptides and 21 million high-quality tandem mass spectra. We trained a deep neural network, termed Prosit, resulting in chromatographic retention time and fragment ion intensity predictions that exceed the quality of the experimental data. Integrating Prosit into database search pipelines led to more identifications at >10× lower false discovery rates. We show the general applicability of Prosit by predicting spectra for proteases other than trypsin, generating spectral libraries for data-independent acquisition and improving the analysis of metaproteomes. Prosit is integrated into ProteomicsDB, allowing search result re-scoring and custom spectral library generation for any organism on the basis of peptide sequence alone.

journal_name

Nat Methods

journal_title

Nature methods

authors

Gessulat S,Schmidt T,Zolg DP,Samaras P,Schnatbaum K,Zerweck J,Knaute T,Rechenberger J,Delanghe B,Huhmer A,Reimer U,Ehrlich HC,Aiche S,Kuster B,Wilhelm M

doi

10.1038/s41592-019-0426-7

subject

Has Abstract

pub_date

2019-06-01 00:00:00

pages

509-518

issue

6

eissn

1548-7091

issn

1548-7105

pii

10.1038/s41592-019-0426-7

journal_volume

16

pub_type

杂志文章
  • Cumulus provides cloud-based data analysis for large-scale single-cell and single-nucleus RNA-seq.

    abstract::Massively parallel single-cell and single-nucleus RNA sequencing has opened the way to systematic tissue atlases in health and disease, but as the scale of data generation is growing, so is the need for computational pipelines for scaled analysis. Here we developed Cumulus-a cloud-based framework for analyzing large-s...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/s41592-020-0905-x

    authors: Li B,Gould J,Yang Y,Sarkizova S,Tabaka M,Ashenberg O,Rosen Y,Slyper M,Kowalczyk MS,Villani AC,Tickle T,Hacohen N,Rozenblatt-Rosen O,Regev A

    更新日期:2020-08-01 00:00:00

  • FRETting for a more detailed interactome.

    abstract::A quantitative high-throughput FRET-based method of screening fluorescent protein fusion libraries brings the promise of more detailed interactome maps. ...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth0207-112b

    authors: Kaganman I

    更新日期:2007-02-01 00:00:00

  • A photoprotection strategy for microsecond-resolution single-molecule fluorescence spectroscopy.

    abstract::Time resolution of current single-molecule fluorescence techniques is limited to milliseconds because of dye blinking and bleaching. Here we introduce a photoprotection strategy that affords microsecond resolution by combining efficient triplet quenching by oxygen and Trolox with minimized bleaching via the oxygen rad...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth.1553

    authors: Campos LA,Liu J,Wang X,Ramanathan R,English DS,Muñoz V

    更新日期:2011-02-01 00:00:00

  • Probabilistic cell-type assignment of single-cell RNA-seq for tumor microenvironment profiling.

    abstract::Single-cell RNA sequencing has enabled the decomposition of complex tissues into functionally distinct cell types. Often, investigators wish to assign cells to cell types through unsupervised clustering followed by manual annotation or via 'mapping' to existing data. However, manual interpretation scales poorly to lar...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/s41592-019-0529-1

    authors: Zhang AW,O'Flanagan C,Chavez EA,Lim JLP,Ceglia N,McPherson A,Wiens M,Walters P,Chan T,Hewitson B,Lai D,Mottok A,Sarkozy C,Chong L,Aoki T,Wang X,Weng AP,McAlpine JN,Aparicio S,Steidl C,Campbell KR,Shah SP

    更新日期:2019-10-01 00:00:00

  • In vivo imaging of extracellular matrix remodeling by tumor-associated fibroblasts.

    abstract::Here we integrated multiphoton laser scanning microscopy and the registration of second harmonic generation images of collagen fibers to overcome difficulties in tracking stromal cell-matrix interactions for several days in live mice. We show that the matrix-modifying hormone relaxin increased tumor-associated fibrobl...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth.1295

    authors: Perentes JY,McKee TD,Ley CD,Mathiew H,Dawson M,Padera TP,Munn LL,Jain RK,Boucher Y

    更新日期:2009-02-01 00:00:00

  • Putative cell type discovery from single-cell gene expression data.

    abstract::We present the Single-Cell Clustering Assessment Framework, a method for the automated identification of putative cell types from single-cell RNA sequencing (scRNA-seq) data. By iteratively applying a machine learning approach to a given set of cells, we simultaneously identify distinct cell groups and a weighted list...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/s41592-020-0825-9

    authors: Miao Z,Moreno P,Huang N,Papatheodorou I,Brazma A,Teichmann SA

    更新日期:2020-06-01 00:00:00

  • Onco-proteogenomics: cancer proteomics joins forces with genomics.

    abstract::The complexities of tumor genomes are rapidly being uncovered, but how they are regulated into functional proteomes remains poorly understood. Standard proteomics workflows use databases of known proteins, but these databases do not capture the uniqueness of the cancer transcriptome, with its point mutations, unusual ...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth.3138

    authors: Alfaro JA,Sinha A,Kislinger T,Boutros PC

    更新日期:2014-11-01 00:00:00

  • De novo assembly and analysis of RNA-seq data.

    abstract::We describe Trans-ABySS, a de novo short-read transcriptome assembly and analysis pipeline that addresses variation in local read densities by assembling read substrings with varying stringencies and then merging the resulting contigs before analysis. Analyzing 7.4 gigabases of 50-base-pair paired-end Illumina reads f...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth.1517

    authors: Robertson G,Schein J,Chiu R,Corbett R,Field M,Jackman SD,Mungall K,Lee S,Okada HM,Qian JQ,Griffith M,Raymond A,Thiessen N,Cezard T,Butterfield YS,Newsome R,Chan SK,She R,Varhol R,Kamoh B,Prabhu AL,Tam A,Zhao Y

    更新日期:2010-11-01 00:00:00

  • Binning metagenomic contigs by coverage and composition.

    abstract::Shotgun sequencing enables the reconstruction of genomes from complex microbial communities, but because assembly does not reconstruct entire genomes, it is necessary to bin genome fragments. Here we present CONCOCT, a new algorithm that combines sequence composition and coverage across multiple samples, to automatica...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth.3103

    authors: Alneberg J,Bjarnason BS,de Bruijn I,Schirmer M,Quick J,Ijaz UZ,Lahti L,Loman NJ,Andersson AF,Quince C

    更新日期:2014-11-01 00:00:00

  • Single-cell systems biology by super-resolution imaging and combinatorial labeling.

    abstract::Fluorescence microscopy is a powerful quantitative tool for exploring regulatory networks in single cells. However, the number of molecular species that can be measured simultaneously is limited by the spectral overlap between fluorophores. Here we demonstrate a simple but general strategy to drastically increase the ...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth.2069

    authors: Lubeck E,Cai L

    更新日期:2012-06-03 00:00:00

  • In vivo three-photon imaging of activity of GCaMP6-labeled neurons deep in intact mouse brain.

    abstract::High-resolution optical imaging is critical to understanding brain function. We demonstrate that three-photon microscopy at 1,300-nm excitation enables functional imaging of GCaMP6s-labeled neurons beyond the depth limit of two-photon microscopy. We record spontaneous activity from up to 150 neurons in the hippocampal...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth.4183

    authors: Ouzounov DG,Wang T,Wang M,Feng DD,Horton NG,Cruz-Hernández JC,Cheng YT,Reimer J,Tolias AS,Nishimura N,Xu C

    更新日期:2017-04-01 00:00:00

  • Glycomics: an integrated systems approach to structure-function relationships of glycans.

    abstract::In comparison with genomics and proteomics, the advancement of glycomics has faced unique challenges in the pursuit of developing analytical and biochemical tools and biological readouts to investigate glycan structure-function relationships. Glycans are more diverse in terms of chemical structure and information dens...

    journal_title:Nature methods

    pub_type: 杂志文章,评审

    doi:10.1038/nmeth807

    authors: Raman R,Raguram S,Venkataraman G,Paulson JC,Sasisekharan R

    更新日期:2005-11-01 00:00:00

  • Quantifying small numbers of antibodies with a 'near-universal' protein-DNA chimera.

    abstract::We present general means to greatly increase the sensitivity of antibody-based assays. Augmentation relies on a 'tadpole' protein-DNA chimera whose protein moiety binds most classes of mammalian antibodies but not avian immunoglobulin Y (IgY). We used this tadpole in affinity capture assays followed by real-time PCR t...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth1127

    authors: Burbulis I,Yamaguchi K,Yu R,Resnekov O,Brent R

    更新日期:2007-12-01 00:00:00

  • NeuroGPS-Tree: automatic reconstruction of large-scale neuronal populations with dense neurites.

    abstract::The reconstruction of neuronal populations, a key step in understanding neural circuits, remains a challenge in the presence of densely packed neurites. Here we achieved automatic reconstruction of neuronal populations by partially mimicking human strategies to separate individual neurons. For populations not resolvab...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth.3662

    authors: Quan T,Zhou H,Li J,Li S,Li A,Li Y,Lv X,Luo Q,Gong H,Zeng S

    更新日期:2016-01-01 00:00:00

  • All-optical electrophysiology in mammalian neurons using engineered microbial rhodopsins.

    abstract::All-optical electrophysiology-spatially resolved simultaneous optical perturbation and measurement of membrane voltage-would open new vistas in neuroscience research. We evolved two archaerhodopsin-based voltage indicators, QuasAr1 and QuasAr2, which show improved brightness and voltage sensitivity, have microsecond r...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth.3000

    authors: Hochbaum DR,Zhao Y,Farhi SL,Klapoetke N,Werley CA,Kapoor V,Zou P,Kralj JM,Maclaurin D,Smedemark-Margulies N,Saulnier JL,Boulting GL,Straub C,Cho YK,Melkonian M,Wong GK,Harrison DJ,Murthy VN,Sabatini BL,Boyden ES,C

    更新日期:2014-08-01 00:00:00

  • A naturally monomeric infrared fluorescent protein for protein labeling in vivo.

    abstract::Infrared fluorescent proteins (IFPs) provide an additional color to GFP and its homologs in protein labeling. Drawing on structural analysis of the dimer interface, we identified a bacteriophytochrome in the sequence database that is monomeric in truncated form and engineered it into a naturally monomeric IFP (mIFP). ...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth.3447

    authors: Yu D,Baird MA,Allen JR,Howe ES,Klassen MP,Reade A,Makhijani K,Song Y,Liu S,Murthy Z,Zhang SQ,Weiner OD,Kornberg TB,Jan YN,Davidson MW,Shu X

    更新日期:2015-08-01 00:00:00

  • MAZe: a tool for mosaic analysis of gene function in zebrafish.

    abstract::To trace cell lineages in a developing vertebrate and to observe, in vivo, how behaviors of individual cells are affected by the genes they express, we created a zebrafish line containing a transgene called mosaic analysis in zebrafish (MAZe), built around a self-excising hsp70:Cre cassette. Heat shock triggers Cre re...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth.1423

    authors: Collins RT,Linker C,Lewis J

    更新日期:2010-03-01 00:00:00

  • Blotting protein complexes from native gels to electron microscopy grids.

    abstract::We report a simple and generic method for the direct transfer of protein complexes separated by native gel electrophoresis to electron microscopy grids. After transfer, sufficient material remains in the gel for identification and characterization by mass spectrometry. The method should facilitate higher-throughput si...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth.1840

    authors: Knispel RW,Kofler C,Boicu M,Baumeister W,Nickell S

    更新日期:2012-01-08 00:00:00

  • Segregation of molecules at cell division reveals native protein localization.

    abstract::We introduce a nonintrusive method exploiting single-cell variability after cell division to validate protein localization. We found that Clp proteases, widely reported to form biologically relevant foci, were uniformly distributed in Escherichia coli cells, and that many commonly used fluorescent proteins caused seve...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth.1955

    authors: Landgraf D,Okumus B,Chien P,Baker TA,Paulsson J

    更新日期:2012-04-08 00:00:00

  • Automated mapping of phenotype space with single-cell data.

    abstract::Accurate identification of cell subsets in complex populations is key to discovering novelty in multidimensional single-cell experiments. We present X-shift (http://web.stanford.edu/~samusik/vortex/), an algorithm that processes data sets using fast k-nearest-neighbor estimation of cell event density and arranges popu...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth.3863

    authors: Samusik N,Good Z,Spitzer MH,Davis KL,Nolan GP

    更新日期:2016-06-01 00:00:00

  • Efficient and quantitative high-throughput tRNA sequencing.

    abstract::Despite its biological importance, tRNA has not been adequately sequenced by standard methods because of its abundant post-transcriptional modifications and stable structure, which interfere with cDNA synthesis. We achieved efficient and quantitative tRNA sequencing in HEK293T cells by using engineered demethylases to...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth.3478

    authors: Zheng G,Qin Y,Clark WC,Dai Q,Yi C,He C,Lambowitz AM,Pan T

    更新日期:2015-09-01 00:00:00

  • Author Correction: Genetically engineered cerebral organoids model brain tumor formation.

    abstract::In the originally published paper, the "before" image for the afatinib condition in Fig. 6c was incorrect. Instead of an image displaying a GBM-3 neoplastic organoid before afatinib treatment, this panel showed an image from the GBM-2 control (DMSO) group before treatment. This error has now been corrected in the HTML...

    journal_title:Nature methods

    pub_type: 已发布勘误

    doi:10.1038/s41592-018-0118-8

    authors: Bian S,Repic M,Guo Z,Kavirayani A,Burkard T,Bagley JA,Krauditsch C,Knoblich JA

    更新日期:2018-09-01 00:00:00

  • PyClone: statistical inference of clonal population structure in cancer.

    abstract::We introduce PyClone, a statistical model for inference of clonal population structures in cancers. PyClone is a Bayesian clustering method for grouping sets of deeply sequenced somatic mutations into putative clonal clusters while estimating their cellular prevalences and accounting for allelic imbalances introduced ...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth.2883

    authors: Roth A,Khattra J,Yap D,Wan A,Laks E,Biele J,Ha G,Aparicio S,Bouchard-Côté A,Shah SP

    更新日期:2014-04-01 00:00:00

  • Imaging without lenses: achievements and remaining challenges of wide-field on-chip microscopy.

    abstract::We discuss unique features of lens-free computational imaging tools and report some of their emerging results for wide-field on-chip microscopy, such as the achievement of a numerical aperture (NA) of ∼0.8-0.9 across a field of view (FOV) of more than 20 mm(2) or an NA of ∼0.1 across a FOV of ∼18 cm(2), which correspo...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth.2114

    authors: Greenbaum A,Luo W,Su TW,Göröcs Z,Xue L,Isikman SO,Coskun AF,Mudanyali O,Ozcan A

    更新日期:2012-09-01 00:00:00

  • Gas cluster ion beam SEM for imaging of large tissue samples with 10 nm isotropic resolution.

    abstract::We demonstrate gas cluster ion beam scanning electron microscopy (SEM), in which wide-area ion milling is performed on a series of thick tissue sections. This three-dimensional electron microscopy technique acquires datasets with <10 nm isotropic resolution of each section, and these can then be stitched together to s...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/s41592-019-0641-2

    authors: Hayworth KJ,Peale D,Januszewski M,Knott GW,Lu Z,Xu CS,Hess HF

    更新日期:2020-01-01 00:00:00

  • Atomic-accuracy models from 4.5-Å cryo-electron microscopy data with density-guided iterative local refinement.

    abstract::We describe a general approach for refining protein structure models on the basis of cryo-electron microscopy maps with near-atomic resolution. The method integrates Monte Carlo sampling with local density-guided optimization, Rosetta all-atom refinement and real-space B-factor fitting. In tests on experimental maps o...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth.3286

    authors: DiMaio F,Song Y,Li X,Brunner MJ,Xu C,Conticello V,Egelman E,Marlovits T,Cheng Y,Baker D

    更新日期:2015-04-01 00:00:00

  • Membrane-protein structure determination by solid-state NMR spectroscopy of microcrystals.

    abstract::Membrane proteins are largely underrepresented among available atomic-resolution structures. The use of detergents in protein purification procedures hinders the formation of well-ordered crystals for X-ray crystallography and leads to slower molecular tumbling, impeding the application of solution-state NMR. Solid-st...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth.2248

    authors: Shahid SA,Bardiaux B,Franks WT,Krabben L,Habeck M,van Rossum BJ,Linke D

    更新日期:2012-12-01 00:00:00

  • Visualizing a one-way protein encounter complex by ultrafast single-molecule mixing.

    abstract::We combined rapid microfluidic mixing with single-molecule fluorescence resonance energy transfer to study the folding kinetics of the intrinsically disordered human protein α-synuclein. The time-resolution of 0.2 ms revealed initial collapse of the unfolded protein induced by binding with lipid mimics and subsequent ...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth.1568

    authors: Gambin Y,VanDelinder V,Ferreon AC,Lemke EA,Groisman A,Deniz AA

    更新日期:2011-03-01 00:00:00

  • scGen predicts single-cell perturbation responses.

    abstract::Accurately modeling cellular response to perturbations is a central goal of computational biology. While such modeling has been based on statistical, mechanistic and machine learning models in specific settings, no generalization of predictions to phenomena absent from training data (out-of-sample) has yet been demons...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/s41592-019-0494-8

    authors: Lotfollahi M,Wolf FA,Theis FJ

    更新日期:2019-08-01 00:00:00

  • Overcoming protein diversity on arrays.

    abstract::Researchers show that to reliably establish protein interaction networks from microarray data it is necessary to measure saturation binding curves for each arrayed protein and its potential binding partners. ...

    journal_title:Nature methods

    pub_type: 杂志文章

    doi:10.1038/nmeth1206-966

    authors: Rusk N

    更新日期:2006-12-01 00:00:00