Has the yo-yo stopped? An assessment of human protein-coding gene number.

Abstract:

:Since the identification of approximately 25,000 proteins from the draft human genome assembly in 2001, estimates of the total have oscillated between 30,000 and 70,000. The recently announced genome closure has not generated a consensus gene count despite this being a key parameter for many areas of biology including drug target discovery and characterization of the human proteome. Contrary to earlier predictions of constitutive under-detection for eukaryotic genes, the latest model organism updates have produced minor increases in the worm but fly and yeast gene numbers have decreased. The postdraft, precompletion interval has produced large increases in human transcript coverage, continuous improvements in genome assembly and refinements in automated genomic annotation. Notably these enhancements have resulted in an Ensembl human protein-coding gene number of 22,184, a decrease of 1862 since the first release. Longitudinal database surveys indicate that redundancy-reduced human mRNA and protein collections are flattening out at approximately 28,000, although Ensembl maps approximately 20,000 known sequences. Observations suggest high-throughput cloning projects are predominantly extending known genes or sampling new splice forms and novel protein discovery has slowed to a trickle. The hypothesis that substantial numbers of short proteins remain experimentally and computationally undetected in mammalian genomes is neither supported by sequence data nor by the extensive homology between mouse and human proteins. Aggregating the independent annotations for complete transcripts from seven completed human chromosomes extrapolates to approximately 25,000 genes. The inclusion of partial putative genes would increase this to above 30,000 but recent data suggest these represent predominantly nonprotein-coding transcripts. Mass spectrometry-based proteomics has already verified more than 10% of human genes but has not identified significant numbers of unpredicted proteins. The available data are thus converging to a basal protein-coding gene number well below 30,000, which could even be as low as 25,000.

journal_name

Proteomics

journal_title

Proteomics

authors

Southan C

doi

10.1002/pmic.200300700

subject

Has Abstract

pub_date

2004-06-01 00:00:00

pages

1712-26

issue

6

eissn

1615-9853

issn

1615-9861

journal_volume

4

pub_type

杂志文章,评审
  • Identification by 2-D DIGE of apoplastic proteins regulated by oligogalacturonides in Arabidopsis thaliana.

    abstract::Oligogalacturonides (OGs) are elicitors of plant defence responses released from the homogalacturonan of the plant cell wall during the attack by pathogenic micro-organisms. The signalling pathway mediated by OGs remains poorly understood, and no proteins involved in their signal perception and transduction have yet b...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.200700523

    authors: Casasoli M,Spadoni S,Lilley KS,Cervone F,De Lorenzo G,Mattei B

    更新日期:2008-03-01 00:00:00

  • Efficiency improvement of peptide identification for an organism without complete genome sequence, using expressed sequence tag database and tandem mass spectral data.

    abstract::We compared peptide identification by database (DB) search methods with de novo sequencing results for proteomics study in an organism without genome sequence information. When the former was done by searching the Expressed Sequence Tag (EST) DB of the sample organism or the NCBI nonredundant (nr) protein DB of green ...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.200300620

    authors: Kwon KH,Kim M,Kim JY,Kim KW,Kim SI,Park YM,Yoo JS

    更新日期:2003-12-01 00:00:00

  • Further steps in standardisation. Report of the second annual Proteomics Standards Initiative Spring Workshop (Siena, Italy 17-20th April 2005).

    abstract::The spring workshop of the HUPO-PSI convened in Siena to further progress the data standards which are already making an impact on data exchange and deposition in the field of proteomics. Separate work groups pushed forward existing XML standards for the exchange of Molecular Interaction data (PSI-MI, MIF) and Mass Sp...

    journal_title:Proteomics

    pub_type:

    doi:10.1002/pmic.200500626

    authors: Orchard S,Hermjakob H,Taylor CF,Potthast F,Jones P,Zhu W,Julian RK Jr,Apweiler R

    更新日期:2005-09-01 00:00:00

  • Novel identification of expressed genes and functional classification of hypothetical proteins from Neisseria meningitidis serogroup A.

    abstract::To implement the 2-DE database of serogroup A Neisseria meningitidis (MenA) and improve its potential of investigation in bacterial biology, cell extracts were separated by tricine-SDS-PAGE and 131 novel proteins were identified by microLC-ESI-IT-MS/MS. These identifications extended to 404, the number of MenA gene ex...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.200700210

    authors: Bernardini G,Arena S,Braconi D,Scaloni A,Santucci A

    更新日期:2007-09-01 00:00:00

  • Quantitative protein microarrays for time-resolved measurements of protein phosphorylation.

    abstract::The quantitative analysis of signaling networks requires highly sensitive methods for the time-resolved determination of protein phosphorylation. For this reason, we developed a quantitative protein microarray that monitors the activation of multiple signaling pathways in parallel, and at high temporal resolution. A l...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.200800112

    authors: Korf U,Derdak S,Tresch A,Henjes F,Schumacher S,Schmidt C,Hahn B,Lehmann WD,Poustka A,Beissbarth T,Klingmüller U

    更新日期:2008-11-01 00:00:00

  • Ubiquitin specific peptidase 5 mediates Histidine-rich protein Hpn induced cell apoptosis in hepatocellular carcinoma through P14-P53 signaling.

    abstract::Hpn is a small histidine-rich cytoplasmic protein from Helicobacter pylori and has been recognized as a high-risk factor for several cancers including gastric cancer, colorectal cancer, and MALT lymphoma. However, the relationship between Hpn and cancers remains elusive. In this study, we discovered that Hpn protein e...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.201600350

    authors: Liu Y,Wang WM,Zou LY,Li L,Feng L,Pan MZ,Lv MY,Cao Y,Wang H,Kung HF,Pang JX,Fu WM,Zhang JF

    更新日期:2017-06-01 00:00:00

  • Quantitative iTRAQ proteome and comparative transcriptome analysis of elicitor-induced Norway spruce (Picea abies) cells reveals elements of calcium signaling in the early conifer defense response.

    abstract::Long-lived conifer trees depend on both constitutive and induced defenses for resistance against a myriad of potential pathogens and herbivores. In species of spruce (Picea spp.), several of the late events of pathogen-, insect-, or elicitor-induced defense responses have previously been characterized at the anatomica...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.200800252

    authors: Lippert DN,Ralph SG,Phillips M,White R,Smith D,Hardie D,Gershenzon J,Ritland K,Borchers CH,Bohlmann J

    更新日期:2009-01-01 00:00:00

  • Proteomic profiling of Tectona grandis L. leaf.

    abstract::Tectona grandis L. (teak) is one of the premier hardwood timbers in the world, ranking at present in the top five tropical hardwood species in terms of worldwide plantation area. Characterization of the proteins present in teak leaves will provide a basis for the development of new tools aimed at assisting tree select...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.201100183

    authors: Quiala E,Cañal MJ,Rodríguez R,Yagüe N,Chávez M,Barbón R,Valledor L

    更新日期:2012-04-01 00:00:00

  • Towards a proteomic analysis of atopic dermatitis: a two-dimensional-polyacrylamide gel electrophoresis/mass spectrometric analysis of cultured patient-derived fibroblasts.

    abstract::Atopic dermatitis (AD) is a chronic relapsing inflammatory skin disease typically characterized by a distribution of eczematous skin lesions with lichenification, pruritic excoriations, and dry skin with wide varieties of pathophysiologic aspects. Recently, AD was divided into extrinsic and intrinsic forms according t...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.200400998

    authors: Park YD,Kim SY,Jang HS,Seo EY,Namkung JH,Park HS,Cho SY,Paik YK,Yang JM

    更新日期:2004-11-01 00:00:00

  • Proteomic analysis in human breast cancer: identification of a characteristic protein expression profile of malignant breast epithelium.

    abstract::Gene expression analysis has become a promising tool in predicting the clinical course of malignant disease and the response to antineoplastic therapy. Surprisingly, only little is known about the protein expression pattern of human tumors. Recent advances in proteomic analysis allow proteins of interest to be identif...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.200500129

    authors: Hudelist G,Singer CF,Pischinger KI,Kaserer K,Manavi M,Kubista E,Czerwenka KF

    更新日期:2006-03-01 00:00:00

  • Mass spectrometric profiling of O-linked glycans released directly from glycoproteins in gels using in-gel reductive beta-elimination.

    abstract::Glycosylation is a widespread PTM of proteins; the carbohydrate moieties provide various functional, immunological and structural aspects of both eukaryotic and prokaryotic glycoproteins. Traditional strategies used to analyse glycoprotein O-glycans involve glycoprotein isolation, followed by glycan release using solu...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.200500331

    authors: Taylor AM,Holst O,Thomas-Oates J

    更新日期:2006-05-01 00:00:00

  • Urinary Proteomics Associates with COVID-19 Severity: Pilot Proof-of-Principle Data and Design of a Multicentric Diagnostic Study.

    abstract::SARS-CoV-2 infection results in a mild-to-moderate disease course in most patients, allowing outpatient self-care and quarantine. However, in approx. 10% of cases a two- or three-phasic critical disease course with starting from day 7 to 10 is observed. To facilitate and plan outpatient care, biomarkers prognosing suc...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.202000202

    authors: Wendt R,Kalbitz S,Lübbert C,Kellner N,Macholz M,Schroth S,Ermisch J,Latosisnka A,Arnold B,Mischak H,Beige J,Metzger J

    更新日期:2020-09-10 00:00:00

  • Performance validation of an improved Xenon-arc lamp-based CCD camera system for multispectral imaging in proteomics.

    abstract::Advances in gel-based nonradioactive protein expression and PTM detection using fluorophores has served as the impetus for developing analytical instrumentation with improved imaging capabilities. We describe a CCD camera-based imaging instrument, equipped with both a high-pressure Xenon arc lamp and a UV transillumin...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.200500062

    authors: Scrivener E,Boghigian BA,Golenko E,Bogdanova A,Jackson P,Mikulskis A,Denoyer E,Courtney P,Lopez MF,Patton WF

    更新日期:2005-11-01 00:00:00

  • Protein profile in neuroblastoma cells incubated with S- and R-enantiomers of ibuprofen by iTRAQ-coupled 2-D LC-MS/MS analysis: possible action of induced proteins on Alzheimer's disease.

    abstract::Ibuprofen is a member of the proprionic acid group of nonsteroidal anti-inflammatory drugs (NSAID), with the S-enantiomer being more active than the R-enantiomer. It has been shown to display protective effects against neuroinflammation, which is linked to the pathogenesis of several neurodegenerative disorders, inclu...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.200700556

    authors: Zhang J,Sui J,Ching CB,Chen WN

    更新日期:2008-04-01 00:00:00

  • Experimental and computational tools useful for (re)construction of dynamic kinase-substrate networks.

    abstract::The explosion of site- and context-specific in vivo phosphorylation events presents a potentially rich source of biological knowledge and calls for novel data analysis and modeling paradigms. Perhaps the most immediate challenge is delineating detected phosphorylation sites to their effector kinases. This is important...

    journal_title:Proteomics

    pub_type: 杂志文章,评审

    doi:10.1002/pmic.200900266

    authors: Tan CS,Linding R

    更新日期:2009-12-01 00:00:00

  • Target coatings and desorption surfaces in biomolecular MALDI-MS.

    abstract::MALDI-MS is an extremely flexible technique and can be synergistically used in conjunction with established bioanalytical methods such as PAGE or SPR. To that end, slight modifications on the sample target plate may be necessary. Those can involve the use of hydrophobic coatings for improved sample deposition and desa...

    journal_title:Proteomics

    pub_type: 杂志文章,评审

    doi:10.1002/pmic.200700782

    authors: König S

    更新日期:2008-02-01 00:00:00

  • Proteome analysis of human colon cancer by two-dimensional difference gel electrophoresis and mass spectrometry.

    abstract::Two-dimensional difference gel electrophoresis (2-D DIGE) coupled with mass spectrometry (MS) was used to investigate tumor-specific changes in the proteome of human colorectal cancers and adjacent normal mucosa. For each of six patients with different stages of colon cancer, Cy5-labeled proteins isolated from tumor t...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.200300635

    authors: Friedman DB,Hill S,Keller JW,Merchant NB,Levy SE,Coffey RJ,Caprioli RM

    更新日期:2004-03-01 00:00:00

  • Identification of a second Nutlin-3 responsive interaction site in the N-terminal domain of MDM2 using hydrogen/deuterium exchange mass spectrometry.

    abstract::MDM2 is a multidomain protein that functions as an E3 ubiquitin ligase, transcription repressor, mRNA-binding protein, translation factor, and molecular chaperone. The small molecule Nutlin-3 has been engineered to bind to the N-terminal hydrophobic pocket domain of MDM2. This binding of Nutlin-3 has two consequences:...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.201300029

    authors: Hernychova L,Man P,Verma C,Nicholson J,Sharma CA,Ruckova E,Teo JY,Ball K,Vojtesek B,Hupp TR

    更新日期:2013-08-01 00:00:00

  • Citrulline enhances myofibrillar constituents expression of skeletal muscle and induces a switch in muscle energy metabolism in malnourished aged rats.

    abstract::Citrulline (Cit) actions on muscle metabolism remain unclear. Those latter were investigated using a proteomic approach on Tibialis muscles from male Sprague-Dawley rats. At 23 months of age, rats were either fed ad libitum (AL group) or subjected to dietary restriction for 12 weeks. At the end of the restriction peri...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.201200262

    authors: Faure C,Morio B,Chafey P,Le Plénier S,Noirez P,Randrianarison-Huetz V,Cynober L,Aussel C,Moinard C

    更新日期:2013-07-01 00:00:00

  • The Simpson-Golabi-Behmel syndrome causative glypican-3, binds to and inhibits the dipeptidyl peptidase activity of CD26.

    abstract::Simpson-Golabi-Behmel syndrome (SGBS) is an X-linked condition shown to be the result of deletions of the glypican-3 (GPC3) gene. GPC3 is a proteoglycan localized to the cell membrane via a glycosylphosphatidyl-inositol (GPI) anchor. To further elucidate the GPC3 function(s), we have screened various cell lines for pr...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.200600654

    authors: Davoodi J,Kelly J,Gendron NH,MacKenzie AE

    更新日期:2007-06-01 00:00:00

  • Transcription factor proteomics-Tools, applications, and challenges.

    abstract::Transcription factors (TFs) are a family of DNA-binding proteins whose gene regulatory capabilities are of vital importance in defining the molecular state of a cell. Despite their biological significance, our understanding of TF behavior and function is still limited. This is because we have so far mostly relied on g...

    journal_title:Proteomics

    pub_type: 杂志文章,评审

    doi:10.1002/pmic.201600317

    authors: Simicevic J,Deplancke B

    更新日期:2017-02-01 00:00:00

  • Restoration of heat shock protein70 suppresses gastric mucosal inducible nitric oxide synthase expression induced by Helicobacter pylori.

    abstract::Heat shock proteins (HSPs) are crucial for the maintenance of cell integrity during normal cellular growth as well as during pathophysiological conditions. While functioning mainly as molecular chaperones, HSPs also appear to be involved in diverse biological activities, such as apoptosis, carcinogenesis, and cytoprot...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.200400951

    authors: Yeo M,Park HK,Kim DK,Cho SW,Kim YS,Cho SY,Paik YK,Hahm KB

    更新日期:2004-11-01 00:00:00

  • Identification of tumor-associated plasma biomarkers using proteomic techniques: from mouse to human.

    abstract::In an effort to identify tumor-associated proteins from plasma of tumor-bearing mice that may be used as diagnostic biomarkers, we developed a strategy that combines a tumor xenotransplantation model in nude mice with comparative proteomic technology. Five human cancer cell lines (SC-M1, HONE-1, CC-M1, OECM1, GBM 8401...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.200400785

    authors: Juan HF,Chen JH,Hsu WT,Huang SC,Chen ST,Yi-Chung Lin J,Chang YW,Chiang CY,Wen LL,Chan DC,Liu YC,Chen YJ

    更新日期:2004-09-01 00:00:00

  • The secreted and surface proteomes of the adult stage of the carcinogenic human liver fluke Opisthorchis viverrini.

    abstract::Infection with the human liver fluke, Opisthorchis viverrini, is a serious public health problem in Thailand, Laos and nearby locations in Southeast Asia. Both experimental and epidemiological evidence strongly implicate liver fluke infection in the etiology of one of the liver cancer subtypes, cholangiocarcinoma (CCA...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.200900393

    authors: Mulvenna J,Sripa B,Brindley PJ,Gorman J,Jones MK,Colgrave ML,Jones A,Nawaratna S,Laha T,Suttiprapa S,Smout MJ,Loukas A

    更新日期:2010-03-01 00:00:00

  • Differential accumulation of Lhcb gene products in thylakoid membranes of Zea mays plants grown under contrasting light and temperature conditions.

    abstract::In higher plants many different genes encode Lhcb proteins that belong to a highly conserved protein family. Evolutionary conservation of this genetic redundancy suggests that individual gene products play different roles in light harvesting and photoprotection depending on environmental conditions. We have tested the...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.200402008

    authors: Caffarri S,Frigerio S,Olivieri E,Righetti PG,Bassi R

    更新日期:2005-02-01 00:00:00

  • Identification of the degradome of Isp-1, a major intracellular serine protease of Bacillus subtilis, by two-dimensional gel electrophoresis and matrix- assisted laser desorption/ionization-time of flight analysis.

    abstract::Intracellular serine protease-1 (Isp-1) is a major intracellular serine protease of Bacillus subtilis, whose functions still remain largely unknown. Furthermore, physiological substrates are yet to be determined. To identify Isp-1 substrates, we digested extract obtained from an Isp-1 deficient Bacillus mutant with pu...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.200400997

    authors: Lee AY,Goo Park S,Kho CW,Young Park S,Cho S,Lee SC,Lee DH,Myung PK,Park BC

    更新日期:2004-11-01 00:00:00

  • A proteomic kinetic analysis of IGROV1 ovarian carcinoma cell line response to cisplatin treatment.

    abstract::Ovarian cancer is one of the leading causes of mortality by gynecological cancer. Despite good response to surgery and initial chemotherapy, essentially based on cisplatin (cis-diamino-dichloro-platinum(II) (CDDP)) compounds, frequent recurrences with chemoresistance acquisition are responsible for poor prognosis. Sev...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.200700231

    authors: Le Moguen K,Lincet H,Marcelo P,Lemoisson E,Heutte N,Duval M,Poulain L,Vinh J,Gauduchon P,Baudin B

    更新日期:2007-11-01 00:00:00

  • Comprehensive two-dimensional liquid chromatography mass spectrometric profiling of the rat hippocampal proteome.

    abstract::In this study, we performed the first high-throughput and comprehensive proteomic profiling of the rat hippocampal proteome. Using a combination of 2-D LC-MS and data analysis with the Rosetta Elucidator(®) system, we identified 1340 unique proteins. Functional classification showed that most of these were associated ...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.201000525

    authors: Yang X,Levin Y,Rahmoune H,Ma D,Schöffmann S,Umrania Y,Guest PC,Bahn S

    更新日期:2011-02-01 00:00:00

  • MRM as a discovery tool?

    abstract::Multiple-reaction monitoring (MRM) of peptides has been recognized as a promising technology because it is sensitive and robust. Borrowed from stable-isotope dilution (SID) methodologies in the field of small molecules, MRM is now routinely used in proteomics laboratories. While its usefulness validating candidate tar...

    journal_title:Proteomics

    pub_type: 评论,杂志文章

    doi:10.1002/pmic.201500090

    authors: Rudnick PA

    更新日期:2015-04-01 00:00:00

  • Tyrosine 656 in topoisomerase IIβ is important for the catalytic activity of the enzyme: Identification based on artifactual +80-Da modification at this site.

    abstract::Topoisomerase (topo) II catalyzes topological changes in DNA. Although both human isozymes, topo IIα and β are phosphorylated, site-specific phosphorylation of topo IIβ is poorly characterized. Using LC-MS/MS analysis of topo IIβ, cleaved with trypsin, Arg C or cyanogen bromide (CNBr) plus trypsin, we detected four +8...

    journal_title:Proteomics

    pub_type: 杂志文章

    doi:10.1002/pmic.201000194

    authors: Grozav AG,Willard BB,Kozuki T,Chikamori K,Micluta MA,Petrescu AJ,Kinter M,Ganapathi R,Ganapathi MK

    更新日期:2011-03-01 00:00:00