Protein secondary structure prediction using nearest-neighbor methods.

Abstract:

:We have studied the use of nearest-neighbor classifiers to predict the secondary structure of proteins. The nearest-neighbor rule states that a test instance is classified according to the classifications of "nearby" training examples from a database of known structures. In the context of secondary structure prediction, the test instances are windows of n consecutive residues, and the label is the secondary structure type (alpha-helix, beta-strand, or coil) of the center position of the window. To define the neighborhood of a test instance, we employed a novel similarity metric based on the local structural environment scoring scheme of Bowie et al. In this manner, we have attempted to exploit the underlying structural similarity between segments of different proteins to aid in the prediction of secondary structure. Furthermore, in addition to using neighborhoods of fixed radius, we explored a modification of the standard nearest-neighbor algorithm that involved defining an "effective radius" for each exemplar by measuring its performance on a training set. Using these ideas, we achieved a peak prediction accuracy of 68%. Finally, we sought to improve the biological utility of secondary structure prediction by identifying the subset of the predictions that are most likely to be correct. Toward this end, we developed a nearest-neighbor estimator that produced not the traditional "one-state" prediction (alpha-helix, beta-strand, or coil) but rather a probability distribution over the three states. It should be emphasized that this scheme estimates true probability values and that the resulting numbers are not pseudo-probability scores generated by simple normalization of the raw output of the predictor. Applying the mutual information statistic, we found that these probability triplets possess 58% more information than the one-state predictions. Furthermore, the probability estimates allow one to assign an a priori confidence level to the prediction at each residue. Using this approach, we found that the top 28% of the predictions were 86% accurate and the top 43% of the predictions were 81% accurate. These results indicate that, notwithstanding the limitations on overall accuracy of secondary structure prediction, a substantial proportion of a protein can be predicted with considerable accuracy.

journal_name

J Mol Biol

authors

Yi TM,Lander ES

doi

10.1006/jmbi.1993.1464

subject

Has Abstract

pub_date

1993-08-20 00:00:00

pages

1117-29

issue

4

eissn

0022-2836

issn

1089-8638

pii

S0022-2836(83)71464-6

journal_volume

232

pub_type

杂志文章
  • The capsid size-determining protein Sid forms an external scaffold on phage P4 procapsids.

    abstract::Although the phages P2 and P4 build their capsids from the same precursor, the product of the P2 N gene, the two capsids differ in size: P2 builds a 60 nm, T = 7 capsid from 420 subunits, whereas P4 makes a 45 nm, T = 4 capsid from 240 subunits. This difference leads to substantial changes in shell geometry and subuni...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1006/jmbi.1995.0416

    authors: Marvik OJ,Dokland T,Nøkling RH,Jacobsen E,Larsen T,Lindqvist BH

    更新日期:1995-08-04 00:00:00

  • A Theoretical Framework for Evolutionary Cell Biology.

    abstract::One of the last uncharted territories in evolutionary biology concerns the link with cell biology. Because all phenotypes ultimately derive from events at the cellular level, this connection is essential to building a mechanism-based theory of evolution. Given the impressive developments in cell biological methodologi...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章,评审

    doi:10.1016/j.jmb.2020.02.006

    authors: Lynch M,Trickovic B

    更新日期:2020-03-27 00:00:00

  • Intracellular antibody capture technology: application to selection of intracellular antibodies recognising the BCR-ABL oncogenic protein.

    abstract::The expression of antibodies inside cells to ablate protein function has the potential for disease therapy and for target validation in functional genomics. However, due to inefficient expression or folding, only a few antibodies or antibody fragments, usually as single-chain Fv antibody fragments (scFv), bind their a...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1006/jmbi.2002.5403

    authors: Tse E,Lobato MN,Forster A,Tanaka T,Chung GT,Rabbitts TH

    更新日期:2002-03-15 00:00:00

  • Evolution of the albumin: alpha-fetoprotein ancestral gene from the amplification of a 27 nucleotide sequence.

    abstract::The genes for alpha-fetoprotein and albumin arose by duplication of an ancestral gene that contained three genetic domains. These domains were generated by the triplication of a primordial genetic domain composed of five exons or subdomains. That the primordial domain itself arose by amplification of a simpler sequenc...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1016/0022-2836(84)90187-6

    authors: Alexander F,Young PR,Tilghman SM

    更新日期:1984-02-25 00:00:00

  • Diversification of β-Augmentation Interactions between CDI Toxin/Immunity Proteins.

    abstract::Contact-dependent growth inhibition (CDI) is a widespread mechanism of inter-bacterial competition mediated by the CdiB/CdiA family of two-partner secretion proteins. CdiA effectors carry diverse C-terminal toxin domains (CdiA-CT), which are delivered into neighboring target cells to inhibit growth. CDI(+) bacteria al...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1016/j.jmb.2015.09.020

    authors: Morse RP,Willett JL,Johnson PM,Zheng J,Credali A,Iniguez A,Nowick JS,Hayes CS,Goulding CW

    更新日期:2015-11-20 00:00:00

  • Unfolding pathway of apomyoglobin. Simultaneous characterization of acidic conformational states by frequency domain fluorometry.

    abstract::The dynamic properties of the conformational states co-existing during the acid-induced unfolding of tuna apomyoglobin, a single tryptophan-containing protein, have been investigated simultaneously by frequency domain fluorometry. In the transition region, in the absence of salt, the tryptophanyl fluorescence emission...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1006/jmbi.1994.1477

    authors: Bismuto E,Irace G

    更新日期:1994-08-05 00:00:00

  • Mechanism of codon recognition by transfer RNA and codon-induced tRNA association.

    abstract::The steps of UUC recognition by tRNAPhe were analysed by temperature-jump measurements. At ion concentrations close to physiological conditions we found three relaxation processes, which we assigned to (1) formation of codon-anticodon complexes, (2) a conformational change of the anticodon loop coupled with Mg2+ bindi...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1016/0022-2836(84)90085-8

    authors: Labuda D,Striker G,Porschke D

    更新日期:1984-04-25 00:00:00

  • Structure of turnip crinkle virus. III. Identification of a unique coat protein dimer.

    abstract::The minor structural protein (p80), found in about one copy per virion in turnip crinkle virus (TCV), is shown by amino acid analysis and peptide mapping to be a covalent dimer of the major coat protein (p40). The covalent linkage occurs near the N termini of the crosslinked chains. These data suggest that TGV and rel...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1016/0022-2836(86)90456-0

    authors: Stockley PG,Kirsh AL,Chow EP,Smart JE,Harrison SC

    更新日期:1986-10-20 00:00:00

  • DNA curvature in native and modified EcoRI recognition sites and possible influence upon the endonuclease cleavage reaction.

    abstract::The ligation of a decadeoxynucleotide containing the EcoRI recognition site forms a series of multimers which appear to be curved based on observed anomalous gel migration in polyacrylamide gels. The degree of DNA curvature present in the recognition sequence, based upon the observed migration anomaly, can be altered ...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1016/0022-2836(88)90561-x

    authors: Diekmann S,McLaughlin LW

    更新日期:1988-08-20 00:00:00

  • Defining the structural basis for assembly of a transmembrane cytochrome.

    abstract::To define the structural basis for cofactor binding to membrane proteins, we introduce a manageable model system, which allows us, for the first time, to study the influence of individual transmembrane helices and of single amino acid residues on the assembly of a transmembrane cytochrome. In vivo as well as in vitro ...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1016/j.jmb.2005.05.016

    authors: Prodöhl A,Volkmer T,Finger C,Schneider D

    更新日期:2005-07-22 00:00:00

  • Calcium-dependent homoassociation of E-cadherin by NMR spectroscopy: changes in mobility, conformation and mapping of contact regions.

    abstract::Cadherins are calcium-dependent cell surface proteins that mediate homophilic cellular adhesion. The calcium-induced oligomerization of the N-terminal two domains of epithelial cadherin (ECAD12) was followed by NMR spectroscopy in solution over a large range of protein (10 microM-5 mM) and calcium (0-5 mM) concentrati...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1016/s0022-2836(02)01137-3

    authors: Häussinger D,Ahrens T,Sass HJ,Pertz O,Engel J,Grzesiek S

    更新日期:2002-12-06 00:00:00

  • Core RNA polymerase and promoter DNA interactions of purified domains of sigma N: bipartite functions.

    abstract::The sigma N class of sigma factors confer upon RNA polymerase the requirement for enhancer-binding activator proteins. The sigma-N (sigma N) protein of Klebsiella pneumoniae was analysed by the assay of purified peptides comprising domains or regions of sigma N defined by proteolysis or by homology alignment, respecti...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1006/jmbi.1995.0260

    authors: Cannon W,Missailidis S,Smith C,Cottier A,Austin S,Moore M,Buck M

    更新日期:1995-05-12 00:00:00

  • The three-dimensional structure at 2.4 A resolution of glycosylated proteinase A from the lysosome-like vacuole of Saccharomyces cerevisiae.

    abstract::The crystal structures of glycosylated native proteinase A, an aspartic proteinase found in the vacuole of Saccharomyces cerevisiae, and its complex with a difluorostatone-containing tripeptide have been determined by molecular replacement to 3.5 A and 2.4 A resolutions, respectively. Superposition of the bound and na...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1006/jmbi.1996.0880

    authors: Aguilar CF,Cronin NB,Badasso M,Dreyer T,Newman MP,Cooper JB,Hoover DJ,Wood SP,Johnson MS,Blundell TL

    更新日期:1997-04-11 00:00:00

  • Protein misfolding and amyloid formation for the peptide GNNQQNY from yeast prion protein Sup35: simulation by reaction path annealing.

    abstract::We study the early steps of amyloid formation of the seven residue peptide GNNQQNY from yeast prion-like protein Sup35 by simulating the random coil to beta-sheet and alpha-helix to beta-sheet transition both in the absence and presence of a cross-beta amyloid nucleus. The simulation method at atomic resolution employ...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1016/j.jmb.2005.03.083

    authors: Lipfert J,Franklin J,Wu F,Doniach S

    更新日期:2005-06-10 00:00:00

  • Engineering GST M2-2 for high activity with indene 1,2-oxide and indication of an H-site residue sustaining catalytic promiscuity.

    abstract::The substrate-binding H-site of human glutathione transferase (GST) M2-2 was subjected to iterative saturation mutagenesis in order to obtain an efficient enzyme with the novel epoxide substrate indene 1,2-oxide. Residues 10, 116, and 210 were targeted, and the activities with the alternative substrates, benzyl isothi...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1016/j.jmb.2011.07.039

    authors: Norrgård MA,Mannervik B

    更新日期:2011-09-09 00:00:00

  • B-DNA twisting correlates with base-pair morphology.

    abstract::The observed sequence dependence of the mean twist angles in 38 B-DNA crystal structures can be understood in terms of simple geometrical features of the constituent base-pairs. Structures with low twist appear to unwind in response to severe steric clashes of large exocyclic groups (such as NH2-NH2) in the major and ...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1006/jmbi.1994.0120

    authors: Gorin AA,Zhurkin VB,Olson WK

    更新日期:1995-03-17 00:00:00

  • Design of a highly reactive HDV ribozyme sequence uncovers facilitation of RNA folding by alternative pairings and physiological ionic strength.

    abstract::The hepatitis delta virus (HDV) ribozyme is a self-cleaving RNA that resides in the HDV genome and regulates its replication. The native fold of the ribozyme is complex, having two pseudoknots. Earlier work implicated four non-native pairings in slowing pseudoknot formation: Alt 1, Alt 2, Alt 3, and Alt P1. The goal o...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1016/j.jmb.2004.05.071

    authors: Brown TS,Chadalavada DM,Bevilacqua PC

    更新日期:2004-08-13 00:00:00

  • Capsid targeting sequence targets foreign proteins into bacteriophage T4 and permits proteolytic processing.

    abstract::A membrane-independent morphogenetic viral signal peptide is identified within bacteriophage T4 internal protein III (IPIII). Utilizing a phagederived expression-packaging-processing system, which packages foreign proteins fused with IPIII into the phage capsid, a synthetic cleavage site introduced at the C terminus o...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1006/jmbi.1996.0470

    authors: Mullaney JM,Black LW

    更新日期:1996-08-23 00:00:00

  • Crystal structure of the soluble domain of the major anaerobically induced outer membrane protein (AniA) from pathogenic Neisseria: a new class of copper-containing nitrite reductases.

    abstract::The major anaerobically induced outer membrane protein (AniA) from pathogenic Neisseria gonorrhoeae is essential for cell growth under oxygen limiting conditions in the presence of nitrite and is protective against killing by human sera. A phylogenic analysis indicates that AniA is a member of a new class of copper-co...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1006/jmbi.2001.5251

    authors: Boulanger MJ,Murphy ME

    更新日期:2002-02-01 00:00:00

  • Correlation between conformational and binding properties of nebulin repeats.

    abstract::Nebulin, a large protein (600 to 800 kDa) located in the thin filament of striated vertebrate muscle, is assumed to bind and stabilise F-actin. Complete sequence determination of human nebulin has only recently been accomplished showing a uniform modular structure along the whole length of the molecule. Up to 97% of t...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1006/jmbi.1996.0169

    authors: Pfuhl M,Winder SJ,Castiglione Morelli MA,Labeit S,Pastore A

    更新日期:1996-03-29 00:00:00

  • Refined structure of spinach glycolate oxidase at 2 A resolution.

    abstract::The amino acid sequence of glycolate oxidase from spinach has been fitted to an electron density map of 2.0 A nominal resolution and the structure has been refined using the restrained parameter least-squares refinement of Hendrickson and Konnert. A final crystallographic R-factor of 18.9% was obtained for 32,888 inde...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1016/0022-2836(89)90178-2

    authors: Lindqvist Y

    更新日期:1989-09-05 00:00:00

  • Glutamate promotes SSB protein-protein Interactions via intrinsically disordered regions.

    abstract::E. coli single strand (ss) DNA binding protein (SSB) is an essential protein that binds to ssDNA intermediates formed during genome maintenance. SSB homotetramers bind ssDNA in several modes that differ in occluded site size and cooperativity. High "unlimited" cooperativity is associated with the 35 site size ((SSB)35...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1016/j.jmb.2017.07.021

    authors: Kozlov AG,Shinn MK,Weiland EA,Lohman TM

    更新日期:2017-09-01 00:00:00

  • Molecular Highways-Navigating Collisions of DNA Motor Proteins.

    abstract::Fundamental biological processes require concurrent sharing of DNA by numerous motor proteins and complexes. Thus, collision, congestion, and roadblocks are inescapable on these busy "molecular highways." The consequences of these traffic problems are diverse, resulting in complex cellular mechanisms to resolve threat...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章,评审

    doi:10.1016/j.jmb.2018.08.006

    authors: Le TT,Wang MD

    更新日期:2018-10-26 00:00:00

  • The external alternative NAD(P)H dehydrogenase NDE3 is localized both in the mitochondria and in the cytoplasm of Neurospora crassa.

    abstract::The filamentous fungus Neurospora crassa has a branched respiratory chain. Several alternative dehydrogenases, aside from the canonical complex I enzyme, are involved in the oxidation of NAD(P)H substrates. Based on homology searches in the fungal genome, we have tentatively identified one of these proteins. The corre...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1016/j.jmb.2007.02.080

    authors: Carneiro P,Duarte M,Videira A

    更新日期:2007-05-11 00:00:00

  • Local RNA target structure influences siRNA efficacy: systematic analysis of intentionally designed binding regions.

    abstract::Contradictory reports in the literature have emphasised either the sequence of small interfering RNAs (siRNA) or the structure of their target molecules to be the major determinant of the efficiency of RNA interference (RNAi) approaches. In the present study, we analyse systematically the contributions of these parame...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1016/j.jmb.2005.03.011

    authors: Schubert S,Grünweller A,Erdmann VA,Kurreck J

    更新日期:2005-05-13 00:00:00

  • Construction of a microphage variant of filamentous bacteriophage.

    abstract::The intergenic region in the genome of the Ff class of filamentous phage (comprising strains fl, fd and M13) genome constitutes 8% of the viral genome, and has essential functions in DNA replication and phage morphogenesis. The functional domains of this region may be inserted into separate sites of a plasmid to funct...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1016/0022-2836(92)90858-h

    authors: Specthrie L,Bullitt E,Horiuchi K,Model P,Russel M,Makowski L

    更新日期:1992-12-05 00:00:00

  • Biotin and biotin analogues specifically modify the fluorescence decay of avidin.

    abstract::Avidin, a basic tetrameric glycoprotein, isolated from hen egg-white, binds up to four molecules of biotin with exceptionally high affinity. The presence of tryptophanyl residues in the active site pointed out the opportunity of correlating the protein fluorescence with biotin binding. We have performed both steady st...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1006/jmbi.1994.1600

    authors: Mei G,Pugliese L,Rosato N,Toma L,Bolognesi M,Finazzi-Agrò A

    更新日期:1994-09-30 00:00:00

  • Growth rate-dependent control, feedback regulation and steady-state mRNA levels of the threonyl-tRNA synthetase gene of Escherichia coli.

    abstract::The expression of the gene thrS encoding threonyl-tRNA synthetase is under the control of two apparently different regulatory loops: translational feedback regulation and growth rate-dependent control. The translational feedback regulation is due to the binding of threonyl-tRNA synthetase to a site located in the lead...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1006/jmbi.1996.0445

    authors: Comer MM,Dondon J,Graffe M,Yarchuk O,Springer M

    更新日期:1996-08-16 00:00:00

  • Ribosomal protein L9: a structure determination by the combined use of X-ray crystallography and NMR spectroscopy.

    abstract::The structure of protein L9 from the Bacillus stearothernophilus ribosome has been determined at 2.5 A resolution by refinement against single crystal X-ray diffraction data with additional constraints provided by NMR data. This highly elongated protein consists of two domains separated by a nine-turn connecting helix...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1006/jmbi.1996.0696

    authors: Hoffman DW,Cameron CS,Davies C,White SW,Ramakrishnan V

    更新日期:1996-12-20 00:00:00

  • Efficient utilization of Escherichia coli transcriptional signals in Bacillus subtilis.

    abstract::Using purified sigma 55 RNA polymerase from Bacillus subtilis in an in vitro transcription system, we have shown that both promoters and terminators of Gram negative origin are recognized by this enzyme. Furthermore, when B. subtilis is transformed with a shuttle vector containing certain of these promoters, synthesis...

    journal_title:Journal of molecular biology

    pub_type: 杂志文章

    doi:10.1016/0022-2836(85)90129-9

    authors: Peschke U,Beuck V,Bujard H,Gentz R,Le Grice S

    更新日期:1985-12-05 00:00:00