Abstract:
:We have studied the use of nearest-neighbor classifiers to predict the secondary structure of proteins. The nearest-neighbor rule states that a test instance is classified according to the classifications of "nearby" training examples from a database of known structures. In the context of secondary structure prediction, the test instances are windows of n consecutive residues, and the label is the secondary structure type (alpha-helix, beta-strand, or coil) of the center position of the window. To define the neighborhood of a test instance, we employed a novel similarity metric based on the local structural environment scoring scheme of Bowie et al. In this manner, we have attempted to exploit the underlying structural similarity between segments of different proteins to aid in the prediction of secondary structure. Furthermore, in addition to using neighborhoods of fixed radius, we explored a modification of the standard nearest-neighbor algorithm that involved defining an "effective radius" for each exemplar by measuring its performance on a training set. Using these ideas, we achieved a peak prediction accuracy of 68%. Finally, we sought to improve the biological utility of secondary structure prediction by identifying the subset of the predictions that are most likely to be correct. Toward this end, we developed a nearest-neighbor estimator that produced not the traditional "one-state" prediction (alpha-helix, beta-strand, or coil) but rather a probability distribution over the three states. It should be emphasized that this scheme estimates true probability values and that the resulting numbers are not pseudo-probability scores generated by simple normalization of the raw output of the predictor. Applying the mutual information statistic, we found that these probability triplets possess 58% more information than the one-state predictions. Furthermore, the probability estimates allow one to assign an a priori confidence level to the prediction at each residue. Using this approach, we found that the top 28% of the predictions were 86% accurate and the top 43% of the predictions were 81% accurate. These results indicate that, notwithstanding the limitations on overall accuracy of secondary structure prediction, a substantial proportion of a protein can be predicted with considerable accuracy.
journal_name
J Mol Bioljournal_title
Journal of molecular biologyauthors
Yi TM,Lander ESdoi
10.1006/jmbi.1993.1464subject
Has Abstractpub_date
1993-08-20 00:00:00pages
1117-29issue
4eissn
0022-2836issn
1089-8638pii
S0022-2836(83)71464-6journal_volume
232pub_type
杂志文章abstract::Although the phages P2 and P4 build their capsids from the same precursor, the product of the P2 N gene, the two capsids differ in size: P2 builds a 60 nm, T = 7 capsid from 420 subunits, whereas P4 makes a 45 nm, T = 4 capsid from 240 subunits. This difference leads to substantial changes in shell geometry and subuni...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.1995.0416
更新日期:1995-08-04 00:00:00
abstract::One of the last uncharted territories in evolutionary biology concerns the link with cell biology. Because all phenotypes ultimately derive from events at the cellular level, this connection is essential to building a mechanism-based theory of evolution. Given the impressive developments in cell biological methodologi...
journal_title:Journal of molecular biology
pub_type: 杂志文章,评审
doi:10.1016/j.jmb.2020.02.006
更新日期:2020-03-27 00:00:00
abstract::The expression of antibodies inside cells to ablate protein function has the potential for disease therapy and for target validation in functional genomics. However, due to inefficient expression or folding, only a few antibodies or antibody fragments, usually as single-chain Fv antibody fragments (scFv), bind their a...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.2002.5403
更新日期:2002-03-15 00:00:00
abstract::The genes for alpha-fetoprotein and albumin arose by duplication of an ancestral gene that contained three genetic domains. These domains were generated by the triplication of a primordial genetic domain composed of five exons or subdomains. That the primordial domain itself arose by amplification of a simpler sequenc...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/0022-2836(84)90187-6
更新日期:1984-02-25 00:00:00
abstract::Contact-dependent growth inhibition (CDI) is a widespread mechanism of inter-bacterial competition mediated by the CdiB/CdiA family of two-partner secretion proteins. CdiA effectors carry diverse C-terminal toxin domains (CdiA-CT), which are delivered into neighboring target cells to inhibit growth. CDI(+) bacteria al...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/j.jmb.2015.09.020
更新日期:2015-11-20 00:00:00
abstract::The dynamic properties of the conformational states co-existing during the acid-induced unfolding of tuna apomyoglobin, a single tryptophan-containing protein, have been investigated simultaneously by frequency domain fluorometry. In the transition region, in the absence of salt, the tryptophanyl fluorescence emission...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.1994.1477
更新日期:1994-08-05 00:00:00
abstract::The steps of UUC recognition by tRNAPhe were analysed by temperature-jump measurements. At ion concentrations close to physiological conditions we found three relaxation processes, which we assigned to (1) formation of codon-anticodon complexes, (2) a conformational change of the anticodon loop coupled with Mg2+ bindi...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/0022-2836(84)90085-8
更新日期:1984-04-25 00:00:00
abstract::The minor structural protein (p80), found in about one copy per virion in turnip crinkle virus (TCV), is shown by amino acid analysis and peptide mapping to be a covalent dimer of the major coat protein (p40). The covalent linkage occurs near the N termini of the crosslinked chains. These data suggest that TGV and rel...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/0022-2836(86)90456-0
更新日期:1986-10-20 00:00:00
abstract::The ligation of a decadeoxynucleotide containing the EcoRI recognition site forms a series of multimers which appear to be curved based on observed anomalous gel migration in polyacrylamide gels. The degree of DNA curvature present in the recognition sequence, based upon the observed migration anomaly, can be altered ...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/0022-2836(88)90561-x
更新日期:1988-08-20 00:00:00
abstract::To define the structural basis for cofactor binding to membrane proteins, we introduce a manageable model system, which allows us, for the first time, to study the influence of individual transmembrane helices and of single amino acid residues on the assembly of a transmembrane cytochrome. In vivo as well as in vitro ...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/j.jmb.2005.05.016
更新日期:2005-07-22 00:00:00
abstract::Cadherins are calcium-dependent cell surface proteins that mediate homophilic cellular adhesion. The calcium-induced oligomerization of the N-terminal two domains of epithelial cadherin (ECAD12) was followed by NMR spectroscopy in solution over a large range of protein (10 microM-5 mM) and calcium (0-5 mM) concentrati...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/s0022-2836(02)01137-3
更新日期:2002-12-06 00:00:00
abstract::The sigma N class of sigma factors confer upon RNA polymerase the requirement for enhancer-binding activator proteins. The sigma-N (sigma N) protein of Klebsiella pneumoniae was analysed by the assay of purified peptides comprising domains or regions of sigma N defined by proteolysis or by homology alignment, respecti...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.1995.0260
更新日期:1995-05-12 00:00:00
abstract::The crystal structures of glycosylated native proteinase A, an aspartic proteinase found in the vacuole of Saccharomyces cerevisiae, and its complex with a difluorostatone-containing tripeptide have been determined by molecular replacement to 3.5 A and 2.4 A resolutions, respectively. Superposition of the bound and na...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.1996.0880
更新日期:1997-04-11 00:00:00
abstract::We study the early steps of amyloid formation of the seven residue peptide GNNQQNY from yeast prion-like protein Sup35 by simulating the random coil to beta-sheet and alpha-helix to beta-sheet transition both in the absence and presence of a cross-beta amyloid nucleus. The simulation method at atomic resolution employ...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/j.jmb.2005.03.083
更新日期:2005-06-10 00:00:00
abstract::The substrate-binding H-site of human glutathione transferase (GST) M2-2 was subjected to iterative saturation mutagenesis in order to obtain an efficient enzyme with the novel epoxide substrate indene 1,2-oxide. Residues 10, 116, and 210 were targeted, and the activities with the alternative substrates, benzyl isothi...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/j.jmb.2011.07.039
更新日期:2011-09-09 00:00:00
abstract::The observed sequence dependence of the mean twist angles in 38 B-DNA crystal structures can be understood in terms of simple geometrical features of the constituent base-pairs. Structures with low twist appear to unwind in response to severe steric clashes of large exocyclic groups (such as NH2-NH2) in the major and ...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.1994.0120
更新日期:1995-03-17 00:00:00
abstract::The hepatitis delta virus (HDV) ribozyme is a self-cleaving RNA that resides in the HDV genome and regulates its replication. The native fold of the ribozyme is complex, having two pseudoknots. Earlier work implicated four non-native pairings in slowing pseudoknot formation: Alt 1, Alt 2, Alt 3, and Alt P1. The goal o...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/j.jmb.2004.05.071
更新日期:2004-08-13 00:00:00
abstract::A membrane-independent morphogenetic viral signal peptide is identified within bacteriophage T4 internal protein III (IPIII). Utilizing a phagederived expression-packaging-processing system, which packages foreign proteins fused with IPIII into the phage capsid, a synthetic cleavage site introduced at the C terminus o...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.1996.0470
更新日期:1996-08-23 00:00:00
abstract::The major anaerobically induced outer membrane protein (AniA) from pathogenic Neisseria gonorrhoeae is essential for cell growth under oxygen limiting conditions in the presence of nitrite and is protective against killing by human sera. A phylogenic analysis indicates that AniA is a member of a new class of copper-co...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.2001.5251
更新日期:2002-02-01 00:00:00
abstract::Nebulin, a large protein (600 to 800 kDa) located in the thin filament of striated vertebrate muscle, is assumed to bind and stabilise F-actin. Complete sequence determination of human nebulin has only recently been accomplished showing a uniform modular structure along the whole length of the molecule. Up to 97% of t...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.1996.0169
更新日期:1996-03-29 00:00:00
abstract::The amino acid sequence of glycolate oxidase from spinach has been fitted to an electron density map of 2.0 A nominal resolution and the structure has been refined using the restrained parameter least-squares refinement of Hendrickson and Konnert. A final crystallographic R-factor of 18.9% was obtained for 32,888 inde...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/0022-2836(89)90178-2
更新日期:1989-09-05 00:00:00
abstract::E. coli single strand (ss) DNA binding protein (SSB) is an essential protein that binds to ssDNA intermediates formed during genome maintenance. SSB homotetramers bind ssDNA in several modes that differ in occluded site size and cooperativity. High "unlimited" cooperativity is associated with the 35 site size ((SSB)35...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/j.jmb.2017.07.021
更新日期:2017-09-01 00:00:00
abstract::Fundamental biological processes require concurrent sharing of DNA by numerous motor proteins and complexes. Thus, collision, congestion, and roadblocks are inescapable on these busy "molecular highways." The consequences of these traffic problems are diverse, resulting in complex cellular mechanisms to resolve threat...
journal_title:Journal of molecular biology
pub_type: 杂志文章,评审
doi:10.1016/j.jmb.2018.08.006
更新日期:2018-10-26 00:00:00
abstract::The filamentous fungus Neurospora crassa has a branched respiratory chain. Several alternative dehydrogenases, aside from the canonical complex I enzyme, are involved in the oxidation of NAD(P)H substrates. Based on homology searches in the fungal genome, we have tentatively identified one of these proteins. The corre...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/j.jmb.2007.02.080
更新日期:2007-05-11 00:00:00
abstract::Contradictory reports in the literature have emphasised either the sequence of small interfering RNAs (siRNA) or the structure of their target molecules to be the major determinant of the efficiency of RNA interference (RNAi) approaches. In the present study, we analyse systematically the contributions of these parame...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/j.jmb.2005.03.011
更新日期:2005-05-13 00:00:00
abstract::The intergenic region in the genome of the Ff class of filamentous phage (comprising strains fl, fd and M13) genome constitutes 8% of the viral genome, and has essential functions in DNA replication and phage morphogenesis. The functional domains of this region may be inserted into separate sites of a plasmid to funct...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/0022-2836(92)90858-h
更新日期:1992-12-05 00:00:00
abstract::Avidin, a basic tetrameric glycoprotein, isolated from hen egg-white, binds up to four molecules of biotin with exceptionally high affinity. The presence of tryptophanyl residues in the active site pointed out the opportunity of correlating the protein fluorescence with biotin binding. We have performed both steady st...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.1994.1600
更新日期:1994-09-30 00:00:00
abstract::The expression of the gene thrS encoding threonyl-tRNA synthetase is under the control of two apparently different regulatory loops: translational feedback regulation and growth rate-dependent control. The translational feedback regulation is due to the binding of threonyl-tRNA synthetase to a site located in the lead...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.1996.0445
更新日期:1996-08-16 00:00:00
abstract::The structure of protein L9 from the Bacillus stearothernophilus ribosome has been determined at 2.5 A resolution by refinement against single crystal X-ray diffraction data with additional constraints provided by NMR data. This highly elongated protein consists of two domains separated by a nine-turn connecting helix...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.1996.0696
更新日期:1996-12-20 00:00:00
abstract::Using purified sigma 55 RNA polymerase from Bacillus subtilis in an in vitro transcription system, we have shown that both promoters and terminators of Gram negative origin are recognized by this enzyme. Furthermore, when B. subtilis is transformed with a shuttle vector containing certain of these promoters, synthesis...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/0022-2836(85)90129-9
更新日期:1985-12-05 00:00:00