Abstract:
:Many proteins have small-molecule binding pockets that are not easily detectable in the ligand-free structures. These cryptic sites require a conformational change to become apparent; a cryptic site can therefore be defined as a site that forms a pocket in a holo structure, but not in the apo structure. Because many proteins appear to lack druggable pockets, understanding and accurately identifying cryptic sites could expand the set of drug targets. Previously, cryptic sites were identified experimentally by fragment-based ligand discovery and computationally by long molecular dynamics simulations and fragment docking. Here, we begin by constructing a set of structurally defined apo-holo pairs with cryptic sites. Next, we comprehensively characterize the cryptic sites in terms of their sequence, structure, and dynamics attributes. We find that cryptic sites tend to be as conserved in evolution as traditional binding pockets but are less hydrophobic and more flexible. Relying on this characterization, we use machine learning to predict cryptic sites with relatively high accuracy (for our benchmark, the true positive and false positive rates are 73% and 29%, respectively). We then predict cryptic sites in the entire structurally characterized human proteome (11,201 structures, covering 23% of all residues in the proteome). CryptoSite increases the size of the potentially "druggable" human proteome from ~40% to ~78% of disease-associated proteins. Finally, to demonstrate the utility of our approach in practice, we experimentally validate a cryptic site in protein tyrosine phosphatase 1B using a covalent ligand and NMR spectroscopy. The CryptoSite Web server is available at http://salilab.org/cryptosite.
journal_name
J Mol Bioljournal_title
Journal of molecular biologyauthors
Cimermancic P,Weinkam P,Rettenmaier TJ,Bichmann L,Keedy DA,Woldeyes RA,Schneidman-Duhovny D,Demerdash ON,Mitchell JC,Wells JA,Fraser JS,Sali Adoi
10.1016/j.jmb.2016.01.029subject
Has Abstractpub_date
2016-02-22 00:00:00pages
709-719issue
4eissn
0022-2836issn
1089-8638pii
S0022-2836(16)00085-1journal_volume
428pub_type
杂志文章abstract::Archaeal DNA repair pathways are not well defined; in particular, there are no convincing candidate proteins for detection of DNA mismatches or the bulky lesions removed by excision repair pathways. Single-stranded DNA-binding proteins (SSBs) play a central role in DNA replication, recombination and repair. The crenar...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/j.jmb.2005.08.050
更新日期:2005-10-28 00:00:00
abstract::A highly thermostable xylanase isolated from the thermophilic fungus Paecilomyces varioti has been crystallized by the vapour diffusion method. The isolation of this enzyme by crystallization directly from the culture filtrate projects this fungus as an important source for large-scale production of pure xylanase. The...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/0022-2836(94)90052-3
更新日期:1994-11-04 00:00:00
abstract::The two cardiac myosin heavy chain isoforms, alpha and beta, differ functionally, alpha Myosin exhibits higher actin-activated ATPase than does beta myosin, and hearts expressing alpha myosin exhibit increased contractility relative to hearts expressing beta myosin. To understand the molecular basis for this functiona...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/0022-2836(89)90141-1
更新日期:1989-12-05 00:00:00
abstract::Single-strand conformers (SSCs) from the C-rich strand of the triplet repeat at the FMR-1 locus are rapidly and selectively methylated by the human DNA (cytosine-5) methyltransferase. The apparent affinity of the enzyme for the FMR-1 SSC is about tenfold higher than it is for a control Watson-Crick paired duplex. The ...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.1997.1430
更新日期:1998-01-09 00:00:00
abstract::Although largely deemed as structurally conserved, catalytic metal ion sites can rearrange, thereby contributing to enzyme evolvability. Here, we show that in paraoxonase-1, a lipo-lactonase, catalytic promiscuity and divergence into an organophosphate hydrolase are correlated with an alternative mode of the catalytic...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/j.jmb.2013.01.009
更新日期:2013-03-25 00:00:00
abstract::Microbial organisms utilize light not only as energy sources but also as signals by which rhodopsins (containing retinal as a chromophore) work as photoreceptors. Sensory rhodopsin I (SRI) is a dual photoreceptor that regulates both negative and positive phototaxis in microbial organisms, such as the archaeon Halobact...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/j.jmb.2009.06.050
更新日期:2009-09-11 00:00:00
abstract::During general genetic recombination and recombinational DNA repair, DNA damages and heterologies are often encountered which must be efficiently processed by the cellular recombination machinery. In RecA-mediated three-strand exchange reactions between single-stranded circular and linear duplex DNA, or four-strand ex...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.1996.0600
更新日期:1996-11-08 00:00:00
abstract::Viral suppressors of RNA interference (VSRs) target host gene silencing pathways, thereby operating important roles in the viral cycle and in host cells, in which they counteract host innate immune responses. However, the molecular mechanisms of VSRs are poorly understood. We provide here biochemical and biophysical f...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/j.jmb.2013.03.028
更新日期:2013-07-24 00:00:00
abstract::We have used a hidden Markov model (HMM) to identify the consensus sequence of the RpoD promoters in the genome of Campylobacter jejuni. The identified promoter consensus sequence is unusual compared to other bacteria, in that the region upstream of the TATA-box does not contain a conserved -35 region, but shows a ver...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/s0022-2836(03)00034-2
更新日期:2003-03-07 00:00:00
abstract::Regulation of eukaryotic genes is largely governed by multiple cis-acting DNA sequences recognized by specific transcription factors. The transcription factor NF-kappa B has been implicated as an important regulator of cellular and viral genes, including those of immunoglobulin kappa light chain, interleukin-2, beta-i...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/0022-2836(90)90187-q
更新日期:1990-07-20 00:00:00
abstract::Native state 1H NMR resonance assignments for 125 of the 129 residues of equine lysozyme have enabled measurement of the hydrogen exchange kinetics for over 60 backbone amide and three tryptophan indole hydrogen atoms in the native state. Native holo equine lysozyme hydrogen exchange protection factors are as large as...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.1997.0996
更新日期:1997-05-23 00:00:00
abstract::Autonomously replicating sequences (ARSs) in the yeast Yarrowia lipolytica require two components: an origin of replication (ORI) and centromere (CEN) DNA, both of which are necessary for extrachromosomal maintenance. To investigate this cooperation in more detail, we performed a screen for genomic sequences able to c...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.2000.4300
更新日期:2001-01-12 00:00:00
abstract::The filamentous fungus Neurospora crassa has a branched respiratory chain. Several alternative dehydrogenases, aside from the canonical complex I enzyme, are involved in the oxidation of NAD(P)H substrates. Based on homology searches in the fungal genome, we have tentatively identified one of these proteins. The corre...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/j.jmb.2007.02.080
更新日期:2007-05-11 00:00:00
abstract::The non-steroidal anti-estrogen tamoxifen [TAM] has been in clinical use over the last two decades as a potent adjunct chemotherapeutic agent for treatment of breast cancer. It has also been given prophylactically to women with a strong family history of breast cancer. However, tamoxifen treatment has also been associ...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.2000.4071
更新日期:2000-09-15 00:00:00
abstract::Ricin is a potent cytotoxin which has been used widely in the construction of therapeutic agents such as immunotoxins. Recently it has been used by governments and underground groups as a poison. There is interest in identifying and designing effective inhibitors of the ricin A chain (RTA). In this study computer-assi...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.1996.0865
更新日期:1997-03-14 00:00:00
abstract::Loss-of-function mutations in the gene encoding the multifunctional protein, DJ-1, have been implicated in the pathogenesis of early-onset familial Parkinson's disease (PD), suggesting that DJ-1 may act as a neuroprotectant for dopaminergic (DA) neurons. Enhanced autophagy may benefit PD by clearing damaged organelles...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/j.jmb.2012.06.034
更新日期:2012-10-19 00:00:00
abstract::TWINKLE is the helicase at the mitochondrial DNA (mtDNA) replication fork in mammalian cells. Mutations in the PEO1 gene, which encodes TWINKLE, cause autosomal dominant progressive external ophthalmoplegia (AdPEO), a disorder associated with deletions in mtDNA. Here, we characterized seven different AdPEO-causing mut...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/j.jmb.2008.01.035
更新日期:2008-03-28 00:00:00
abstract::We present a refined model of the membrane-associated Torpedo acetylcholine (ACh) receptor at 4A resolution. An improved experimental density map was obtained from 342 electron images of helical tubes, and the refined structure was derived to an R-factor of 36.7% (R(free) 37.9%) by standard crystallographic methods, a...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/j.jmb.2004.12.031
更新日期:2005-03-04 00:00:00
abstract::The calpains are a family of cysteine proteases with closely related amino acid sequences, but a wide range of Ca(2+) requirements (K(d)). For m-calpain, K(d) is approximately 325microM, for mu-calpain it is approximately 50microM, and for calpain 3 it is not strictly known but may be approximately 0.1microM. On the b...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/j.jmb.2004.08.073
更新日期:2004-10-29 00:00:00
abstract::Essentially all cold agglutinins (CA) with red blood cell I/i specificity isolated from patients with CA disease stemming from lymphoproliferative disorders utilize the VH 4-34 (VH 4-21) gene segment. This near universality of the restricted use of a single gene segment is substantially greater than that demonstrated ...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.1996.0110
更新日期:1996-03-01 00:00:00
abstract::Single sodium-driven rotors from a bacterial ATP synthase were embedded into a lipid membrane and observed in buffer solution at subnanometer resolution using atomic force microscopy (AFM). Time-lapse AFM topographs show the movement of single proteins within the membrane. Subsequent analysis of their individual traje...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/s0022-2836(03)00206-7
更新日期:2003-04-11 00:00:00
abstract::The solution structure of the homeodomain of the Drosophila morphogenic protein Bicoid (Bcd) complexed with a TAATCC DNA site is described. Bicoid is the only known protein that uses a homeodomain to regulate translation, as well as transcription, by binding to both RNA and DNA during early Drosophila development; in ...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/j.jmb.2005.12.007
更新日期:2006-03-10 00:00:00
abstract::Bloom syndrome protein (BLM) is one of five human RecQ helicases that participate in DNA metabolism. RecQ C-terminal (RQC) domain is the main DNA binding module of BLM and specifically recognizes G-quadruplex (G4) DNA structures. Because G4 processing by BLM is essential for regulating replication and transcription, b...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/j.jmb.2019.01.010
更新日期:2019-02-15 00:00:00
abstract::KorA and KorB proteins of IncP1 plasmid RK2 are encoded in the central control region (ccr) of the plasmid and act as global regulators of plasmid genes for replication, transfer and stable inheritance. KorA represses seven promoters on RK2, by binding to a defined operator site, OA, which always occurs in promoter re...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.1999.2761
更新日期:1999-06-04 00:00:00
abstract::FLP is a conservative site-specific recombinase that is encoded by the 2 microns plasmid of the yeast, Saccharomyces cerevisiae. FLP is member of the integrase family of recombinases that mediate the recombination reaction through a Holliday intermediate. The FLP recognition target (FRT) sites lie within two 599 bp in...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.1994.1647
更新日期:1994-10-21 00:00:00
abstract::Peptidoglycan recognition proteins (PGRPs) form a recently discovered protein family, which is conserved from insect to mammals and is implicated in the innate immune system by interacting with/or degrading microbial peptidoglycans (PGNs). Drosophila PGRP-SA is a member of this family of pattern recognition receptors ...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/j.jmb.2004.04.077
更新日期:2004-07-16 00:00:00
abstract::The sequence RGITVNGKTYGR has been reported as part of a de novo design peptide system. This peptide folds as a beta-hairpin structure with three residues per strand and two residue turns. Asn6 side-chain, the residue in position L1 of the beta-turn, appeared to be solvent exposed, interacting only within the turn but...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.1997.1347
更新日期:1997-11-07 00:00:00
abstract::Four-helix bundles are identified and characterized in the subunit interfaces of protein multimers. We find that this motif occurs as often in the interfaces as in the protein monomers. Common and different characteristics demonstrated by the bundles in the two environments suggest the possible stabilization mechanism...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1006/jmbi.1995.0208
更新日期:1995-04-21 00:00:00
abstract::Ras is a small GTP-binding protein that is an essential molecular switch for a wide variety of signaling pathways including the control of cell proliferation, cell cycle progression and apoptosis. In the GTP-bound state, Ras can interact with its effectors, triggering various signaling cascades in the cell. In the GDP...
journal_title:Journal of molecular biology
pub_type: 杂志文章
doi:10.1016/j.jmb.2010.03.046
更新日期:2010-06-11 00:00:00
abstract::Class II gene transcription commences with the assembly of the Preinitiation Complex (PIC) from a plethora of proteins and protein assemblies in the nucleus, including the General Transcription Factors (GTFs), RNA polymerase II (RNA pol II), co-activators, co-repressors, and more. TFIID, a megadalton-sized multiprotei...
journal_title:Journal of molecular biology
pub_type: 杂志文章,评审
doi:10.1016/j.jmb.2016.04.003
更新日期:2016-06-19 00:00:00