Abstract:
We investigate unexpectedly short non-covalent distances (<85% of the sum of van der Waals radii) in X-ray crystal structures of proteins. We curate over 11 000 high-quality protein crystal structures and an ultra-high-resolution (1.2 Å or better) subset containing >900 structures. Although our non-covalent distance criterion excludes standard hydrogen bonds known to be essential in protein stability, we observe over 75 000 close contacts (CCs) in the curated protein structures. Analysis of the frequency of amino acids participating in these interactions demonstrates some expected trends (i.e., enrichment of charged Lys, Arg, Asp, and Glu) but also reveals unexpected enhancement of Tyr in such interactions. Nearly all amino acids are observed to form at least one CC with all other amino acids, and most interactions are preserved in the much smaller ultra-high-resolution subset. We quantum-mechanically characterize the interaction energetics of a subset of >5000 CCs with symmetry-adapted perturbation theory to enable decomposition of interactions. We observe the majority of CCs to be favorable. The shortest favorable non-covalent distances are under 2.2 Å and are very repulsive when characterized with classical force fields. This analysis reveals stabilization by a combination of electrostatic and charge-transfer effects between hydrophobic (i.e., Val, Ile, Leu) amino acids and charged Asp or Glu. We also observe a unique hydrogen-bonding configuration between Tyr and Asn/Gln involving both residues acting simultaneously as hydrogen bond donors and acceptors. This work confirms the importance of first-principles simulation in explaining unexpected geometries in protein crystal structures.
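The distance criterion in the abstract (a contact shorter than 85% of the sum of van der Waals radii) can be illustrated with a minimal sketch. The radii below are Bondi values and are an assumption; the abstract does not say which radii set the authors used, and the function names are hypothetical, not taken from the paper.

```python
import math

# Bondi van der Waals radii in angstroms. ASSUMPTION: the abstract does not
# state which radii set the authors used; these are only for illustration.
VDW_RADII = {"H": 1.20, "C": 1.70, "N": 1.55, "O": 1.52, "S": 1.80}


def close_contact_cutoff(elem_a, elem_b, fraction=0.85):
    """Return the distance (in angstroms) below which a pair is a close contact."""
    return fraction * (VDW_RADII[elem_a] + VDW_RADII[elem_b])


def is_close_contact(elem_a, xyz_a, elem_b, xyz_b, fraction=0.85):
    """True if the interatomic distance is below fraction * (r_a + r_b)."""
    distance = math.dist(xyz_a, xyz_b)  # Euclidean distance, Python 3.8+
    return distance < close_contact_cutoff(elem_a, elem_b, fraction)


if __name__ == "__main__":
    # Hypothetical coordinates: two oxygen atoms 2.5 angstroms apart.
    print(close_contact_cutoff("O", "O"))
    print(is_close_contact("O", (0.0, 0.0, 0.0), "O", (2.5, 0.0, 0.0)))
```

With these assumed radii, the O···O cutoff is 0.85 × 3.04 Å ≈ 2.58 Å, so a pair of non-bonded oxygen atoms 2.5 Å apart would be flagged, while a typical N···O hydrogen-bond distance of about 2.9 Å would not.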
journal_name: J Chem Inf Model
journal_title: Journal of chemical information and modeling
authors: Qi HW, Kulik HJ
doi: 10.1021/acs.jcim.9b00144
subject: Has Abstract
pub_date: 2019-05-28 00:00:00
pages: 2199-2211
issue: 5
eissn: 1549-9596
issn: 1549-960X
journal_volume: 59
pub_type: Journal Article
abstract::A homology model of the Arabidopsis thaliana UV resistance locus 8 (UVR8) protein is presented herein, showing a seven-bladed β-propeller conformation similar to the globular structure of RCC1. The UVR8 amino acid sequence contains a very high amount of conserved tryptophans, and the homology model shows that seven of...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci200017f
update_date:2011-06-27 00:00:00
abstract::A three-dimensional homology model of the human histamine H4 receptor was developed to investigate the binding mode of a series of structurally diverse H4-agonists, i.e. histamine, clozapine, and the recently described selective, nonimidazole agonist VUF 8430. Mutagenesis studies and docking of these ligands in a rh...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci700474a
update_date:2008-07-01 00:00:00
abstract::Advances in computer-aided translation technology have made tremendous progress in accuracy in the past few years. Chemical Abstracts Service of the American Chemical Society summarizes scientific works from more than 50 languages and allows the users to search papers in nine selected languages. Currently, only the ab...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.0c00274
update_date:2020-07-27 00:00:00
abstract::Given the essential role played by protein kinases in regulating cellular pathways, their dysregulation can result in the onset and/or progression of various human diseases. Structural analysis of diverse protein kinases suggests that these proteins exhibit a remarkable plasticity that allows them to adopt distinct co...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.7b00439
update_date:2017-10-23 00:00:00
abstract::The importance of thorough analyses of the secondary structures in proteins as basic structural units cannot be overemphasized. Although recent computational methods have achieved reasonably high accuracy for predicting secondary structures from amino acid sequences, a simple and fundamental empirical approach to char...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci900452z
update_date:2010-04-26 00:00:00
abstract::Different forms of synaptic plasticity in the cerebellum expressed at the synapses onto Purkinje cells (PCs) are mediated by membrane metabotropic glutamate receptors (mGluRs). There are three main mGluR groups with a total of 8 subtypes. Although mGluRs are also found at the climbing fiber (CF) to PC synapses, the di...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci050161s
update_date:2005-11-01 00:00:00
abstract::The partitioning of solute molecules between immiscible solvents with significantly different polarities is of great importance. The polarization between the solute and solvent molecules plays an essential role in determining the solubility of the solute, which makes computational studies utilizing molecular mechanics...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.7b00001
update_date:2017-10-23 00:00:00
abstract::The human DNA-repair O(6)-alkylguanine DNA alkyltransferase (MGMT or hAGT) protein protects DNA from environmental alkylating agents and also plays an important role in tumor resistance to chemotherapy treatment. Available inhibitors, based on pseudosubstrate analogs, have been shown to induce substantial bone marrow...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci700447r
update_date:2008-04-01 00:00:00
abstract::Cytochrome P450 2D6 (CYP2D6) is used to develop an approach for predicting affinity and relevant binding conformation(s) for highly flexible binding sites. The approach combines the use of docking scores and compound properties as attributes in building a neural network (NN) model. It begins by identifying segments of...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci600267k
update_date:2006-11-01 00:00:00
abstract::Several hypotheses to elucidate the linkage isomer preference of the thiocyanate (SCN(-)) ion have been offered. For complexes with small coordination numbers (i.e., 1 and 2) and groups 11 (Cu-triad) and 12 (Zn-triad) metals, different levels of theory and a variety of basis sets have been employed to study linkage is...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci050050t
update_date:2005-07-01 00:00:00
abstract::Calcium and magnesium ions play important roles in many physicochemical processes. To facilitate the investigation of phenomena related to these ions that occur over large length and time scales, a coarse-grained force field (CGFF) is developed for MgCl2 and CaCl2 aqueous solutions. The ions are modeled by CG beads wi...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.7b00206
update_date:2017-07-24 00:00:00
abstract::This article presents the computation of both inter- and intramolecular hydrogen bond strengths from first-principles. Quantum chemical calculations conducted at the dispersion-corrected density functional theory level including free energy and solvation contributions are conducted for (i) one-to-one hydrogen-bonded c...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.9b00132
update_date:2019-09-23 00:00:00
abstract::Encapsulation of peptide and protein-based drugs in polymeric nanoparticles is one of the fundamental fields in controlled-release drug delivery systems. The molecular mechanisms of absorption of peptides to the polymeric nanoparticles are still unknown, and there is no precise molecular data on the encapsulation proc...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.8b00641
update_date:2019-01-28 00:00:00
abstract::A generic chemical transformation may often be achieved under various synthetic conditions. However, for any specific reagents, only one or a few among the reported synthetic protocols may be successful. For example, Michael β-addition reactions may proceed under different choices of solvent (e.g., hydrophobic, aproti...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci500698a
update_date:2015-02-23 00:00:00
abstract::Discovery of new antibacterial agents is a never-ending task of medicinal chemistry. Every new drug brings significant improvement to patients with bacterial infections, but prolonged usage of antibacterials leads to the emergence of resistant strains. Therefore, novel active structures with new modes of action are re...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.9b00436
update_date:2019-11-25 00:00:00
abstract::Most of the common molecular descriptors have numerous different implementations. This can influence the results of compound prioritization based on the multiparameter assessment (MPA) approach that allows a medicinal chemist to simultaneously analyze and achieve the desired balance of the diverse and often conflictin...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.7b00734
update_date:2018-05-29 00:00:00
abstract::Visualizing high-dimensional data by projecting them into a two- or three-dimensional space is a popular approach in many scientific fields, including computer-aided drug design and cheminformatics. In contrast, dimensionality reduction techniques have been far less explored for materials informatics. Nevertheless, si...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.8b00552
update_date:2018-12-24 00:00:00
abstract::We propose predictive performance criteria for nonlinear regression models without cross-validation. The proposed criteria are the determination coefficient and the root-mean-square error for the midpoints between k-nearest-neighbor data points. These criteria can be used to evaluate predictive ability after the regre...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci4003766
update_date:2013-09-23 00:00:00
abstract::In this work, the perception of similarity of reactions catalyzed by hydrolases and oxidoreductases on the basis of the overall breaking and making of bonds of reactions is investigated. Six physicochemical properties for the reacting bond in the substrate of each enzymatic reaction were calculated to describe the cha...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci9004833
update_date:2010-06-28 00:00:00
abstract::Modern industrial lubricants are often blended with an assortment of chemical additives to improve the performance of the base stock. Machine learning-based predictive models allow fast and veracious derivation of material properties and facilitate novel and innovative material designs. In this study, we outline the d...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.9b01068
update_date:2020-03-23 00:00:00
abstract::An accurate scoring function is expected to correctly select the most stable structure from a set of pose candidates. One can hypothesize that a scoring function's ability to identify the most stable structure might be improved by emphasizing the most relevant atom pairwise interactions. However, it is hard to evaluat...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.9b00356
update_date:2019-07-22 00:00:00
abstract::The similarity/diversity measures play a fundamental role in library searching, virtual screening, and quantitative structure-activity relationship/quantitative structure-property relationship modeling as well as in genomics and proteomics. In this paper, a new similarity/diversity measure is proposed as a new approac...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci060099e
update_date:2006-09-01 00:00:00
abstract::Protein-protein interactions (PPIs) play vital roles in regulating biological processes, such as cellular and signaling pathways. Hotspots are certain residues located at protein-protein interfaces that contribute more in protein-protein binding than other residues. Research on the mutational effects of hotspots is im...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.0c00966
update_date:2021-01-25 00:00:00
abstract::The community structure-activity resource (CSAR) data sets are used to develop and test a support vector machine-based scoring function in regression mode (SVR). Two scoring functions (SVR-KB and SVR-EP) are derived with the objective of reproducing the trend of the experimental binding affinities provided within the ...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci200078f
update_date:2011-09-26 00:00:00
abstract::Metal-ligand (M-L) bond lengths for a range of ligands (carboxylates, chlorides, pyridines, water, tertiary phosphines, and alkenes) and a variety of metals have been retrieved from the Cambridge Structural Database, CSD. Analysis of the factors which affect M-L bond lengths (for example, ligand coordination mode, oxi...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci0500785
update_date:2005-11-01 00:00:00
abstract::In this study, we have developed a two-model system to mimic the active and inactive states of a G-protein coupled receptor, specifically the alpha1A adrenergic receptor. We have docked two agonists, epinephrine (phenylamine type) and oxymetazoline (imidazoline type), as well as two antagonists, prazosin and 5-methylur...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci700026v
update_date:2007-09-01 00:00:00
abstract::The present study focuses on the determination of the biologically significant N-acetylneuraminic acid (NANA) drug binding interaction mechanism between bovine serum albumin (BSA) and human α-1 acid glycoprotein (HAG) using various optical spectroscopy and computational methods. The steady state fluorescence spectrosc...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.8b00558
update_date:2019-01-28 00:00:00
abstract::Sterol 14α-demethylase (CYP51) is the main drug target for the treatment of fungal infections. The discovery of new efficient fungal CYP51 inhibitors requires an understanding of the structural requirements for selectivity for the fungal over the human ortholog. In this study, a binding mode of the pyridylethanol(phen...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci500556k
update_date:2014-12-22 00:00:00
abstract::The homodimeric catabolite activator protein (CAP) regulates the transcription of several bacterial genes based on the cellular concentration of cyclic adenosine monophosphate (cAMP). The binding of cAMP to CAP triggers allosteric communication between the cAMP binding domains (CBD) and DNA binding domains (DBD) of CA...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.0c00617
update_date:2020-12-28 00:00:00
abstract::Membrane transporters play a crucial role in determining fate of administered drugs in a biological system. Early identification of plausible transporters for a drug molecule can provide insights into its therapeutic, pharmacokinetic, and toxicological profiles. In the present study, predictive models for classifying ...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.6b00508
update_date:2017-03-27 00:00:00