Abstract:
:Protein-ligand (PL) interactions play a key role in many life processes such as molecular recognition, molecular binding, signal transmission, and cell metabolism. Examples of interaction forces include hydrogen bonding, hydrophobic effects, steric clashes, electrostatic contacts, and van der Waals attractions. Currently, a large number of hypotheses and perspectives to model these interaction forces are scattered throughout the literature and largely forgotten. Instead, had they been assembled and utilized collectively, they would have substantially improved the accuracy of predicting binding affinity of protein-ligand complexes. In this work, we present Descriptor Data Bank (DDB), a data-driven platform on the cloud for facilitating multiperspective modeling of PL interactions. DDB is an open-access hub for depositing, hosting, executing, and sharing descriptor extraction tools and data for a large number of interaction modeling hypotheses. The platform also implements a machine-learning (ML) toolbox for automatic descriptor filtering and analysis and scoring function (SF) fitting and prediction. The descriptor filtering module is used to filter out irrelevant and/or noisy descriptors and to produce a compact subset from all available features. We seed DDB with 16 diverse descriptor extraction tools developed in-house and collected from the literature. The tools altogether generate over 2700 descriptors that characterize (i) proteins, (ii) ligands, and (iii) protein-ligand complexes. The in-house descriptors we extract are protein-specific which are based on pairwise primary and tertiary alignment of protein structures followed by clustering and trilateration. We built and used DDB's ML library to fit SFs to the in-house descriptors and those collected from the literature. We then evaluated them on several data sets that were constructed to reflect real-world drug screening scenarios. We found that multiperspective SFs that were constructed using a large number of diverse DDB descriptors capturing various PL interactions in different ways outperformed their single-perspective counterparts in all evaluation scenarios, with an average improvement of more than 15%. We also found that our proposed protein-specific descriptors improve the accuracy of SFs.
journal_name
J Chem Inf Modeljournal_title
Journal of chemical information and modelingauthors
Ashtawy HM,Mahapatra NRdoi
10.1021/acs.jcim.7b00310subject
Has Abstractpub_date
2018-01-22 00:00:00pages
134-147issue
1eissn
1549-9596issn
1549-960Xjournal_volume
58pub_type
杂志文章abstract::Large ring cyclodextrins have become increasingly important for drug delivery applications. In this work, we have performed replica-exchange molecular dynamics simulations using both implicit and explicit water solvation models to study the conformational diversity of iota-cyclodextrin containing 14 α-1,4 glycosidic l...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.6b00595
更新日期:2017-04-24 00:00:00
abstract::Modern industrial lubricants are often blended with an assortment of chemical additives to improve the performance of the base stock. Machine learning-based predictive models allow fast and veracious derivation of material properties and facilitate novel and innovative material designs. In this study, we outline the d...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b01068
更新日期:2020-03-23 00:00:00
abstract::An index of the activation of Class A G-protein-coupled receptors (GPCRs) has been trained using interhelix distances from a series of microsecond molecular-dynamics simulations and tested for 268 published X-ray structures. In a three-class model that includes intermediate structures, 63% of the active structures are...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00604
更新日期:2019-09-23 00:00:00
abstract::Cathepsin A is a mammalian lysosomal enzyme that catalyzes the hydrolysis of the carboxy-terminal amino acids of polypeptides and also regulates beta-galactosidase and neuraminidase-1 activities through the formation of a multienzymic complex in lysosomes. Human cathepsin A (hCathA), yeast carboxypeptidase (CPY), and ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci060093p
更新日期:2006-09-01 00:00:00
abstract::The second extracellular loops (ECL2s) of G-protein-coupled receptors (GPCRs) are often involved in GPCR functions, and their structures have important implications in drug discovery. However, structure prediction of ECL2 is difficult because of its long length and the structural diversity among different GPCRs. In th...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.8b00148
更新日期:2018-06-25 00:00:00
abstract::Spin diffusion is a formidable problem when interpreting NMR data of chemical compounds. We developed a method to reconstruct the conformational ensemble of flexible molecules displaying spin diffusion, which minimizes the subjective bias in the interpretation of experimental data and which can be used routinely to ob...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00259
更新日期:2019-06-24 00:00:00
abstract::Molecular dynamics simulations provide valuable insights into the behavior of molecular systems. Extending the recent trend of using machine learning techniques to predict physicochemical properties from molecular dynamics data, we propose to consider the trajectories as multidimensional time series represented by 2D ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00135
更新日期:2019-08-26 00:00:00
abstract::The early stages of drug discovery rely on hit-to-lead programs, where initial hits undergo partial optimization to improve binding affinities for their biological target. This is an expensive and time-consuming process, requiring multiple iterations of trial and error designs, an ideal scenario for applying computer ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00938
更新日期:2020-03-23 00:00:00
abstract::We present an induced fit docking approach called Adaptive BP-Dock that integrates perturbation response scanning (PRS) with the flexible docking protocol of RosettaLigand in an adaptive manner. We first perturb the binding pocket residues of a receptor and obtain a new conformation based on the residue response fluct...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.5b00587
更新日期:2016-04-25 00:00:00
abstract::Human cannabinoid type 1 (CB1) G-protein coupled receptor is a potential therapeutic target for obesity. The previously predicted and experimentally validated ensemble of ligand-free conformations of CB1 [Scott, C. E. et al. Protein Sci. 2013 , 22 , 101 - 113 ; Ahn, K. H. et al. Proteins 2013 , 81 , 1304 - 1317] are u...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.5b00581
更新日期:2016-01-25 00:00:00
abstract::In the preceding paper (Duca, J. S.; Madison, V. S.; Voigt, J. H. J. Chem. Inf. Model. 2008, 48, 659-668), the accuracy of docking and affinity predictions of the Gold and Glide programs were investigated using single protein conformations spanning 150 CDK2/inhibitor crystallographic complexes. High docking accuracy w...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci700428d
更新日期:2008-03-01 00:00:00
abstract::Small-molecule protein docking is an essential tool in drug design and to understand molecular recognition. In the present work we introduce FlexAID, a small-molecule docking algorithm that accounts for target side-chain flexibility and utilizes a soft scoring function, i.e. one that is not highly dependent on specifi...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.5b00078
更新日期:2015-07-27 00:00:00
abstract::Various cages are constructed by using three types of caps: f-cap (derived from spherical fullerenes by deleting zones of various size), kf-cap (obtainable by cutting off the polar ring, of size k), and t-cap ("tubercule"-cap). Building ways are presented, some of them being possible isomerization routes in the real c...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci049738g
更新日期:2005-03-01 00:00:00
abstract::The conformational properties of AT1 antagonist valsartan have been analyzed both in solution and at the binding site of the receptor. Low energy conformations of valsartan in solution were explored by NMR spectroscopy and molecular modeling studies. The NMR results showed the existence of two distinct and almost isoe...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci800427s
更新日期:2009-03-01 00:00:00
abstract::The importance of thorough analyses of the secondary structures in proteins as basic structural units cannot be overemphasized. Although recent computational methods have achieved reasonably high accuracy for predicting secondary structures from amino acid sequences, a simple and fundamental empirical approach to char...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci900452z
更新日期:2010-04-26 00:00:00
abstract::Blockade of human ether-à-go-go related gene (hERG) channel prolongs the duration of the cardiac action potential and is a common reason for drug failure in preclinical safety trials. Therefore, it is of great importance to develop robust in silico tools to predict potential hERG blockers in the early stages of drug d...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci200271d
更新日期:2011-11-28 00:00:00
abstract::Traditional herbal medicine has been an inseparable part of the traditional medical science in many countries throughout history. Nowadays, the popularity of using herbal medicines in daily life, as well as clinical practices, has gradually expanded to numerous Western countries with positive impacts and acceptance. T...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00826
更新日期:2020-03-23 00:00:00
abstract::Generalization of an earlier algorithm has led to the development of new local structural alignment algorithms for prediction of protein-protein binding sites. The algorithms use maximum cliques on protein graphs to define structurally similar protein regions. The search for structural neighbors in the new algorithms ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci100265x
更新日期:2010-10-25 00:00:00
abstract::Human type II topoisomerases, molecular motors that alter the DNA topology, are a major target of modern chemotherapy. Groups of catalytic inhibitors represent a new approach to overcome the known limitations of topoisomerase II poisons such as cardiotoxicity and induction of secondary tumors. Here, we present a class...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.0c00202
更新日期:2020-07-27 00:00:00
abstract::Acetohydroxyacid synthase (AHAS) is a thiamin diphosphate-dependent enzyme involved in the biosynthesis of valine, leucine, isoleucine, and lysine. Experimental evidence has shown that mutation of the Gln202 residue results in a decrease in the enzymatic activity, thus suggesting the main role of the carboligation cat...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00863
更新日期:2020-02-24 00:00:00
abstract::CDC25 phosphatases play critical roles in cell cycle regulation and are attractive targets for anticancer therapies. Several small non-peptide molecules are known to inhibit CDC25, but many of them appear to form a covalent bond with the enzyme or act through oxidation of the thiolate group of the catalytic cysteine. ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci700313e
更新日期:2008-01-01 00:00:00
abstract::Increased reports of oseltamivir (OTV)-resistant strains of the influenza virus, such as the H274Y mutation on its neuraminidase (NA), have created some cause for concern. Many studies have been conducted in the attempt to uncover the mechanism of OTV resistance in H274Y NA. However, most of the reported studies on H2...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.5b00331
更新日期:2016-01-25 00:00:00
abstract::Fragment-based methods have emerged in the last two decades as alternatives to traditional high throughput screenings for the identification of chemical starting points in drug discovery. One arguable yet popular assumption about fragment-based design is that the fragment binding mode remains conserved upon chemical e...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci300355p
更新日期:2012-12-21 00:00:00
abstract::This article presents the computation of both inter- and intramolecular hydrogen bond strengths from first-principles. Quantum chemical calculations conducted at the dispersion-corrected density functional theory level including free energy and solvation contributions are conducted for (i) one-to-one hydrogen-bonded c...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00132
更新日期:2019-09-23 00:00:00
abstract::Pharmacophore search is a key component of many drug discovery efforts. Pharmer is a new computational approach to pharmacophore search that scales with the breadth and complexity of the query, not the size of the compound library being screened. Two novel methods for organizing pharmacophore data, the Pharmer KDB-tre...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci200097m
更新日期:2011-06-27 00:00:00
abstract::One of the largest commercial applications of enzymes and surfactants is as main components in modern detergents. The high concentration of surfactant compounds usually present in detergents can, however, negatively affect the enzymatic activity. To remedy this drawback, it is of great importance to characterize the i...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.8b00857
更新日期:2019-05-28 00:00:00
abstract::Computer programs for structure diagram generation (SDG) are indispensable cheminformatic tools that translate one- or three-dimensional (1D or 3D) chemical structure data stored in electronic formats to human-readable 2D depictions. Although many such programs are known, only a moderate part of chemical space can be ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.6b00391
更新日期:2016-12-27 00:00:00
abstract::The molecular structure of four dimeric units (D-E, E-F, F-G, and G-H) of the DEFGH structural unit of heparin, their anionic forms, and their sodium salts have been studied using the B3LYP/6-31+G(d) method. The optimized geometries indicate that the most stable structure of these dimeric units in neutral state is sta...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci060060+
更新日期:2006-07-01 00:00:00
abstract::Although there are several databases that contain data on many metabolites and reactions in biochemical pathways, there is still a big gap in the numbers between experimentally identified enzymes and metabolites. It is supposed that many catalytic enzyme genes are still unknown. Although there are previous studies tha...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.5b00216
更新日期:2016-03-28 00:00:00
abstract::The failure of default scoring functions to ensure virtual screening enrichment is a persistent problem for the molecular docking algorithms used in structure-based drug discovery. To remedy this problem, elaborate rescoring and postprocessing schemes have been developed with a varying degree of success, specificity, ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00383
更新日期:2019-08-26 00:00:00