Abstract:
:Molecular dynamics simulations provide valuable insights into the behavior of molecular systems. Extending the recent trend of using machine learning techniques to predict physicochemical properties from molecular dynamics data, we propose to consider the trajectories as multidimensional time series represented by 2D tensors containing the ligand-protein interaction descriptor values for each time step. Similar in structure to the time series encountered in modern approaches for signal, speech, and natural language processing, these time series can be directly analyzed using long short-term memory (LSTM) recurrent neural networks or convolutional neural networks (CNNs). The predictive regression models for the ligand-protein affinity were built for a subset of the PDBbind v.2017 database and applied to inhibitors of tankyrase, an enzyme of the poly(ADP-ribose)-polymerase (PARP) family that can be used in the treatment of colorectal cancer. As an additional test set, a subset of the Community Structure-Activity Resource (CSAR) data set was used. For comparison, the random forest and simple neural network models based on the crystal pose or the trajectory-averaged descriptors were used, as well as the commonly employed docking and molecular mechanics Poisson-Boltzmann surface area (MM-PBSA) scores. Convolutional neural networks based on the 2D tensors of ligand-protein interaction descriptors for short (2 ns) trajectories provide the best accuracy and predictive power, reaching the Spearman rank correlation coefficient of 0.73 and Pearson correlation coefficient of 0.70 for the tankyrase test set. Taking into account the recent increase in computational power of modern GPUs and relatively low computational complexity of the proposed approach, it can be used as an advanced virtual screening filter for compound prioritization.
journal_name
J Chem Inf Modeljournal_title
Journal of chemical information and modelingauthors
Berishvili VP,Perkin VO,Voronkov AE,Radchenko EV,Syed R,Venkata Ramana Reddy C,Pillay V,Kumar P,Choonara YE,Kamal A,Palyulin VAdoi
10.1021/acs.jcim.9b00135subject
Has Abstractpub_date
2019-08-26 00:00:00pages
3519-3532issue
8eissn
1549-9596issn
1549-960Xjournal_volume
59pub_type
杂志文章abstract::The failure of default scoring functions to ensure virtual screening enrichment is a persistent problem for the molecular docking algorithms used in structure-based drug discovery. To remedy this problem, elaborate rescoring and postprocessing schemes have been developed with a varying degree of success, specificity, ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00383
更新日期:2019-08-26 00:00:00
abstract::The importance of thorough analyses of the secondary structures in proteins as basic structural units cannot be overemphasized. Although recent computational methods have achieved reasonably high accuracy for predicting secondary structures from amino acid sequences, a simple and fundamental empirical approach to char...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci900452z
更新日期:2010-04-26 00:00:00
abstract::Sixteen FDA-approved drugs were investigated to elucidate their mechanisms of action (MOAs) and clinical functions by pathway analysis based on retrieved drug targets interacting with or affected by the investigated drugs. Protein and gene targets and associated pathways were obtained by data-mining of public database...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci4005354
更新日期:2014-02-24 00:00:00
abstract::Human type II topoisomerases, molecular motors that alter the DNA topology, are a major target of modern chemotherapy. Groups of catalytic inhibitors represent a new approach to overcome the known limitations of topoisomerase II poisons such as cardiotoxicity and induction of secondary tumors. Here, we present a class...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.0c00202
更新日期:2020-07-27 00:00:00
abstract::Mapping the chemical space of small organic molecules is approached from a theoretical graph theory viewpoint, in an effort to begin the systematic exploration of molecular topologies. We present an algorithm for exhaustive generation of scaffold topologies with up to eight rings and an efficient comparison method for...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci7003412
更新日期:2008-07-01 00:00:00
abstract::Resistance remains a major issue with regards to HIV-1 protease, despite the availability of numerous HIV-1 protease inhibitors and copious amounts of structural and binding data. In an effort to improve our understanding of how HIV-1 protease is able to "outsmart" new drugs, we have investigated the flexibility of HI...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci2000677
更新日期:2011-05-23 00:00:00
abstract::With continually increased computer power, molecular mechanics force field-based approaches, such as the endpoint methods of molecular mechanics Poisson-Boltzmann surface area (MM-PBSA) and molecular mechanics generalized Born surface area (MM-GBSA), have been routinely applied in both drug lead identification and opt...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.0c00934
更新日期:2020-12-28 00:00:00
abstract::Reversible covalent inhibitors have drawn increasing attention in drug design, as they are likely more potent than noncovalent inhibitors and less toxic than covalent inhibitors. Despite those advantages, the computational prediction of reversible covalent binding presents a formidable challenge because the binding pr...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.8b00959
更新日期:2019-05-28 00:00:00
abstract::The community structure-activity resource (CSAR) data sets are used to develop and test a support vector machine-based scoring function in regression mode (SVR). Two scoring functions (SVR-KB and SVR-EP) are derived with the objective of reproducing the trend of the experimental binding affinities provided within the ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci200078f
更新日期:2011-09-26 00:00:00
abstract::When both the difference between two quantities and their individual values can be measured or computationally predicted, multiple quantities can be determined from the measurements or predictions of select individual quantities and select pairwise differences. These measurements and predictions form a network connect...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00528
更新日期:2019-11-25 00:00:00
abstract::We examine the sensitivity of folding molecular dynamics simulations on the choice between three variants of the same force field (the AMBER99SB force field and its ILDN, NMR-ILDN, and STAR-ILDN variants). Using two different peptide systems (a marginally stable helical peptide and a β-hairpin) and a grand total of mo...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.6b00493
更新日期:2016-10-24 00:00:00
abstract::A kinetic, reactivity-binding model has been proposed to predict the regioselectivity of substrates meditated by the CYP1A2 enzyme, which is responsible for the metabolism of planar-conjugated compounds such as caffeine. This model consists of a docking simulation for binding energy and a semiempirical molecular orbit...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci800001m
更新日期:2008-05-01 00:00:00
abstract::Databases of small, potentially bioactive molecules are ubiquitous across the industry and academia. Designed such that each unique compound should appear only once, the multiplicity of ways in which many compounds can be represented means that these databases require methods for standardizing the representation of ch...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.0c00232
更新日期:2020-08-24 00:00:00
abstract::Factor Xa inhibitors are innovative anticoagulant agents that provide a better safety/efficacy profile compared to other anticoagulative drugs. A chemical feature-based modeling approach was applied to identify crucial pharmacophore patterns from 3D crystal structures of inhibitors bound to human factor Xa (Pdb entrie...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci049778k
更新日期:2005-01-01 00:00:00
abstract::In this account, a rapid retrosynthesis-based scoring method for the assessment of synthetic accessibility of drug-like molecules, called RASA (Retrosynthesis-based Assessment of Synthetic Accessibility) is devised. RASA first constructs a synthesis tree for the target molecule based on retrosynthetic analysis; in thi...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci100216g
更新日期:2011-10-24 00:00:00
abstract::We present our newly developed and highly efficient lossless compression algorithm for trajectories of atom positions and volumetric data. The algorithm is designed as a two-step approach. In the first step, efficient polynomial extrapolation schemes reduce the information entropy of the data by exploiting both spatia...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.8b00501
更新日期:2018-10-22 00:00:00
abstract::With the emergence of large collections of protein-ligand complexes complemented by binding data, as found in PDBbind or BindingMOAD, new opportunities for parametrizing and evaluating scoring functions have arisen. With huge data collections available, it becomes feasible to fit scoring functions in a QSAR style, i.e...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci100264e
更新日期:2010-11-22 00:00:00
abstract::Protein-ligand binding is essential to almost all life processes. The understanding of protein-ligand interactions is fundamentally important to rational drug and protein design. Based on large scale data sets, we show that protein rigidity strengthening or flexibility reduction is a mechanism in protein-ligand bindin...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.7b00226
更新日期:2017-07-24 00:00:00
abstract::The determination of the validity of a QSAR model when applied to new compounds is an important concern in the field of QSAR and QSPR modeling. Various scoring techniques can be applied to specific types of models. We present a technique with which we can state whether a new compound will be well predicted by a previo...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci0497511
更新日期:2005-01-01 00:00:00
abstract::Among the photophysical parameters that underpin Förster resonance energy transfer (FRET), perhaps the least explored is the spectral overlap term ( J). While by definition J increases linearly with acceptor molar absorption coefficient (ε(A) in M-1 cm-1), is proportional to wavelength (λ4), and depends on the degree ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.8b00753
更新日期:2019-02-25 00:00:00
abstract::Spin diffusion is a formidable problem when interpreting NMR data of chemical compounds. We developed a method to reconstruct the conformational ensemble of flexible molecules displaying spin diffusion, which minimizes the subjective bias in the interpretation of experimental data and which can be used routinely to ob...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00259
更新日期:2019-06-24 00:00:00
abstract::In structure-based drug design, scoring functions are often employed to evaluate protein-ligand interactions. A variety of scoring functions have been developed so far, and thus, some objective benchmarks are desired for assessing their strength and weakness. The comparative assessment of scoring functions (CASF) benc...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.8b00545
更新日期:2019-02-25 00:00:00
abstract::The enzyme UDP-galactopyranose mutase (UGM) represents a promising drug target for the treatment of infections with Trypanosoma cruzi. We have computed the Potential of Mean Force for the release of UDP-galactopyranose from UGM, using Umbrella Sampling simulations. The simulations revealed the conformational changes t...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.8b00675
更新日期:2019-02-25 00:00:00
abstract::Class II fructose-1,6-bisphosphate aldolases (FBA-II) are attractive new targets for the discovery of drugs to combat invasive fungal infection, because they are absent in animals and higher plants. Although several FBA-II inhibitors have been reported, none of these inhibitors exhibit antifungal effect so far. In thi...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.6b00763
更新日期:2017-06-26 00:00:00
abstract::Various cages are constructed by using three types of caps: f-cap (derived from spherical fullerenes by deleting zones of various size), kf-cap (obtainable by cutting off the polar ring, of size k), and t-cap ("tubercule"-cap). Building ways are presented, some of them being possible isomerization routes in the real c...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci049738g
更新日期:2005-03-01 00:00:00
abstract::Water is the natural medium of molecules in the cell and plays an important role in protein structure, function and interaction with small molecule ligands. However, the widely used molecular mechanics Poisson-Boltzmann surface area (MM/PBSA) method for binding energy calculation does not explicitly take account of wa...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci4001794
更新日期:2014-02-24 00:00:00
abstract::The integration of ligand- and structure-based strategies might sensitively increase the success of drug discovery process. We have recently described the application of Molecular Electrostatic Potential autocorrelated vectors (autoMEPs) in generating both linear (Partial Least-Square, PLS) and nonlinear (Response Sur...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci700300w
更新日期:2008-02-01 00:00:00
abstract::HackaMol is an open source, object-oriented toolkit written in Modern Perl that organizes atoms within molecules and provides chemically intuitive attributes and methods. The library consists of two components: HackaMol, the core that contains classes for storing and manipulating molecular information, and HackaMol::X...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci500359e
更新日期:2015-04-27 00:00:00
abstract::We present an induced fit docking approach called Adaptive BP-Dock that integrates perturbation response scanning (PRS) with the flexible docking protocol of RosettaLigand in an adaptive manner. We first perturb the binding pocket residues of a receptor and obtain a new conformation based on the residue response fluct...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.5b00587
更新日期:2016-04-25 00:00:00
abstract::Calcium is involved in important intracellular processes, such as intracellular signaling from cell membrane receptors to the nucleus. Typically, calcium levels are kept at less than 100 nM in the nucleus and cytosol, but some calcium is stored in the endoplasmic reticulum (ER) lumen for rapid release to activate intr...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.6b00475
更新日期:2017-02-27 00:00:00