Perturbation-Theory and Machine Learning (PTML) Model for High-Throughput Screening of Parham Reactions: Experimental and Theoretical Studies.

Abstract:

:Machine learning (ML) algorithms are gaining importance in the processing of chemical information and modeling of chemical reactivity problems. In this work, we have developed a perturbation-theory and machine learning (PTML) model combining perturbation theory (PT) and ML algorithms for predicting the yield of a given reaction. For this purpose, we have selected Parham cyclization, which is a general and powerful tool for the synthesis of heterocyclic and carbocyclic compounds. This reaction has both structural (substitution pattern on the substrate, internal electrophile, ring size, etc.) and operational variables (organolithium reagent, solvent, temperature, time, etc.), so predicting the effect of changes on substrate design (internal elelctrophile, halide, etc.) or reaction conditions on the yield is an important task that could help to optimize the reaction design. The PTML model developed uses PT operators to account for perturbations under experimental conditions and/or structural variables of all the molecules involved in a query reaction, compared to a reaction of reference. Thus, a dataset of >100 reactions has been collected for different substrates and internal electrophiles, under different reaction conditions, with a wide range of yields (0-98%). The best PTML model found using General Linear Regression (GLR) has R = 0.88 in training and R = 0.83 in external validation series for 10 000 pairs of query and reference reactions. The PTML model has a final R = 0.95 for all reactions using multiple reactions of reference. We also report a comparative study of linear versus nonlinear PTML models based on artificial neural network (ANN) algorithms. PTML-ANN models (LNN, MLP, RBF) with R ≈ 0.1-0.8 do not outperform the first PMTL model. This result confirms the validity of the linearity of the model. Next, we carried out an experimental and theoretical study of nonreported Parham reactions to illustrate the practical use of the PTML model. A 500 000-point simulation and a Hammett analysis of the reactivity space of Parham reactions are also reported.

journal_name

J Chem Inf Model

authors

Simón-Vidal L,García-Calvo O,Oteo U,Arrasate S,Lete E,Sotomayor N,González-Díaz H

doi

10.1021/acs.jcim.8b00286

subject

Has Abstract

pub_date

2018-07-23 00:00:00

pages

1384-1396

issue

7

eissn

1549-9596

issn

1549-960X

journal_volume

58

pub_type

杂志文章
  • CHARMMing: a new, flexible web portal for CHARMM.

    abstract::A new web portal for the CHARMM macromolecular modeling package, CHARMMing (CHARMM interface and graphics, http://www.charmming.org), is presented. This tool provides a user-friendly interface for the preparation, submission, monitoring, and visualization of molecular simulations (i.e., energy minimization, solvation,...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci800133b

    authors: Miller BT,Singh RP,Klauda JB,Hodoscek M,Brooks BR,Woodcock HL 3rd

    更新日期:2008-09-01 00:00:00

  • RosENet: Improving Binding Affinity Prediction by Leveraging Molecular Mechanics Energies with an Ensemble of 3D Convolutional Neural Networks.

    abstract::The worldwide increase and proliferation of drug resistant microbes, coupled with the lag in new drug development, represents a major threat to human health. In order to reduce the time and cost for exploring the chemical search space, drug discovery increasingly relies on computational biology approaches. One key ste...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00075

    authors: Hassan-Harrirou H,Zhang C,Lemmin T

    更新日期:2020-06-22 00:00:00

  • The valence state combination model: a generic framework for handling tautomers and protonation states.

    abstract::The consistent handling of molecules is probably the most basic and important requirement in the field of cheminformatics. Reliable results can only be obtained if the underlying calculations are independent of the specific way molecules are represented in the input data. However, ensuring consistency is a complex tas...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci400724v

    authors: Urbaczek S,Kolodzik A,Rarey M

    更新日期:2014-03-24 00:00:00

  • Computational Prediction and Biochemical Analyses of New Inverse Agonists for the CB1 Receptor.

    abstract::Human cannabinoid type 1 (CB1) G-protein coupled receptor is a potential therapeutic target for obesity. The previously predicted and experimentally validated ensemble of ligand-free conformations of CB1 [Scott, C. E. et al. Protein Sci. 2013 , 22 , 101 - 113 ; Ahn, K. H. et al. Proteins 2013 , 81 , 1304 - 1317] are u...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.5b00581

    authors: Scott CE,Ahn KH,Graf ST,Goddard WA 3rd,Kendall DA,Abrol R

    更新日期:2016-01-25 00:00:00

  • Energetics, Thermodynamics, and Molecular Recognition of Piperine with DNA.

    abstract::Piperine, the bioactive phytochemical from black pepper (Piper nigrum L.), is a nontoxic natural compound exhibiting many physiological and pharmacological properties. They include antioxidant, anti-inflammatory, antimutagenic, antitumor, antiapoptotic, antigenotoxic, antiarthritic, antifungal, antimicrobial, antidepr...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.5b00514

    authors: Haris P,Mary V,Haridas M,Sudarsanakumar C

    更新日期:2015-12-28 00:00:00

  • Leave-cluster-out cross-validation is appropriate for scoring functions derived from diverse protein data sets.

    abstract::With the emergence of large collections of protein-ligand complexes complemented by binding data, as found in PDBbind or BindingMOAD, new opportunities for parametrizing and evaluating scoring functions have arisen. With huge data collections available, it becomes feasible to fit scoring functions in a QSAR style, i.e...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci100264e

    authors: Kramer C,Gedeck P

    更新日期:2010-11-22 00:00:00

  • Hidden active information in a random compound library: extraction using a pseudo-structure-activity relationship model.

    abstract::We propose a hypothesis that "a model of active compound can be provided by integrating information of compounds high-ranked by docking simulation of a random compound library". In our hypothesis, the inclusion of true active compounds in the high-ranked compound is not necessary. We regard the high-ranked compounds a...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci7003384

    authors: Fukunishi H,Teramoto R,Shimada J

    更新日期:2008-03-01 00:00:00

  • Informatics-Aided Density Functional Theory Study on the Li Ion Transport of Tavorite-Type LiMTO4F (M(3+)-T(5+), M(2+)-T(6+)).

    abstract::The ongoing search for fast Li-ion conducting solid electrolytes has driven the deployment surge on density functional theory (DFT) computation and materials informatics for exploring novel chemistries before actual experimental testing. Existing structure prototypes can now be readily evaluated beforehand not only to...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci500752n

    authors: Jalem R,Kimura M,Nakayama M,Kasuga T

    更新日期:2015-06-22 00:00:00

  • Optimal Measurement Network of Pairwise Differences.

    abstract::When both the difference between two quantities and their individual values can be measured or computationally predicted, multiple quantities can be determined from the measurements or predictions of select individual quantities and select pairwise differences. These measurements and predictions form a network connect...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00528

    authors: Xu H

    更新日期:2019-11-25 00:00:00

  • Consensus QSAR models: do the benefits outweigh the complexity?

    abstract::This study has assessed the use of consensus regression, as compared to single multiple linear regression, models for the development of quantitative structure-activity relationships (QSARs). To provide a comparison, four data sets of varying size and complexity were analyzed: silastic membrane flux, toxicity of pheno...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci700016d

    authors: Hewitt M,Cronin MT,Madden JC,Rowe PH,Johnson C,Obi A,Enoch SJ

    更新日期:2007-07-01 00:00:00

  • Antihypertensive drug valsartan in solution and at the AT1 receptor: conformational analysis, dynamic NMR spectroscopy, in silico docking, and molecular dynamics simulations.

    abstract::The conformational properties of AT1 antagonist valsartan have been analyzed both in solution and at the binding site of the receptor. Low energy conformations of valsartan in solution were explored by NMR spectroscopy and molecular modeling studies. The NMR results showed the existence of two distinct and almost isoe...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci800427s

    authors: Potamitis C,Zervou M,Katsiaras V,Zoumpoulakis P,Durdagi S,Papadopoulos MG,Hayes JM,Grdadolnik SG,Kyrikou I,Argyropoulos D,Vatougia G,Mavromoustakos T

    更新日期:2009-03-01 00:00:00

  • Structure-Based Discovery of 1H-Indazole-3-carboxamides as a Novel Structural Class of Human GSK-3 Inhibitors.

    abstract::An in silico screening procedure was performed to select new inhibitors of glycogen synthase kinase 3β (GSK-3β), a serine/threonine protein kinase that in the last two decades has emerged as a key target in drug discovery, having been implicated in multiple cellular processes and linked with the pathogenesis of severa...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.5b00486

    authors: Ombrato R,Cazzolla N,Mancini F,Mangano G

    更新日期:2015-12-28 00:00:00

  • Imputation of Assay Bioactivity Data Using Deep Learning.

    abstract::We describe a novel deep learning neural network method and its application to impute assay pIC50 values. Unlike conventional machine learning approaches, this method is trained on sparse bioactivity data as input, typical of that found in public and commercial databases, enabling it to learn directly from correlation...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.8b00768

    authors: Whitehead TM,Irwin BWJ,Hunt P,Segall MD,Conduit GJ

    更新日期:2019-03-25 00:00:00

  • Interpretation of the binding affinities of PTP1B inhibitors with the MM-GB/SA method and the X-score scoring function.

    abstract::We have studied the binding affinities of a set of 45 small-molecule inhibitors to protein tyrosine phosphatase 1B (PTP1B) through computational approaches. All of these compounds share a common oxalylamino benzoic acid (OBA) moiety. The complex structure of each compound was modeled by using the GOLD program plus the...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci8004429

    authors: Zhang X,Li X,Wang R

    更新日期:2009-04-01 00:00:00

  • 3D QSAR methods: Phase and Catalyst compared.

    abstract::The programs Phase and Catalyst HypoGen are compared for their performance in determining three-dimensional quantitative structure-activity relationships. Eight sets of compounds with measured activity were collected from the public literature and partitioned into suitable training and test sets by an automated proced...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci7000082

    authors: Evans DA,Doman TN,Thorner DA,Bodkin MJ

    更新日期:2007-05-01 00:00:00

  • Torsion Library Reloaded: A New Version of Expert-Derived SMARTS Rules for Assessing Conformations of Small Molecules.

    abstract::The Torsion Library contains hundreds of rules for small molecule conformations which have been derived from the Cambridge Structural Database (CSD) and are curated by molecular design experts. The torsion rules are encoded as SMARTS patterns and categorize rotatable bonds via a traffic light coloring scheme. We have ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.5b00522

    authors: Guba W,Meyder A,Rarey M,Hert J

    更新日期:2016-01-25 00:00:00

  • Structure-Based Kinase Profiling To Understand the Polypharmacological Behavior of Therapeutic Molecules.

    abstract::Several drugs elicit their therapeutic efficacy by modulating multiple cellular targets and possess varied polypharmacological actions. The identification of the molecular targets of a potent bioactive molecule is essential in determining its overall polypharmacological profile. Experimental procedures are expensive a...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00227

    authors: Dutta D,Das R,Mandal C,Mandal C

    更新日期:2018-01-22 00:00:00

  • Comparative Binding Analysis of N-Acetylneuraminic Acid in Bovine Serum Albumin and Human α-1 Acid Glycoprotein.

    abstract::The present study focuses on the determination of the biologically significant N-acetylneuraminic acid (NANA) drug binding interaction mechanism between bovine serum albumin (BSA) and human α-1 acid glycoprotein (HAG) using various optical spectroscopy and computational methods. The steady state fluorescence spectrosc...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.8b00558

    authors: Karthikeyan S,Bharanidharan G,Ragavan S,Kandasamy S,Chinnathambi S,Udayakumar K,Mangaiyarkarasi R,Sundaramoorthy A,Aruna P,Ganesan S

    更新日期:2019-01-28 00:00:00

  • Modeling Binding with Large Conformational Changes: Key Points in Ensemble-Docking Approaches.

    abstract::Protein dynamics play a critical role in ligand binding, and different models have been proposed to explain the relationships between protein motion and molecular recognition. Here, we present a study of ligand-binding processes associated with large conformational changes of a protein to elucidate the critical choice...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00125

    authors: Motta S,Bonati L

    更新日期:2017-07-24 00:00:00

  • Adaptive configuring of radial basis function network by hybrid particle swarm algorithm for QSAR studies of organic compounds.

    abstract::The configuring of a radial basis function network (RBFN) consists of selecting the network parameters (centers and widths in RBF units and weights between the hidden and output layers) and network architecture. The issues of suboptimum and overfitting, however, often occur in RBFN configuring. This paper presented a ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci600218d

    authors: Zhou YP,Jiang JH,Lin WQ,Zou HY,Wu HL,Shen GL,Yu RQ

    更新日期:2006-11-01 00:00:00

  • Isomerization and Decomposition of 2-Methylfuran with External Forces.

    abstract::The primary goal of this project was to evaluate the performance of the Standard and Enforced Geometry Optimization (SEGO) method which we have recently developed. The SEGO method has been designed for an automatic location of multiple minima on the molecular Potential Energy Surface (PES), and its usefulness has been...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00352

    authors: Brzyska A,Woliński K

    更新日期:2019-08-26 00:00:00

  • A Polarization-Consistent Model for Alcohols to Predict Solvation Free Energies.

    abstract::Classical nonpolarizable models, normally based on a combination of Lennard-Jones sites and point charges, are extensively used to model thermodynamic properties of fluids, including solvation. An important shortcoming of these models is that they do not explicitly account for polarization effects, i.e., a description...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b01005

    authors: Barrera MC,Jorge M

    更新日期:2020-03-23 00:00:00

  • Direct Observation of β-Barrel Intermediates in the Self-Assembly of Toxic SOD128-38 and Absence in Nontoxic Glycine Mutants.

    abstract::Soluble low-molecular-weight oligomers formed during the early stage of amyloid aggregation are considered the major toxic species in amyloidosis. The structure-function relationship between oligomeric assemblies and the cytotoxicity in amyloid diseases are still elusive due to the heterogeneous and transient nature o...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c01319

    authors: Sun Y,Huang J,Duan X,Ding F

    更新日期:2021-01-14 00:00:00

  • Role of water in ligand binding to maltose-binding protein: insight from a new docking protocol based on the 3D-RISM-KH molecular theory of solvation.

    abstract::Maltose-binding protein is a periplasmic binding protein responsible for transport of maltooligosaccarides through the periplasmic space of Gram-negative bacteria, as a part of the ABC transport system. The molecular mechanisms of the initial ligand binding and induced large scale motion of the protein's domains still...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci500520q

    authors: Huang W,Blinov N,Wishart DS,Kovalenko A

    更新日期:2015-02-23 00:00:00

  • Protein Solvent Shell Structure Provides Rapid Analysis of Hydration Dynamics.

    abstract::The solvation layer surrounding a protein is clearly an intrinsic part of protein structure-dynamics-function, and our understanding of how the hydration dynamics influences protein function is emerging. We have recently reported simulations indicating a correlation between regional hydration dynamics and the structur...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00009

    authors: Dahanayake JN,Shahryari E,Roberts KM,Heikes ME,Kasireddy C,Mitchell-Koch KR

    更新日期:2019-05-28 00:00:00

  • Generalized topological indices. Modeling gas-phase rate coefficients of atmospheric relevance.

    abstract::We develop the idea that the use of ad hoc molecular descriptors in QSAR/QSPR studies is not an optimal solution. Instead, we propose to optimize these descriptors for the specific properties under study. In the case of topological indices (TIs) we propose the use of the generalized topological indices (GTIs), which a...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci600448b

    authors: Estrada E,Matamala AR

    更新日期:2007-05-01 00:00:00

  • Viscosity Prediction of Lubricants by a General Feed-Forward Neural Network.

    abstract::Modern industrial lubricants are often blended with an assortment of chemical additives to improve the performance of the base stock. Machine learning-based predictive models allow fast and veracious derivation of material properties and facilitate novel and innovative material designs. In this study, we outline the d...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b01068

    authors: Loh GC,Lee HC,Tee XY,Chow PS,Zheng JW

    更新日期:2020-03-23 00:00:00

  • ColBioS-FlavRC: a collection of bioselective flavonoids and related compounds filtered from high-throughput screening outcomes.

    abstract::Flavonoids, the vastest class of natural polyphenols, are extensively investigated for their multiple benefits on human health. Due to their physicochemical or biological properties, many representatives are considered to exhibit low selectivity among various protein targets or to plague high-throughput screening (HTS...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci5002668

    authors: Avram SI,Pacureanu LM,Bora A,Crisan L,Avram S,Kurunczi L

    更新日期:2014-08-25 00:00:00

  • Nonadditivity Analysis.

    abstract::We introduce the statistics behind a novel type of SAR analysis named "nonadditivity analysis". On the basis of all pairs of matched pairs within a given data set, the approach analyzes whether the same transformations between related molecules have the same effect, i.e., whether they are additive. Assuming that the e...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00631

    authors: Kramer C

    更新日期:2019-09-23 00:00:00

  • Tautomer Standardization in Chemical Databases: Deriving Business Rules from Quantum Chemistry.

    abstract::Databases of small, potentially bioactive molecules are ubiquitous across the industry and academia. Designed such that each unique compound should appear only once, the multiplicity of ways in which many compounds can be represented means that these databases require methods for standardizing the representation of ch...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00232

    authors: Baker CM,Kidley NJ,Papachristos K,Hotson M,Carson R,Gravestock D,Pouliot M,Harrison J,Dowling A

    更新日期:2020-08-24 00:00:00