Real external predictivity of QSAR models. Part 2. New intercomparable thresholds for different validation criteria and the need for scatter plot inspection.

Abstract:

:The evaluation of regression QSAR model performance, in fitting, robustness, and external prediction, is of pivotal importance. Over the past decade, different external validation parameters have been proposed: Q(F1)(2), Q(F2)(2), Q(F3)(2), r(m)(2), and the Golbraikh-Tropsha method. Recently, the concordance correlation coefficient (CCC, Lin), which simply verifies how small the differences are between experimental data and external data set predictions, independently of their range, was proposed by our group as an external validation parameter for use in QSAR studies. In our preliminary work, we demonstrated with thousands of simulated models that CCC is in good agreement with the compared validation criteria (except r(m)(2)) using the cutoff values normally applied for the acceptance of QSAR models as externally predictive. In this new work, we have studied and compared the general trends of the various criteria relative to different possible biases (scale and location shifts) in external data distributions, using a wide range of different simulated scenarios. This study, further supported by visual inspection of experimental vs predicted data scatter plots, has highlighted problems related to some criteria. Indeed, if based on the cutoff suggested by the proponent, r(m)(2) could also accept not predictive models in two of the possible biases (location, location plus scale), while in the case of scale shift bias, it appears to be the most restrictive. Moreover, Q(F1)(2) and Q(F2)(2) showed some problems in one of the possible biases (scale shift). This analysis allowed us to also propose recalibrated, and intercomparable for the same data scatter, new thresholds for each criterion in defining a QSAR model as really externally predictive in a more precautionary approach. An analysis of the results revealed that the scatter plot of experimental vs predicted external data must always be evaluated to support the statistical criteria values: in some cases high statistical parameter values could hide models with unacceptable predictions.

journal_name

J Chem Inf Model

authors

Chirico N,Gramatica P

doi

10.1021/ci300084j

subject

Has Abstract

pub_date

2012-08-27 00:00:00

pages

2044-58

issue

8

eissn

1549-9596

issn

1549-960X

journal_volume

52

pub_type

杂志文章
  • Structure-based design and screen of novel inhibitors for class II 3-hydroxy-3-methylglutaryl coenzyme A reductase from Streptococcus pneumoniae.

    abstract::3-Hydroxy-3-methylglutaryl coenzyme A reductase (HMGR) is a primary target in the current clinical treatment of hypercholesterolemia with specific inhibitors of "statin" family. Statins are excellent inhibitors of the class I (human) enzyme but relatively poor inhibitors of the class II enzyme, which are well-known as...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci300163v

    authors: Li D,Gui J,Li Y,Feng L,Han X,Sun Y,Sun T,Chen Z,Cao Y,Zhang Y,Zhou L,Hu X,Ren Y,Wan J

    更新日期:2012-07-23 00:00:00

  • FOG: Fragment Optimized Growth algorithm for the de novo generation of molecules occupying druglike chemical space.

    abstract::An essential feature of all practical de novo molecule generating programs is the ability to focus the potential combinatorial explosion of grown molecules on a desired chemical space. It is a daunting task to balance the generation of new molecules with limitations on growth that produce desired features such as stab...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci9000458

    authors: Kutchukian PS,Lou D,Shakhnovich EI

    更新日期:2009-07-01 00:00:00

  • VMD Store-A VMD Plugin to Browse, Discover, and Install VMD Extensions.

    abstract::Herein we present the VMD Store, an open-source VMD plugin that simplifies the way that users browse, discover, install, update, and uninstall extensions for the Visual Molecular Dynamics (VMD) software. The VMD Store obtains data about all the indexed VMD extensions hosted on GitHub and presents a one-click mechanism...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00739

    authors: Fernandes HS,Sousa SF,Cerqueira NMFSA

    更新日期:2019-11-25 00:00:00

  • Partitioning of Benzoic Acid into 1,2-Dimyristoyl-sn-glycero-3-phosphocholine and Blood-Brain Barrier Mimetic Bilayers.

    abstract::Using an all-atom explicit water model and replica exchange umbrella sampling simulations, we investigated the molecular mechanisms of benzoic acid partitioning into two model lipid bilayers. The first was formed of 1,2-dimyristoyl-sn-glycero-3-phosphocholine (DMPC) lipids, whereas the second was composed of an equimo...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00590

    authors: Siwy CM,Delfing BM,Smith AK,Klimov DK

    更新日期:2020-08-24 00:00:00

  • Search for novel aminoglycosides by combining fragment-based virtual screening and 3D-QSAR scoring.

    abstract::Aminoglycosides are antibiotics targeting the 16S RNA A site of the bacterial ribosome. There have been many efforts directed toward design of their synthetic derivatives, however with only few successes. As RNA binders, aminoglycosides are also a difficult target for computational drug design, since most of the exist...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci800361a

    authors: Setny P,Trylska J

    更新日期:2009-02-01 00:00:00

  • Molecular Oxygen Binding in the Mitochondrial Electron Transfer Flavoprotein.

    abstract::Reactive oxygen species such as superoxide are potentially harmful byproducts of the aerobic metabolism in the inner mitochondrial membrane, and complexes I, II, III of the electron transport chain have been identified as primary sources. The mitochondrial fatty acid b-oxidation pathway may also play a yet uncharacter...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00702

    authors: Husen P,Nielsen C,Martino CF,Solov'yov IA

    更新日期:2019-11-25 00:00:00

  • Tertiary Element Interaction in HIV-1 TAR.

    abstract::HIV-1 replication requires binding to occur between Trans-activation Response Element (TAR) RNA and the TAT protein. This TAR-TAT binding depends on the conformation of TAR, and therapeutic development has attempted to exploit this dynamic behavior. Here we simulate TAR dynamics in the context of mutations inhibiting ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.6b00152

    authors: Krawczyk K,Sim AY,Knapp B,Deane CM,Minary P

    更新日期:2016-09-26 00:00:00

  • Holistic Approach to Partial Covalent Interactions in Protein Structure Prediction and Design with Rosetta.

    abstract::Partial covalent interactions (PCIs) in proteins, which include hydrogen bonds, salt bridges, cation-π, and π-π interactions, contribute to thermodynamic stability and facilitate interactions with other biomolecules. Several score functions have been developed within the Rosetta protein modeling framework that identif...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00398

    authors: Combs SA,Mueller BK,Meiler J

    更新日期:2018-05-29 00:00:00

  • Prediction of synthetic accessibility based on commercially available compound databases.

    abstract::A compound's synthetic accessibility (SA) is an important aspect of drug design, since in some cases computer-designed compounds cannot be synthesized. There have been several reports on SA prediction, most of which have focused on the difficulties of synthetic reactions based on retro-synthesis analyses, reaction dat...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci500568d

    authors: Fukunishi Y,Kurosawa T,Mikami Y,Nakamura H

    更新日期:2014-12-22 00:00:00

  • Interpretation of the binding affinities of PTP1B inhibitors with the MM-GB/SA method and the X-score scoring function.

    abstract::We have studied the binding affinities of a set of 45 small-molecule inhibitors to protein tyrosine phosphatase 1B (PTP1B) through computational approaches. All of these compounds share a common oxalylamino benzoic acid (OBA) moiety. The complex structure of each compound was modeled by using the GOLD program plus the...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci8004429

    authors: Zhang X,Li X,Wang R

    更新日期:2009-04-01 00:00:00

  • Baseline Model for Predicting Protein-Ligand Unbinding Kinetics through Machine Learning.

    abstract::Derivation of structure-kinetics relationships can help rational design and development of new small-molecule drug candidates with desired residence times. Efforts are now being directed toward the development of efficient computational methods. Currently, there is a lack of solid, high-throughput binding kinetics pre...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00450

    authors: Amangeldiuly N,Karlov D,Fedorov MV

    更新日期:2020-12-28 00:00:00

  • Combined 3D-QSAR modeling and molecular docking study on indolinone derivatives as inhibitors of 3-phosphoinositide-dependent protein kinase-1.

    abstract::3-Phosphoinositide-dependent protein kinase-1 (PDK1) is a promising target for developing novel anticancer drugs. In order to understand the structure-activity correlation of indolinone-based PDK1 inhibitors, we have carried out a combined molecular docking and three-dimensional quantitative structure-activity relatio...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci800147v

    authors: AbdulHameed MD,Hamza A,Liu J,Zhan CG

    更新日期:2008-09-01 00:00:00

  • Exploring inhibitor release pathways in histone deacetylases using random acceleration molecular dynamics simulations.

    abstract::Molecular channel exploration perseveres to be the prominent solution for eliciting structure and accessibility of active site and other internal spaces of macromolecules. The volume and silhouette characterization of these channels provides answers for the issues of substrate access and ligand swapping between the ob...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci200584f

    authors: Kalyaanamoorthy S,Chen YP

    更新日期:2012-02-27 00:00:00

  • Comparative Assessment of Scoring Functions: The CASF-2016 Update.

    abstract::In structure-based drug design, scoring functions are often employed to evaluate protein-ligand interactions. A variety of scoring functions have been developed so far, and thus, some objective benchmarks are desired for assessing their strength and weakness. The comparative assessment of scoring functions (CASF) benc...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.8b00545

    authors: Su M,Yang Q,Du Y,Feng G,Liu Z,Li Y,Wang R

    更新日期:2019-02-25 00:00:00

  • In silico renal clearance model using classical Volsurf approach.

    abstract::A data set of 130 diverse compounds containing both central nervous system (CNS) and non-CNS drugs was used to generate a renal clearance model using a classical Volsurf approach. Percentage renal clearance data was used as a biological input. The score plots obtained from principal component analysis and partial leas...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci0503309

    authors: Doddareddy MR,Cho YS,Koh HY,Kim DH,Pae AN

    更新日期:2006-05-01 00:00:00

  • Molecular modeling of potential anticancer agents from African medicinal plants.

    abstract::Naturally occurring anticancer compounds represent about half of the chemotherapeutic drugs which have been put in the market against cancer until date. Computer-based or in silico virtual screening methods are often used in lead/hit discovery protocols. In this study, the "drug-likeness" of ~400 compounds from Africa...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci5003697

    authors: Ntie-Kang F,Nwodo JN,Ibezim A,Simoben CV,Karaman B,Ngwa VF,Sippl W,Adikwu MU,Mbaze LM

    更新日期:2014-09-22 00:00:00

  • Virtual exploration of the chemical universe up to 11 atoms of C, N, O, F: assembly of 26.4 million structures (110.9 million stereoisomers) and analysis for new ring systems, stereochemistry, physicochemical properties, compound classes, and drug discove

    abstract::All molecules of up to 11 atoms of C, N, O, and F possible under consideration of simple valency, chemical stability, and synthetic feasibility rules were generated and collected in a database (GDB). GDB contains 26.4 million molecules (110.9 million stereoisomers), including three- and four-membered rings and triple ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci600423u

    authors: Fink T,Reymond JL

    更新日期:2007-03-01 00:00:00

  • Molecular Dynamics Simulations of Substrate Release from Trypanosoma cruzi UDP-Galactopyranose Mutase.

    abstract::The enzyme UDP-galactopyranose mutase (UGM) represents a promising drug target for the treatment of infections with Trypanosoma cruzi. We have computed the Potential of Mean Force for the release of UDP-galactopyranose from UGM, using Umbrella Sampling simulations. The simulations revealed the conformational changes t...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.8b00675

    authors: Cossio-Pérez R,Pierdominici-Sottile G,Sobrado P,Palma J

    更新日期:2019-02-25 00:00:00

  • Identification of Potential Small Molecule Binding Pockets in p38α MAP Kinase.

    abstract::Given the essential role played by protein kinases in regulating cellular pathways, their dysregulation can result in the onset and/or progression of various human diseases. Structural analysis of diverse protein kinases suggests that these proteins exhibit a remarkable plasticity that allows them to adopt distinct co...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00439

    authors: Gomez-Gutierrez P,Rubio-Martinez J,Perez JJ

    更新日期:2017-10-23 00:00:00

  • GPCR-Bench: A Benchmarking Set and Practitioners' Guide for G Protein-Coupled Receptor Docking.

    abstract::Virtual screening is routinely used to discover new ligands and in particular new ligand chemotypes for G protein-coupled receptors (GPCRs). To prepare for a virtual screen, we often tailor a docking protocol that will enable us to select the best candidates for further screening. To aid this, we created GPCR-Bench, a...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.5b00660

    authors: Weiss DR,Bortolato A,Tehan B,Mason JS

    更新日期:2016-04-25 00:00:00

  • ColBioS-FlavRC: a collection of bioselective flavonoids and related compounds filtered from high-throughput screening outcomes.

    abstract::Flavonoids, the vastest class of natural polyphenols, are extensively investigated for their multiple benefits on human health. Due to their physicochemical or biological properties, many representatives are considered to exhibit low selectivity among various protein targets or to plague high-throughput screening (HTS...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci5002668

    authors: Avram SI,Pacureanu LM,Bora A,Crisan L,Avram S,Kurunczi L

    更新日期:2014-08-25 00:00:00

  • Ligand- and structure-based virtual screening for clathrodin-derived human voltage-gated sodium channel modulators.

    abstract::Voltage-gated sodium channels (VGSC) are attractive targets for drug discovery because of the broad therapeutic potential of their modulators. On the basis of the structure of marine alkaloid clathrodin, we have recently discovered novel subtype-selective VGSC modulators I and II that were used as starting points for ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci400505e

    authors: Tomašić T,Hartzoulakis B,Zidar N,Chan F,Kirby RW,Madge DJ,Peigneur S,Tytgat J,Kikelj D

    更新日期:2013-12-23 00:00:00

  • Isomerization and Decomposition of 2-Methylfuran with External Forces.

    abstract::The primary goal of this project was to evaluate the performance of the Standard and Enforced Geometry Optimization (SEGO) method which we have recently developed. The SEGO method has been designed for an automatic location of multiple minima on the molecular Potential Energy Surface (PES), and its usefulness has been...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00352

    authors: Brzyska A,Woliński K

    更新日期:2019-08-26 00:00:00

  • Molecular Simulation of αvβ6 Integrin Inhibitors.

    abstract::The urgent need for new treatments for the chronic lung disease idiopathic pulmonary fibrosis (IPF) motivates research into antagonists of the RGD binding integrin αvβ6, a protein linked to the initiation and progression of the disease. Molecular dynamics (MD) simulations of αvβ6 in complex with its natural ligand, pr...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00254

    authors: Guest EE,Oatley SA,Macdonald SJF,Hirst JD

    更新日期:2020-11-23 00:00:00

  • An Efficient Lossless Compression Algorithm for Trajectories of Atom Positions and Volumetric Data.

    abstract::We present our newly developed and highly efficient lossless compression algorithm for trajectories of atom positions and volumetric data. The algorithm is designed as a two-step approach. In the first step, efficient polynomial extrapolation schemes reduce the information entropy of the data by exploiting both spatia...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.8b00501

    authors: Brehm M,Thomas M

    更新日期:2018-10-22 00:00:00

  • Reaction site mapping of xenobiotic biotransformations.

    abstract::Predictive metabolism methods can be used in drug discovery projects to enhance the understanding of structure-metabolism relationships. The present study uses data mining methods to exploit biotransformation data that have been recorded in the MDL Metabolite database. Reacting center fingerprints were derived from a ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci600376q

    authors: Boyer S,Arnby CH,Carlsson L,Smith J,Stein V,Glen RC

    更新日期:2007-03-01 00:00:00

  • GalaxyGPCRloop: Template-Based and Ab Initio Structure Sampling of the Extracellular Loops of G-Protein-Coupled Receptors.

    abstract::The second extracellular loops (ECL2s) of G-protein-coupled receptors (GPCRs) are often involved in GPCR functions, and their structures have important implications in drug discovery. However, structure prediction of ECL2 is difficult because of its long length and the structural diversity among different GPCRs. In th...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.8b00148

    authors: Won J,Lee GR,Park H,Seok C

    更新日期:2018-06-25 00:00:00

  • GalaxyDock: protein-ligand docking with flexible protein side-chains.

    abstract::An important issue in developing protein-ligand docking methods is how to incorporate receptor flexibility. Consideration of receptor flexibility using an ensemble of precompiled receptor conformations or by employing an effectively enlarged binding pocket has been reported to be useful. However, direct consideration ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci300342z

    authors: Shin WH,Seok C

    更新日期:2012-12-21 00:00:00

  • RosENet: Improving Binding Affinity Prediction by Leveraging Molecular Mechanics Energies with an Ensemble of 3D Convolutional Neural Networks.

    abstract::The worldwide increase and proliferation of drug resistant microbes, coupled with the lag in new drug development, represents a major threat to human health. In order to reduce the time and cost for exploring the chemical search space, drug discovery increasingly relies on computational biology approaches. One key ste...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00075

    authors: Hassan-Harrirou H,Zhang C,Lemmin T

    更新日期:2020-06-22 00:00:00

  • Polarizable Force Field for Molecular Ions Based on the Classical Drude Oscillator.

    abstract::Development of accurate force field parameters for molecular ions in the context of a polarizable energy function based on the classical Drude oscillator is a crucial step toward an accurate polarizable model for modeling and simulations of biological macromolecules. Toward this goal we have undertaken a hierarchical ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.8b00132

    authors: Lin FY,Lopes PEM,Harder E,Roux B,MacKerell AD Jr

    更新日期:2018-05-29 00:00:00