Determining the validity of a QSAR model--a classification approach.

Abstract:

The determination of the validity of a QSAR model when applied to new compounds is an important concern in the field of QSAR and QSPR modeling. Various scoring techniques can be applied to specific types of models. We present a technique with which we can state whether a new compound will be well predicted by a previously built QSAR model. In this study we focus on linear regression models only, though the technique is general and could also be applied to other types of quantitative models. Our technique is based on a classification method that divides regression residuals from a previously generated model into a good class and a bad class and then builds a classifier based on this division. The trained classifier is then used to determine the class of the residual for a new compound. We investigated the performance of a variety of classifiers, both linear and nonlinear. The technique was tested on two data sets from the literature and a hand-built data set. The data sets selected covered both physical and biological properties and also presented the methodology with quantitative regression models of varying quality. The results indicate that this technique can determine whether a new compound will be well or poorly predicted with weighted success rates ranging from 73% to 94% for the best classifier.
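The workflow described in the abstract (fit a regression model, split its residuals into a good and a bad class, train a classifier on that split, then classify a new compound) can be illustrated with a small toy sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the single synthetic descriptor, the residual cutoff of 1.0, and the use of a 3-nearest-neighbor vote as a stand-in for the classifiers the authors evaluated. The function names `fit_linear`, `residual_classes`, and `knn_predict` are hypothetical.

```python
# Toy sketch of residual classification; data, cutoff, and classifier are assumptions.

def fit_linear(xs, ys):
    """Ordinary least squares for one descriptor: y ~ a + b * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def residual_classes(xs, ys, model, cutoff):
    """Label each training compound by the size of its regression residual."""
    intercept, slope = model
    return [
        "good" if abs(y - (intercept + slope * x)) <= cutoff else "bad"
        for x, y in zip(xs, ys)
    ]


def knn_predict(train_x, train_labels, x_new, k=3):
    """Majority vote among the k training compounds closest in descriptor space."""
    nearest = sorted(range(len(train_x)), key=lambda i: abs(train_x[i] - x_new))[:k]
    votes = [train_labels[i] for i in nearest]
    return max(set(votes), key=votes.count)


# Synthetic training set: the linear trend holds for x < 7 and degrades above it.
xs = [0.5 * i for i in range(20)]
noise = [(0.1 if x < 7 else 3.0) * (1 if i % 2 == 0 else -1) for i, x in enumerate(xs)]
ys = [1.0 + 2.0 * x + e for x, e in zip(xs, noise)]

model = fit_linear(xs, ys)                             # previously built regression model
labels = residual_classes(xs, ys, model, cutoff=1.0)   # good/bad division of residuals

# Classify the expected residual of two hypothetical new compounds.
print(knn_predict(xs, labels, 2.0))
print(knn_predict(xs, labels, 9.0))
```

In this toy setting the first query lies in the region where the regression fits well and the second in the region where it does not, so the classifier flags only the second as likely to be poorly predicted. A real application would use multiple descriptors and a cutoff chosen from the residual distribution.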

journal_name

J Chem Inf Model

authors

Guha R,Jurs PC

doi

10.1021/ci0497511

pub_date

2005-01-01 00:00:00

pages

65-73

issue

1

eissn

1549-960X

issn

1549-9596

journal_volume

45

pub_type

Journal Article
  • Getting Docking into Shape Using Negative Image-Based Rescoring.

    abstract::The failure of default scoring functions to ensure virtual screening enrichment is a persistent problem for the molecular docking algorithms used in structure-based drug discovery. To remedy this problem, elaborate rescoring and postprocessing schemes have been developed with a varying degree of success, specificity, ...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.9b00383

    authors: Kurkinen ST,Lätti S,Pentikäinen OT,Postila PA

    update_date:2019-08-26 00:00:00

  • Technique for energy decomposition in the study of "receptor-ligand" complexes.

    abstract::A new methodology to describe the interactions in "receptor-ligand" complexes is presented. The methodology is based on a combination of the 3D/4D QSAR BiS/MC and CoCon algorithms. The first algorithm performs the restricted docking of compounds to receptor pockets. The second determines the relationships between the ...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci800405n

    authors: Potemkin VA,Pogrebnoy AA,Grishina MA

    update_date:2009-06-01 00:00:00

  • Drug effect prediction by polypharmacology-based interaction profiling.

    abstract::Most drugs exert their effects via multitarget interactions, as hypothesized by polypharmacology. While these multitarget interactions are responsible for the clinical effect profiles of drugs, current methods have failed to uncover the complex relationships between them. Here, we introduce an approach which is able t...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci2002022

    authors: Simon Z,Peragovics A,Vigh-Smeller M,Csukly G,Tombor L,Yang Z,Zahoránszky-Kohalmi G,Végner L,Jelinek B,Hári P,Hetényi C,Bitter I,Czobor P,Málnási-Csizmadia A

    update_date:2012-01-23 00:00:00

  • Factors affecting d-block metal-ligand bond lengths: toward an automated library of molecular geometry for metal complexes.

    abstract::Metal-ligand (M-L) bond lengths for a range of ligands (carboxylates, chlorides, pyridines, water, tertiary phosphines, and alkenes) and a variety of metals have been retrieved from the Cambridge Structural Database, CSD. Analysis of the factors which affect M-L bond lengths (for example, ligand coordination mode, oxi...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci0500785

    authors: Harris SE,Orpen AG,Bruno IJ,Taylor R

    update_date:2005-11-01 00:00:00

  • Protein Preparation Automatic Protocol for High-Throughput Inverse Virtual Screening: Accelerating the Target Identification by Computational Methods.

    abstract::Structure-based virtual screening is highly used in the early stages of drug discovery to identify new putative lead compounds for a given target. However, when a small molecule elicits a biological effect, but its target is unknown, or the side effects it causes arise from its undesired interaction with unknown count...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.9b00428

    authors: De Vita S,Lauro G,Ruggiero D,Terracciano S,Riccio R,Bifulco G

    update_date:2019-11-25 00:00:00

  • Evaluation of Generalized Born Models for Large Scale Affinity Prediction of Cyclodextrin Host-Guest Complexes.

    abstract::Binding affinity prediction with implicit solvent models remains a challenge in virtual screening for drug discovery. In order to assess the predictive power of implicit solvent models in docking techniques with Amber scoring, three generalized Born models (GBHCT, GBOBCI, and GBOBCII) available in Dock 6.7 were utiliz...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.6b00418

    authors: Zhang H,Yin C,Yan H,van der Spoel D

    update_date:2016-10-24 00:00:00

  • Simulation of 2D NMR Spectra of Carbohydrates Using GODESS Software.

    abstract::Glycan Optimized Dual Empirical Spectrum Simulation (GODESS) is a web service, which has been recently shown to be one of the most accurate tools for simulation of (1)H and (13)C 1D NMR spectra of natural carbohydrates and their derivatives. The new version of GODESS supports visualization of the simulated (1)H and (1...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.6b00083

    authors: Kapaev RR,Toukach PV

    update_date:2016-06-27 00:00:00

  • Protein kinases: docking and homology modeling reliability.

    abstract::A database of about 700 high-resolution kinase structures was used to test the reliability of 17 docking procedures (using six docking software packages) by means of self- and cross-docking studies. The analysis of about 80 000 docking calculations suggests that the docking of an unknown ligand into a kinase has a pro...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci100161z

    authors: Tuccinardi T,Botta M,Giordano A,Martinelli A

    update_date:2010-08-23 00:00:00

  • Develop and test a solvent accessible surface area-based model in conformational entropy calculations.

    abstract::It is of great interest in modern drug design to accurately calculate the free energies of protein-ligand or nucleic acid-ligand binding. MM-PBSA (molecular mechanics Poisson-Boltzmann surface area) and MM-GBSA (molecular mechanics generalized Born surface area) have gained popularity in this field. For both methods, ...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci300064d

    authors: Wang J,Hou T

    update_date:2012-05-25 00:00:00

  • Homology modeling and docking evaluation of aminergic G protein-coupled receptors.

    abstract::We report the development of homology models of dopamine (D(2), D(3), and D(4)), serotonin (5-HT(1B), 5-HT(2A), 5-HT(2B), and 5-HT(2C)), histamine (H(1)), and muscarinic (M(1)) receptors, based on the high-resolution structure of the beta(2)-adrenergic receptor. The homology models were built and refined using Prime. ...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci900444q

    authors: McRobb FM,Capuano B,Crosby IT,Chalmers DK,Yuriev E

    update_date:2010-04-26 00:00:00

  • Identifying promising compounds in drug discovery: genetic algorithms and some new statistical techniques.

    abstract::Throughout the drug discovery process, discovery teams are compelled to use statistics for making decisions using data from a variety of inputs. For instance, teams are asked to prioritize compounds for subsequent stages of the drug discovery process, given results from multiple screens. To assist in the prioritizatio...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci600556v

    authors: Mandal A,Johnson K,Wu CF,Bornemeier D

    update_date:2007-05-01 00:00:00

  • From Brazil to Germany: Challenges and Advantages.

    abstract::Moving to a new country, with a different culture and a new environment, is not an easy decision. In this perspective, I present some reasons that made me, a Brazilian computational biochemist, move abroad to do postdoctoral research and some of the challenges I faced before and after moving. ...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.9b00764

    authors: Nunes-Alves A

    update_date:2020-02-24 00:00:00

  • Elements of nucleotide specificity in the Trypanosoma brucei mitochondrial RNA editing enzyme RET2.

    abstract::The causative agent of African sleeping sickness, Trypanosoma brucei, undergoes an unusual mitochondrial RNA editing process that is essential for its survival. RNA editing terminal uridylyl transferase 2 of T. brucei (TbRET2) is an indispensable component of the editosome machinery that performs this editing. TbR...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci3001327

    authors: Demir Ö,Amaro RE

    update_date:2012-05-25 00:00:00

  • Prediction of cytochrome P450 xenobiotic metabolism: tethered docking and reactivity derived from ligand molecular orbital analysis.

    abstract::Metabolism of xenobiotic and endogenous compounds is frequently complex, not completely elucidated, and therefore often ambiguous. The prediction of sites of metabolism (SoM) can be particularly helpful as a first step toward the identification of metabolites, a process especially relevant to drug discovery. This pape...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci400058s

    authors: Tyzack JD,Williamson MJ,Torella R,Glen RC

    update_date:2013-06-24 00:00:00

  • Ligand-Based Discovery of a New Scaffold for Allosteric Modulation of the μ-Opioid Receptor.

    abstract::With the hope of discovering effective analgesics with fewer side effects, attention has recently shifted to allosteric modulators of the opioid receptors. In the past two years, the first chemotypes of positive or silent allosteric modulators (PAMs or SAMs, respectively) of μ- and δ-opioid receptor types have been re...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.5b00388

    authors: Bisignano P,Burford NT,Shang Y,Marlow B,Livingston KE,Fenton AM,Rockwell K,Budenholzer L,Traynor JR,Gerritz SW,Alt A,Filizola M

    update_date:2015-09-28 00:00:00

  • FragPELE: Dynamic Ligand Growing within a Binding Site. A Novel Tool for Hit-To-Lead Drug Design.

    abstract::The early stages of drug discovery rely on hit-to-lead programs, where initial hits undergo partial optimization to improve binding affinities for their biological target. This is an expensive and time-consuming process, requiring multiple iterations of trial and error designs, an ideal scenario for applying computer ...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.9b00938

    authors: Perez C,Soler D,Soliva R,Guallar V

    update_date:2020-03-23 00:00:00

  • Database of Nuclear Independent Chemical Shifts (NICS) versus NICSZZ of Polycyclic Aromatic Hydrocarbons (PAHs).

    abstract::In the present contribution, we have developed a database, called the FAR-database, where the acronym FAR stands for Fused Aromatic Rings, which presents the results of nuclear independent chemical shifts calculations, NICS(0), NICS(1), NICS(0)ZZ, and NICS(1)ZZ, of 660 neutral benzenoid-PAHs and cyclopenta-fused PAHs....

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.9b00909

    authors: Alvarez-Ramírez F,Ruiz-Morales Y

    update_date:2020-02-24 00:00:00

  • Modeling oral rat chronic toxicity.

    abstract::The chronic toxicity is fundamental for toxicological risk assessment, but its correlation with the chemical structures has been studied only little. This is partly due to the complexity of such an experimental test that embraces a plethora of different biological effects and mechanisms of action, making (Q)SAR studie...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci8001974

    authors: Mazzatorta P,Estevez MD,Coulet M,Schilter B

    update_date:2008-10-01 00:00:00

  • Novel inhibitors of trihydroxynaphthalene reductase with antifungal activity identified by ligand-based and structure-based virtual screening.

    abstract::Curvularia lunata is a dark pigmented fungus that is the causative agent of several diseases in plants and in both immunodeficient and immunocompetent patients. 1,8-Dihydroxynaphthalene-melanin is found in the cell wall of C. lunata and is believed to be the important virulence factor of dematiaceous fungi. Trihydroxy...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci2001499

    authors: Brunskole Svegelj M,Turk S,Brus B,Lanisnik Rizner T,Stojan J,Gobec S

    update_date:2011-07-25 00:00:00

  • Improved Scaffold Hopping in Ligand-Based Virtual Screening Using Neural Representation Learning.

    abstract::Deep learning has demonstrated significant potential in advancing state of the art in many problem domains, especially those benefiting from automated feature extraction. Yet, the methodology has seen limited adoption in the field of ligand-based virtual screening (LBVS) as traditional approaches typically require lar...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.0c00622

    authors: Stojanović L,Popović M,Tijanić N,Rakočević G,Kalinić M

    update_date:2020-10-26 00:00:00

  • Equally Weighted Multiscale Elastic Network Model and Its Comparison with Traditional and Parameter-Free Models.

    abstract::Dynamical properties of proteins play an essential role in their function exertion. The elastic network model (ENM) is an effective and efficient tool in characterizing the intrinsic dynamical properties encoded in biomacromolecule structures. The Gaussian network model (GNM) and anisotropic network model (ANM) are th...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.0c01178

    authors: Gong W,Liu Y,Zhao Y,Wang S,Han Z,Li C

    update_date:2021-01-26 00:00:00

  • RosENet: Improving Binding Affinity Prediction by Leveraging Molecular Mechanics Energies with an Ensemble of 3D Convolutional Neural Networks.

    abstract::The worldwide increase and proliferation of drug resistant microbes, coupled with the lag in new drug development, represents a major threat to human health. In order to reduce the time and cost for exploring the chemical search space, drug discovery increasingly relies on computational biology approaches. One key ste...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.0c00075

    authors: Hassan-Harrirou H,Zhang C,Lemmin T

    update_date:2020-06-22 00:00:00

  • Enrichment factor analyses on G-protein coupled receptors with known crystal structure.

    abstract::G-protein coupled receptors (GPCRs) are highly relevant drug targets. Four GPCRs with known crystal structure were analyzed with docking (AutoDock4) and postdocking (MM-PBSA) in order to evaluate the ability to recognize known antagonists from a larger database of molecular decoys and to predict correct binding modes....

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci4000745

    authors: Anighoro A,Rastelli G

    update_date:2013-04-22 00:00:00

  • Study of Data Set Modelability: Modelability, Rivality, and Weighted Modelability Indexes.

    abstract::The knowledge of the capacity of a data set to be modeled in the first stages of the building of quantitative structure-activity relationship (QSAR) prediction models is an important issue because it might reduce the effort and time necessary to select or reject data sets and in refining the data set's composition. Th...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.8b00188

    authors: Luque Ruiz I,Gómez-Nieto MÁ

    update_date:2018-09-24 00:00:00

  • Comparative Dynamics and Functional Mechanisms of the CYP17A1 Tunnels Regulated by Ligand Binding.

    abstract::As an important member of cytochrome P450 (CYP) enzymes, CYP17A1 is a dual-function monooxygenase with a critical role in the synthesis of many human steroid hormones, making it an attractive therapeutic target. The emerging structural information about CYP17A1 and the growing number of inhibitors for these enzymes ca...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.0c00447

    authors: Xiao F,Song X,Tian P,Gan M,Verkhivker GM,Hu G

    update_date:2020-07-27 00:00:00

  • Identification of Enzyme Genes Using Chemical Structure Alignments of Substrate-Product Pairs.

    abstract::Although there are several databases that contain data on many metabolites and reactions in biochemical pathways, there is still a big gap in the numbers between experimentally identified enzymes and metabolites. It is supposed that many catalytic enzyme genes are still unknown. Although there are previous studies tha...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.5b00216

    authors: Moriya Y,Yamada T,Okuda S,Nakagawa Z,Kotera M,Tokimatsu T,Kanehisa M,Goto S

    update_date:2016-03-28 00:00:00

  • RED: a set of molecular descriptors based on Renyi entropy.

    abstract::New molecular descriptors, RED (Renyi entropy descriptors), based on the generalized entropies introduced by Renyi are presented. Topological descriptors based on molecular features have proven to be useful for describing molecular profiles. Renyi entropy is used as a variability measure to contract a feature-pair dis...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci900275w

    authors: Delgado-Soler L,Toral R,Tomás MS,Rubio-Martinez J

    update_date:2009-11-01 00:00:00

  • Performance evaluation of 2D fingerprint and 3D shape similarity methods in virtual screening.

    abstract::Virtual screening (VS) can be accomplished in either ligand- or structure-based methods. In recent times, an increasing number of 2D fingerprint and 3D shape similarity methods have been used in ligand-based VS. To evaluate the performance of these ligand-based methods, retrospective VS was performed on a tailored dir...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci300030u

    authors: Hu G,Kuang G,Xiao W,Li W,Liu G,Tang Y

    update_date:2012-05-25 00:00:00

  • Improved CoMFA modeling by optimization of settings.

    abstract::The possibility of improving the predictive ability of comparative molecular field analysis (CoMFA) by settings optimization has been evaluated to show that CoMFA predictive ability can be improved. Ten different CoMFA settings are evaluated, producing a total of 6120 models. This method has been applied to nine diffe...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci049612j

    authors: Peterson SD,Schaal W,Karlén A

    update_date:2006-01-01 00:00:00

  • Template CoMFA: the 3D-QSAR Grail?

    abstract::Template CoMFA, a novel alignment methodology for training or test set structures in 3D-QSAR, is introduced. Its two most significant advantages are its complete automation and its ability to derive a single combined model from multiple structural series affecting a biological target. Its only two inputs are one or mo...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci400696v

    authors: Cramer RD,Wendt B

    update_date:2014-02-24 00:00:00