Statistical confidence for variable selection in QSAR models via Monte Carlo cross-validation.

Abstract:

:A new variable selection wrapper method named the Monte Carlo variable selection (MCVS) method was developed utilizing the framework of the Monte Carlo cross-validation (MCCV) approach. The MCVS method reports the variable selection results in the most conventional and common measure of statistical hypothesis testing, the P-values, thus allowing for a clear and simple statistical interpretation of the results. The MCVS method is equally applicable to the multiple-linear-regression (MLR)-based or non-MLR-based quantitative structure-activity relationship (QSAR) models. The method was applied to blood-brain barrier (BBB) permeation and human intestinal absorption (HIA) QSAR problems using MLR to demonstrate the workings of the new approach. Starting from more than 1600 molecular descriptors, only two (TPSA(NO) and ALOGP) yielded acceptably low P-values for the BBB and HIA problems, respectively. The new method has been implemented in the QSAR-BENCH v2 program, which is freely available (including its Java source code) from www.dmitrykonovalov.org for academic use.

journal_name

J Chem Inf Model

authors

Konovalov DA,Sim N,Deconinck E,Vander Heyden Y,Coomans D

doi

10.1021/ci700283s

subject

Has Abstract

pub_date

2008-02-01 00:00:00

pages

370-83

issue

2

eissn

1549-9596

issn

1549-960X

journal_volume

48

pub_type

杂志文章
  • Development of an informatics platform for therapeutic protein and peptide analytics.

    abstract::The momentum gained by research on biologics has not been met yet with equal thrust on the informatics side. There is a noticeable lack of software for data management that empowers the bench scientists working on the development of biologic therapeutics. SARvision|Biologics is a tool to analyze data associated with b...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci400333x

    authors: Hansen MR,Villar HO,Feyfant E

    更新日期:2013-10-28 00:00:00

  • Benchmarking of Semiempirical Quantum-Mechanical Methods on Systems Relevant to Computer-Aided Drug Design.

    abstract::The semiempirical quantum mechanical (SQM) methods used in drug design are commonly parametrized and tested on data sets of systems that may not be representative models for drug-biomolecule interactions in terms of both size and chemical composition. This is addressed here with a new benchmark data set, PLF547, deriv...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b01171

    authors: Kříž K,Řezáč J

    更新日期:2020-03-23 00:00:00

  • Interpretation of the binding affinities of PTP1B inhibitors with the MM-GB/SA method and the X-score scoring function.

    abstract::We have studied the binding affinities of a set of 45 small-molecule inhibitors to protein tyrosine phosphatase 1B (PTP1B) through computational approaches. All of these compounds share a common oxalylamino benzoic acid (OBA) moiety. The complex structure of each compound was modeled by using the GOLD program plus the...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci8004429

    authors: Zhang X,Li X,Wang R

    更新日期:2009-04-01 00:00:00

  • Ligand-based virtual screening approach using a new scoring function.

    abstract::In this study, we aimed to develop a new ligand-based virtual screening approach using an effective shape-overlapping procedure and a more robust scoring function (denoted by the HWZ score for convenience). The HWZ score-based virtual screening approach was tested against the compounds for 40 protein targets available...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci200617d

    authors: Hamza A,Wei NN,Zhan CG

    更新日期:2012-04-23 00:00:00

  • Automated pharmacophore query optimization with genetic algorithms - a case study using the MC4R system.

    abstract::Due to the recent availability of high quality small molecule databases, such as ZINC and PubChem,1,2 virtual screening is playing an even more important role in identifying biologically relevant molecules in drug discovery campaigns. The success of pharmacophore-based virtual screening (PBVS) relies largely on the ac...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci700089w

    authors: Jia L,Zou J,So SS,Sun H

    更新日期:2007-07-01 00:00:00

  • Retrospect and Prospect of Single Particle Cryo-Electron Microscopy: The Class of Integral Membrane Proteins as an Example.

    abstract::A giant technological leap in the field of cryo-electron microscopy (cryo-EM) has assured the achievement of near-atomic resolution structures of biological macromolecules. As a recognition of this accomplishment, the Nobel Prize in Chemistry was awarded in 2017 to Jacques Dubochet, Joachim Frank, and Richard Henderso...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b01015

    authors: Akbar S,Mozumder S,Sengupta J

    更新日期:2020-05-26 00:00:00

  • Structure-Based Rational Design of Novel Inhibitors Against Fructose-1,6-Bisphosphate Aldolase from Candida albicans.

    abstract::Class II fructose-1,6-bisphosphate aldolases (FBA-II) are attractive new targets for the discovery of drugs to combat invasive fungal infection, because they are absent in animals and higher plants. Although several FBA-II inhibitors have been reported, none of these inhibitors exhibit antifungal effect so far. In thi...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.6b00763

    authors: Han X,Zhu X,Hong Z,Wei L,Ren Y,Wan F,Zhu S,Peng H,Guo L,Rao L,Feng L,Wan J

    更新日期:2017-06-26 00:00:00

  • Predicted Biological Activity of Purchasable Chemical Space.

    abstract::Whereas 400 million distinct compounds are now purchasable within the span of a few weeks, the biological activities of most are unknown. To facilitate access to new chemistry for biology, we have combined the Similarity Ensemble Approach (SEA) with the maximum Tanimoto similarity to the nearest bioactive to predict a...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00316

    authors: Irwin JJ,Gaskins G,Sterling T,Mysinger MM,Keiser MJ

    更新日期:2018-01-22 00:00:00

  • Determining the validity of a QSAR model--a classification approach.

    abstract::The determination of the validity of a QSAR model when applied to new compounds is an important concern in the field of QSAR and QSPR modeling. Various scoring techniques can be applied to specific types of models. We present a technique with which we can state whether a new compound will be well predicted by a previo...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci0497511

    authors: Guha R,Jurs PC

    更新日期:2005-01-01 00:00:00

  • Improved CoMFA modeling by optimization of settings.

    abstract::The possibility of improving the predictive ability of comparative molecular field analysis (CoMFA) by settings optimization has been evaluated to show that CoMFA predictive ability can be improved. Ten different CoMFA settings are evaluated, producing a total of 6120 models. This method has been applied to nine diffe...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci049612j

    authors: Peterson SD,Schaal W,Karlén A

    更新日期:2006-01-01 00:00:00

  • Baseline Model for Predicting Protein-Ligand Unbinding Kinetics through Machine Learning.

    abstract::Derivation of structure-kinetics relationships can help rational design and development of new small-molecule drug candidates with desired residence times. Efforts are now being directed toward the development of efficient computational methods. Currently, there is a lack of solid, high-throughput binding kinetics pre...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00450

    authors: Amangeldiuly N,Karlov D,Fedorov MV

    更新日期:2020-12-28 00:00:00

  • Training a scoring function for the alignment of small molecules.

    abstract::A comprehensive data set of aligned ligands with highly similar binding pockets from the Protein Data Bank has been built. Based on this data set, a scoring function for recognizing good alignment poses for small molecules has been developed. This function is based on atoms and hydrogen-bond projected features. The co...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci100227h

    authors: Chan SL,Labute P

    更新日期:2010-09-27 00:00:00

  • ReFlex3D: Refined Flexible Alignment of Molecules Using Shape and Electrostatics.

    abstract::We present an algorithm, ReFlex3D, for the refinement of flexible molecular alignments based on their three-dimensional shape and electrostatic properties. The algorithm is designed to be used with fast conformer generators to refine an initial overlay between two molecules and thus to obtain improved overlaps as judg...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00618

    authors: Schmidt TC,Cosgrove DA,Boström J

    更新日期:2018-04-23 00:00:00

  • Benchmark data set for in silico prediction of Ames mutagenicity.

    abstract::Up to now, publicly available data sets to build and evaluate Ames mutagenicity prediction tools have been very limited in terms of size and chemical space covered. In this report we describe a new unique public Ames mutagenicity data set comprising about 6500 nonconfidential compounds (available as SMILES strings and...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci900161g

    authors: Hansen K,Mika S,Schroeter T,Sutter A,ter Laak A,Steger-Hartmann T,Heinrich N,Müller KR

    更新日期:2009-09-01 00:00:00

  • Scaling predictive modeling in drug development with cloud computing.

    abstract::Growing data sets with increased time for analysis is hampering predictive modeling in drug discovery. Model building can be carried out on high-performance computer clusters, but these can be expensive to purchase and maintain. We have evaluated ligand-based modeling on cloud computing resources where computations ar...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci500580y

    authors: Moghadam BT,Alvarsson J,Holm M,Eklund M,Carlsson L,Spjuth O

    更新日期:2015-01-26 00:00:00

  • Direct Observation of β-Barrel Intermediates in the Self-Assembly of Toxic SOD128-38 and Absence in Nontoxic Glycine Mutants.

    abstract::Soluble low-molecular-weight oligomers formed during the early stage of amyloid aggregation are considered the major toxic species in amyloidosis. The structure-function relationship between oligomeric assemblies and the cytotoxicity in amyloid diseases are still elusive due to the heterogeneous and transient nature o...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c01319

    authors: Sun Y,Huang J,Duan X,Ding F

    更新日期:2021-01-14 00:00:00

  • Coordination of Na(+) by monoamine ligands in dopamine, norepinephrine, and serotonin transporters.

    abstract::The reuptake of neurotransmitters by dopamine, norepinephrine, and serotonin transporters during neuronal transmission requires a sodium gradient. An "ionic mode" of binding proposes that aspartate anchors the ligand's positive charge but ignores the direct role of sodium in ligand binding seen in the only representat...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci700255d

    authors: Xhaard H,Backström V,Denessiouk K,Johnson MS

    更新日期:2008-07-01 00:00:00

  • Full and partial agonism of ionotropic glutamate receptors indicated by molecular dynamics simulations.

    abstract::Ionotropic glutamate receptors (iGluRs) are synaptic proteins that facilitate signal transmission in the central nervous system. Extracellular iGluR cleft closure is linked to receptor activation; however, the mechanism underlying partial agonism is not entirely understood. Full agonists close the bilobed ligand-bindi...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci2000055

    authors: Postila PA,Ylilauri M,Pentikäinen OT

    更新日期:2011-05-23 00:00:00

  • Computational and conformational evaluation of FTase alternative substrates: insight into a novel enzyme binding pocket.

    abstract::Protein farnesyltransferase (FTase) is an important anticancer drug target. In an effort to develop isoprenoid diphosphate-based FTase inhibitors, striking variations have been observed in the ability of conservatively modified analogues to bind to the enzyme. For example, 2Z-GGPP is an alternative substrate with high...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci0496550

    authors: Henriksen BS,Zahn TJ,Evanseck JD,Firestine SM,Gibbs RA

    更新日期:2005-07-01 00:00:00

  • Use of 3D QSAR models for database screening: a feasibility study.

    abstract::The applicability and scope of 3D QSAR methods (CoMFA, CoMSIA) to screen databases are examined. A protocol requiring minimal user intervention has been established to align training and test set molecules using FlexS. As model system isozymes of human carbonic anhydrase (hCA) are used, all results are exemplified stu...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci7002945

    authors: Hillebrecht A,Klebe G

    更新日期:2008-02-01 00:00:00

  • Heteroaromatic π-stacking energy landscapes.

    abstract::In this study we investigate π-stacking interactions of a variety of aromatic heterocycles with benzene using dispersion corrected density functional theory. We calculate extensive potential energy surfaces for parallel-displaced interaction geometries. We find that dispersion contributes significantly to the interact...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci500183u

    authors: Huber RG,Margreiter MA,Fuchs JE,von Grafenstein S,Tautermann CS,Liedl KR,Fox T

    更新日期:2014-05-27 00:00:00

  • Prediction of synthetic accessibility based on commercially available compound databases.

    abstract::A compound's synthetic accessibility (SA) is an important aspect of drug design, since in some cases computer-designed compounds cannot be synthesized. There have been several reports on SA prediction, most of which have focused on the difficulties of synthetic reactions based on retro-synthesis analyses, reaction dat...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci500568d

    authors: Fukunishi Y,Kurosawa T,Mikami Y,Nakamura H

    更新日期:2014-12-22 00:00:00

  • Systematic analysis of enzyme-catalyzed reaction patterns and prediction of microbial biodegradation pathways.

    abstract::The roles of chemical compounds in biological systems are now systematically analyzed by high-throughput experimental technologies. To automate the processing and interpretation of large-scale data it is necessary to develop bioinformatics methods to extract information from the chemical structures of these small mole...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci700006f

    authors: Oh M,Yamada T,Hattori M,Goto S,Kanehisa M

    更新日期:2007-07-01 00:00:00

  • Algorithm for reaction classification.

    abstract::Reaction classification has important applications, and many approaches to classification have been applied. Our own algorithm tests all maximum common substructures (MCS) between all reactant and product molecules in order to find an atom mapping containing the minimum chemical distance (MCD). Recent publications hav...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci400442f

    authors: Kraut H,Eiblmaier J,Grethe G,Löw P,Matuszczyk H,Saller H

    更新日期:2013-11-25 00:00:00

  • Assessment of the Cruzain Cysteine Protease Reversible and Irreversible Covalent Inhibition Mechanism.

    abstract::Reversible and irreversible covalent ligands are advanced cysteine protease inhibitors in the drug development pipeline. K777 is an irreversible inhibitor of cruzain, a necessary enzyme for the survival of the Trypanosoma cruzi (T. cruzi) parasite, the causative agent of Chagas disease. Despite their importance, irrev...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b01138

    authors: Silva JRA,Cianni L,Araujo D,Batista PHJ,de Vita D,Rosini F,Leitão A,Lameira J,Montanari CA

    更新日期:2020-03-23 00:00:00

  • Customizable Generation of Synthetically Accessible, Local Chemical Subspaces.

    abstract::Screening large libraries of chemicals has been an efficient strategy to discover bioactive compounds; however a portion of the potential for success is limited to the available libraries. Synergizing combinatorial and computational chemistries has emerged as a time-efficient strategy to explore the chemical space mor...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.6b00648

    authors: Pottel J,Moitessier N

    更新日期:2017-03-27 00:00:00

  • Allosteric Modulation of Human Hsp90α Conformational Dynamics.

    abstract::Central to Hsp90's biological function is its ability to interconvert between various conformational states. Drug targeting of Hsp90's regulatory mechanisms, including its modulation by cochaperone association, presents as an attractive therapeutic strategy for Hsp90 associated pathologies. In this study, we utilized ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00630

    authors: Penkler DL,Atilgan C,Tastan Bishop Ö

    更新日期:2018-02-26 00:00:00

  • Scores of extended connectivity fingerprint as descriptors in QSPR study of melting point and aqueous solubility.

    abstract::QSPR studies, using scores of SciTegic's extended connectivity fingerprint as raw descriptors, were extended to the prediction of melting points and aqueous solubility of organic compounds. Robust partial least-squares models were developed that perform as well as the best published QSPR models for structurally divers...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci800024c

    authors: Zhou D,Alelyunas Y,Liu R

    更新日期:2008-05-01 00:00:00

  • Development of a computational tool to rival experts in the prediction of sites of metabolism of xenobiotics by p450s.

    abstract::The metabolism of xenobiotics--and more specifically drugs--in the liver is a critical process controlling their half-life. Although there exist experimental methods, which measure the metabolic stability of xenobiotics and identify their metabolites, developing higher throughput predictive methods is an avenue of res...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci3003073

    authors: Campagna-Slater V,Pottel J,Therrien E,Cantin LD,Moitessier N

    更新日期:2012-09-24 00:00:00

  • LigQ: A Webserver to Select and Prepare Ligands for Virtual Screening.

    abstract::Virtual screening is a powerful methodology to search for new small molecule inhibitors against a desired molecular target. Usually, it involves evaluating thousands of compounds (derived from large databases) in order to select a set of potential binders that will be tested in the wet-lab. The number of tested compou...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00241

    authors: Radusky L,Ruiz-Carmona S,Modenutti C,Barril X,Turjanski AG,Martí MA

    更新日期:2017-08-28 00:00:00