Statistical confidence for variable selection in QSAR models via Monte Carlo cross-validation.

Abstract:

A new variable selection wrapper method named the Monte Carlo variable selection (MCVS) method was developed utilizing the framework of the Monte Carlo cross-validation (MCCV) approach. The MCVS method reports the variable selection results in the most conventional and common measure of statistical hypothesis testing, the P-values, thus allowing for a clear and simple statistical interpretation of the results. The MCVS method is equally applicable to the multiple-linear-regression (MLR)-based or non-MLR-based quantitative structure-activity relationship (QSAR) models. The method was applied to blood-brain barrier (BBB) permeation and human intestinal absorption (HIA) QSAR problems using MLR to demonstrate the workings of the new approach. Starting from more than 1600 molecular descriptors, only two (TPSA(NO) and ALOGP) yielded acceptably low P-values for the BBB and HIA problems, respectively. The new method has been implemented in the QSAR-BENCH v2 program, which is freely available (including its Java source code) from www.dmitrykonovalov.org for academic use.
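The abstract does not give the MCVS algorithm or its P-value definition, so the following is only a minimal sketch of the general idea of Monte Carlo cross-validation for judging a single descriptor. It assumes a one-descriptor least-squares model and reports the fraction of random train/test splits in which that model fails to beat an intercept-only baseline. That fraction is a hypothetical stand-in used for illustration, not the P-value computed by MCVS or QSAR-BENCH. The function names, the split settings, and the synthetic data are all assumptions.

```python
import random
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x with a single descriptor."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    if sxx == 0.0:
        return my, 0.0  # constant descriptor: fall back to the mean
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return my - b * mx, b


def mccv_fail_rate(xs, ys, n_splits=200, test_fraction=0.5, seed=0):
    """Fraction of random train/test splits in which the one-descriptor
    model does NOT beat the intercept-only baseline on the test set.
    (Illustrative statistic only; not the MCVS P-value.)"""
    rng = random.Random(seed)
    n = len(xs)
    n_test = max(1, int(round(n * test_fraction)))
    fails = 0
    for _ in range(n_splits):
        idx = list(range(n))
        rng.shuffle(idx)  # Monte Carlo step: a fresh random partition
        test, train = idx[:n_test], idx[n_test:]
        a, b = fit_line([xs[i] for i in train], [ys[i] for i in train])
        base = mean(ys[i] for i in train)
        sse_model = sum((ys[i] - (a + b * xs[i])) ** 2 for i in test)
        sse_base = sum((ys[i] - base) ** 2 for i in test)
        if sse_model >= sse_base:
            fails += 1
    return fails / n_splits


# Synthetic data (assumed): y depends on `informative` but not on `noise`.
rng = random.Random(1)
informative = [rng.uniform(0, 10) for _ in range(60)]
noise = [rng.uniform(0, 10) for _ in range(60)]
y = [2.0 * x + rng.gauss(0, 1) for x in informative]

p_informative = mccv_fail_rate(informative, y)
p_noise = mccv_fail_rate(noise, y)
print(p_informative, p_noise)
```

In this sketch a descriptor that carries real signal should fail the comparison in far fewer splits than an unrelated one, which is the kind of contrast a selection procedure based on repeated random splits can exploit.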

journal_name

J Chem Inf Model

authors

Konovalov DA,Sim N,Deconinck E,Vander Heyden Y,Coomans D

doi

10.1021/ci700283s

subject

Has Abstract

pub_date

2008-02-01 00:00:00

pages

370-83

issue

2

eissn

1549-960X

issn

1549-9596

journal_volume

48

pub_type

Journal Article
  • Factors affecting d-block metal-ligand bond lengths: toward an automated library of molecular geometry for metal complexes.

    abstract::Metal-ligand (M-L) bond lengths for a range of ligands (carboxylates, chlorides, pyridines, water, tertiary phosphines, and alkenes) and a variety of metals have been retrieved from the Cambridge Structural Database, CSD. Analysis of the factors which affect M-L bond lengths (for example, ligand coordination mode, oxi...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci0500785

    authors: Harris SE,Orpen AG,Bruno IJ,Taylor R

    update_date:2005-11-01 00:00:00

  • Toward high throughput 3D virtual screening using spherical harmonic surface representations.

    abstract::Searching chemical databases for possible drug leads is often one of the main activities conducted during the early stages of a drug development project. This article shows that spherical harmonic molecular shape representations provide a powerful way to search and cluster small-molecule databases rapidly and accurate...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci7001507

    authors: Mavridis L,Hudson BD,Ritchie DW

    update_date:2007-09-01 00:00:00

  • Bridging molecular docking to membrane molecular dynamics to investigate GPCR-ligand recognition: the human A₂A adenosine receptor as a key study.

    abstract::G protein-coupled receptors (GPCRs) represent the largest family of cell-surface receptors and about one-third of the actual targets of clinically used drugs. Following the progress made in the field of GPCRs structural determination, docking-based screening for novel potent and selective ligands is becoming an increa...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci400532b

    authors: Sabbadin D,Ciancetta A,Moro S

    update_date:2014-01-27 00:00:00

  • In Silico Study of Membrane Lipid Composition Regulating Conformation and Hydration of Influenza Virus B M2 Channel.

    abstract::The proton conduction of transmembrane influenza virus B M2 (BM2) proton channel is possibly mediated by the membrane environment, but the detailed molecular mechanism is challenging to determine. In this work, how membrane lipid composition regulates the conformation and hydration of BM2 channel is elucidated in sili...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.0c00329

    authors: Zhang Y,Zhang HX,Zheng QC

    update_date:2020-07-27 00:00:00

  • Efficiency of Stratification for Ensemble Docking Using Reduced Ensembles.

    abstract::Molecular docking can account for receptor flexibility by combining the docking score over multiple rigid receptor conformations, such as snapshots from a molecular dynamics simulation. Here, we evaluate a number of common snapshot selection strategies using a quality metric from stratified sampling, the efficiency of...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.8b00314

    authors: Xie B,Clark JD,Minh DDL

    update_date:2018-09-24 00:00:00

  • The valence state combination model: a generic framework for handling tautomers and protonation states.

    abstract::The consistent handling of molecules is probably the most basic and important requirement in the field of cheminformatics. Reliable results can only be obtained if the underlying calculations are independent of the specific way molecules are represented in the input data. However, ensuring consistency is a complex tas...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci400724v

    authors: Urbaczek S,Kolodzik A,Rarey M

    update_date:2014-03-24 00:00:00

  • BiKi Life Sciences: A New Suite for Molecular Dynamics and Related Methods in Drug Discovery.

    abstract::In this paper, we introduce the BiKi Life Sciences suite. This software makes it easy for computational medicinal chemists to run ad hoc molecular dynamics protocols in a novel and task-oriented environment; as a notebook, BiKi (acronym of Binding Kinetics) keeps memory of any activity together with dependencies among...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.7b00680

    authors: Decherchi S,Bottegoni G,Spitaleri A,Rocchia W,Cavalli A

    update_date:2018-02-26 00:00:00

  • Facile Solutions to the Problems Associated with Chemical Information and Mathematical Symbolism While Using Machine Translation Tools.

    abstract::Advances in computer-aided translation technology have made tremendous progress in accuracy in the past few years. Chemical Abstracts Service of the American Chemical Society summarizes scientific works from more than 50 languages and allows the users to search papers in nine selected languages. Currently, only the ab...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.0c00274

    authors: Wahab MF,Zulfiqar S,Sarwar MI,Lieberwirth I

    update_date:2020-07-27 00:00:00

  • Mesoscopic simulation of phospholipid membranes, peptides, and proteins with molecular fragment dynamics.

    abstract::Molecular fragment dynamics (MFD) is a variant of dissipative particle dynamics (DPD), a coarse-grained mesoscopic simulation technique for isothermal complex fluids and soft matter systems with particles that are chosen to be adequate fluid elements. MFD chooses its particles to be small molecules which may be connecte...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci5006096

    authors: Truszkowski A,van den Broek K,Kuhn H,Zielesny A,Epple M

    update_date:2015-05-26 00:00:00

  • Novel inhibitors of human histone deacetylase (HDAC) identified by QSAR modeling of known inhibitors, virtual screening, and experimental validation.

    abstract::Inhibitors of histone deacetylases (HDACIs) have emerged as a new class of drugs for the treatment of human cancers and other diseases because of their effects on cell growth, differentiation, and apoptosis. In this study we have developed several quantitative structure-activity relationship (QSAR) models for 59 chemi...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci800366f

    authors: Tang H,Wang XS,Huang XP,Roth BL,Butler KV,Kozikowski AP,Jung M,Tropsha A

    update_date:2009-02-01 00:00:00

  • Chemical Topic Modeling: Exploring Molecular Data Sets Using a Common Text-Mining Approach.

    abstract::Big data is one of the key transformative factors which increasingly influences all aspects of modern life. Although this transformation brings vast opportunities it also generates novel challenges, not the least of which is organizing and searching this data deluge. The field of medicinal chemistry is not different: ...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.7b00249

    authors: Schneider N,Fechner N,Landrum GA,Stiefl N

    update_date:2017-08-28 00:00:00

  • Molecular Structure Extraction from Documents Using Deep Learning.

    abstract::Chemical structure extraction from documents remains a hard problem because of both false positive identification of structures during segmentation and errors in the predicted structures. Current approaches rely on handcrafted rules and subroutines that perform reasonably well generally but still routinely encounter s...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.8b00669

    authors: Staker J,Marshall K,Abel R,McQuaw CM

    update_date:2019-03-25 00:00:00

  • Ab Initio Investigation of CO2 Adsorption on 13-Atom 4d Clusters.

    abstract::In this work, we report an ab initio investigation based on density functional theory calculations within van der Waals D3 corrections to investigate the adsorption properties and activation of CO2 on transition-metal (TM) 13-atom clusters (TM = Ru, Rh, Pd, Ag), which is a key step for the development of subnano catal...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.9b00792

    authors: Batista KEA,Ocampo-Restrepo VK,Soares MD,Quiles MG,Piotrowski MJ,Da Silva JLF

    update_date:2020-02-24 00:00:00

  • Multiple e-pharmacophore modeling, 3D-QSAR, and high-throughput virtual screening of hepatitis C virus NS5B polymerase inhibitors.

    abstract::The hepatitis C virus (HCV) NS5B RNA-dependent RNA polymerase (RdRP) is a crucial and unique component of the HCV RNA replication machinery and a validated target for drug discovery. Multiple crystal structures of NS5B inhibitor complexes have facilitated the identification of novel compound scaffolds through in silic...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci400644r

    authors: Therese PJ,Manvar D,Kondepudi S,Battu MB,Sriram D,Basu A,Yogeeswari P,Kaushik-Basu N

    update_date:2014-02-24 00:00:00

  • Exploring inhibitor release pathways in histone deacetylases using random acceleration molecular dynamics simulations.

    abstract::Molecular channel exploration perseveres to be the prominent solution for eliciting structure and accessibility of active site and other internal spaces of macromolecules. The volume and silhouette characterization of these channels provides answers for the issues of substrate access and ligand swapping between the ob...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci200584f

    authors: Kalyaanamoorthy S,Chen YP

    update_date:2012-02-27 00:00:00

  • Effect of structural stress on the flexibility and adaptability of HIV-1 protease.

    abstract::Resistance remains a major issue with regards to HIV-1 protease, despite the availability of numerous HIV-1 protease inhibitors and copious amounts of structural and binding data. In an effort to improve our understanding of how HIV-1 protease is able to "outsmart" new drugs, we have investigated the flexibility of HI...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci2000677

    authors: Oehme DP,Wilson DJ,Brownlee RT

    update_date:2011-05-23 00:00:00

  • Cheminformatics Modeling of Adverse Drug Responses by Clinically Relevant Mutants of Human Androgen Receptor.

    abstract::The human androgen receptor (AR) is a ligand-activated transcription factor that plays a pivotal role in the development and progression of prostate cancer (PCa). Many forms of castration-resistant prostate cancer (CRPC) still rely on the AR for survival. Currently used antiandrogens face clinical limitations as drug ...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.6b00400

    authors: Paul N,Carabet LA,Lallous N,Yamazaki T,Gleave ME,Rennie PS,Cherkasov A

    update_date:2016-12-27 00:00:00

  • Baseline Model for Predicting Protein-Ligand Unbinding Kinetics through Machine Learning.

    abstract::Derivation of structure-kinetics relationships can help rational design and development of new small-molecule drug candidates with desired residence times. Efforts are now being directed toward the development of efficient computational methods. Currently, there is a lack of solid, high-throughput binding kinetics pre...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.0c00450

    authors: Amangeldiuly N,Karlov D,Fedorov MV

    update_date:2020-12-28 00:00:00

  • GalaxyDock: protein-ligand docking with flexible protein side-chains.

    abstract::An important issue in developing protein-ligand docking methods is how to incorporate receptor flexibility. Consideration of receptor flexibility using an ensemble of precompiled receptor conformations or by employing an effectively enlarged binding pocket has been reported to be useful. However, direct consideration ...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci300342z

    authors: Shin WH,Seok C

    update_date:2012-12-21 00:00:00

  • Statistical Analysis on the Performance of Molecular Mechanics Poisson-Boltzmann Surface Area versus Absolute Binding Free Energy Calculations: Bromodomains as a Case Study.

    abstract::Binding free energy calculations that make use of alchemical pathways are becoming increasingly feasible thanks to advances in hardware and algorithms. Although relative binding free energy (RBFE) calculations are starting to find widespread use, absolute binding free energy (ABFE) calculations are still being explore...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.7b00347

    authors: Aldeghi M,Bodkin MJ,Knapp S,Biggin PC

    update_date:2017-09-25 00:00:00

  • Flux (2): comparison of molecular mutation and crossover operators for ligand-based de novo design.

    abstract::We implemented a fragment-based de novo design algorithm for a population-based optimization of molecular structures. The concept is grounded on an evolution strategy with mutation and crossover operators for structure breeding. Molecular building blocks were obtained from the pseudo-retrosynthesis of a collection of ...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci6005307

    authors: Fechner U,Schneider G

    update_date:2007-03-01 00:00:00

  • Molecular Self-Assembly Strategy for Encapsulation of an Amphipathic α-Helical Antimicrobial Peptide into the Different Polymeric and Copolymeric Nanoparticles.

    abstract::Encapsulation of peptide and protein-based drugs in polymeric nanoparticles is one of the fundamental fields in controlled-release drug delivery systems. The molecular mechanisms of absorption of peptides to the polymeric nanoparticles are still unknown, and there is no precise molecular data on the encapsulation proc...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.8b00641

    authors: Jafari M,Doustdar F,Mehrnejad F

    update_date:2019-01-28 00:00:00

  • First Multitarget Chemo-Bioinformatic Model To Enable the Discovery of Antibacterial Peptides against Multiple Gram-Positive Pathogens.

    abstract::Antimicrobial peptides (AMPs) have emerged as promising therapeutic alternatives to fight against the diverse infections caused by different pathogenic microorganisms. In this context, theoretical approaches in bioinformatics have paved the way toward the creation of several in silico models capable of predicting anti...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.5b00630

    authors: Speck-Planche A,Kleandrova VV,Ruso JM,Cordeiro MN

    update_date:2016-03-28 00:00:00

  • Improved Scaffold Hopping in Ligand-Based Virtual Screening Using Neural Representation Learning.

    abstract::Deep learning has demonstrated significant potential in advancing state of the art in many problem domains, especially those benefiting from automated feature extraction. Yet, the methodology has seen limited adoption in the field of ligand-based virtual screening (LBVS) as traditional approaches typically require lar...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/acs.jcim.0c00622

    authors: Stojanović L,Popović M,Tijanić N,Rakočević G,Kalinić M

    update_date:2020-10-26 00:00:00

  • Support vector regression scoring of receptor-ligand complexes for rank-ordering and virtual screening of chemical libraries.

    abstract::The community structure-activity resource (CSAR) data sets are used to develop and test a support vector machine-based scoring function in regression mode (SVR). Two scoring functions (SVR-KB and SVR-EP) are derived with the objective of reproducing the trend of the experimental binding affinities provided within the ...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci200078f

    authors: Li L,Wang B,Meroueh SO

    update_date:2011-09-26 00:00:00

  • Phytochemical informatics of traditional Chinese medicine and therapeutic relevance.

    abstract::Distribution patterns of 8411 compounds from 240 Chinese herbs were analyzed in relation to the herbal categories of traditional Chinese medicine (TCM), using Random Forest (RF) and self-organizing maps (SOM). RF was used first to construct TCM profiles of individual compounds, which describe their affinities for 28 m...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci700155t

    authors: Ehrman TM,Barlow DJ,Hylands PJ

    update_date:2007-11-01 00:00:00

  • CoMFA, CoMSIA, and molecular hologram QSAR studies of novel neuronal nAChRs ligands-open ring analogues of 3-pyridyl ether.

    abstract::3-Pyridyl ethers are excellent nAChRs ligands, which show high subtype selectivity and binding affinity to alpha4beta2 nAChR. Although the quantitative structure-activity relationship (QSAR) of nAChRs ligands has been widely investigated using various classes of compounds, the open ring analogues of 3-pyridyl ethers h...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci0498113

    authors: Zhang H,Li H,Liu C

    update_date:2005-03-01 00:00:00

  • Estimation of ligand efficacies of metabotropic glutamate receptors from conformational forces obtained from molecular dynamics simulations.

    abstract::Group 1 metabotropic glutamate receptors (mGluR) are G-protein coupled receptors with a large bilobate extracellular ligand binding region (LBR) that resembles a Venus fly trap. Closing of this LBR in the presence of a ligand is associated with the activation of the receptor. From conformational sampling of the LBR-li...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci400160x

    authors: Lakkaraju SK,Xue F,Faden AI,MacKerell AD Jr

    update_date:2013-06-24 00:00:00

  • L-arginine binding to human inducible nitric oxide synthase: an antisymmetric funnel route toward isoform-specific inhibitors?

    abstract::Nitric oxide (NO) is an important signaling molecule produced by a family of enzymes called nitric oxide synthases (NOS). Because NO is involved in various pathological conditions, the development of potent and isoform-selective NOS inhibitors is an important challenge. In the present study, the dimer of oxygenase dom...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci100422v

    authors: Floquet N,Hernandez JF,Boucher JL,Martinez J

    update_date:2011-06-27 00:00:00

  • ThermoData Engine (TDE): software implementation of the dynamic data evaluation concept. 9. Extensible thermodynamic constraints for pure compounds and new model developments.

    abstract::ThermoData Engine (TDE) is the first full-scale software implementation of the dynamic data evaluation concept, as reported in this journal. The present article describes the background and implementation for new additions in latest release of TDE. Advances are in the areas of program architecture and quality improvem...

    journal_title:Journal of chemical information and modeling

    pub_type: Journal Article

    doi:10.1021/ci4005699

    authors: Diky V,Chirico RD,Muzny CD,Kazakov AF,Kroenlein K,Magee JW,Abdulagatov I,Frenkel M

    update_date:2013-12-23 00:00:00