Improving classical substructure-based virtual screening to handle extrapolation challenges.

Abstract:

:Target-oriented substructure-based virtual screening (sSBVS) of molecules is a promising approach in drug discovery. Yet, there are doubts whether sSBVS is suitable also for extrapolation, that is, for detecting molecules that are very different from those used for training. Herein, we evaluate the predictive power of classic virtual screening methods, namely, similarity searching using Tanimoto coefficient (MTC) and Naive Bayes (NB). As could be expected, these classic methods perform better in interpolation than in extrapolation tasks. Consequently, to enhance the predictive ability for extrapolation tasks, we introduce the Shadow approach, in which inclusion relations between substructures are considered, as opposed to the classic sSBVS methods that assume independence between substructures. Specifically, we discard contributions from substructures included in ("shaded" by) others which are, in turn, included in the molecule of interest. Indeed, the Shadow classifier significantly outperforms both MTC (pValue = 3.1 × 10(-16)) and NB (pValue = 3.5 × 10(-9)) in detecting hits sharing low similarity with the training active molecules.

journal_name

J Chem Inf Model

authors

Biniashvili T,Schreiber E,Kliger Y

doi

10.1021/ci200472s

subject

Has Abstract

pub_date

2012-03-26 00:00:00

pages

678-85

issue

3

eissn

1549-9596

issn

1549-960X

journal_volume

52

pub_type

杂志文章
  • Identification of Enzyme Genes Using Chemical Structure Alignments of Substrate-Product Pairs.

    abstract::Although there are several databases that contain data on many metabolites and reactions in biochemical pathways, there is still a big gap in the numbers between experimentally identified enzymes and metabolites. It is supposed that many catalytic enzyme genes are still unknown. Although there are previous studies tha...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.5b00216

    authors: Moriya Y,Yamada T,Okuda S,Nakagawa Z,Kotera M,Tokimatsu T,Kanehisa M,Goto S

    更新日期:2016-03-28 00:00:00

  • AntiBac-Pred: A Web Application for Predicting Antibacterial Activity of Chemical Compounds.

    abstract::Discovery of new antibacterial agents is a never-ending task of medicinal chemistry. Every new drug brings significant improvement to patients with bacterial infections, but prolonged usage of antibacterials leads to the emergence of resistant strains. Therefore, novel active structures with new modes of action are re...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00436

    authors: Pogodin PV,Lagunin AA,Rudik AV,Druzhilovskiy DS,Filimonov DA,Poroikov VV

    更新日期:2019-11-25 00:00:00

  • Ligand-based molecular modeling study on a chemically diverse series of cholecystokinin-B/gastrin receptor antagonists: generation of predictive model.

    abstract::Pharmacophore hypotheses were developed for six structurally diverse series of cholecystokinin-B/gastrin receptor (CCK-BR) antagonists. A training set consisting of 33 compounds was carefully selected. The activity spread of the training set molecules was from 0.1 to 2100 nM. The most predictive pharmacophore model (h...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci050257m

    authors: Chopra M,Mishra AK

    更新日期:2005-11-01 00:00:00

  • Posetic quantitative superstructure/activity relationships (QSSARs) for chlorobenzenes.

    abstract::As a result of the widespread industrial use of polychlorinated hydrocarbons, they have accumulated in nearly all types of environmental compartments, especially in aquatic systems. Particularly, chloroaromatics are among the most undesirable industrial effluents because of their persistence and toxicity. To predict c...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci0501342

    authors: Ivanciuc T,Ivanciuc O,Klein DJ

    更新日期:2005-07-01 00:00:00

  • Potent Human Telomerase Inhibitors: Molecular Dynamic Simulations, Multiple Pharmacophore-Based Virtual Screening, and Biochemical Assays.

    abstract::Telomere maintenance is a universal cancer hallmark, and small molecules that disrupt telomere maintenance generally have anticancer properties. Since the vast majority of cancer cells utilize telomerase activity for telomere maintenance, the enzyme has been considered as an anticancer drug target. Recently, rational ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.5b00336

    authors: Shirgahi Talari F,Bagherzadeh K,Golestanian S,Jarstfer M,Amanlou M

    更新日期:2015-12-28 00:00:00

  • Receptor-based virtual ligand screening for the identification of novel CDC25 phosphatase inhibitors.

    abstract::CDC25 phosphatases play critical roles in cell cycle regulation and are attractive targets for anticancer therapies. Several small non-peptide molecules are known to inhibit CDC25, but many of them appear to form a covalent bond with the enzyme or act through oxidation of the thiolate group of the catalytic cysteine. ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci700313e

    authors: Montes M,Braud E,Miteva MA,Goddard ML,Mondésert O,Kolb S,Brun MP,Ducommun B,Garbay C,Villoutreix BO

    更新日期:2008-01-01 00:00:00

  • Characterization of DNA primary sequences by a new similarity/diversity measure based on the partial ordering.

    abstract::The similarity/diversity measures play a fundamental role in library searching, virtual screening, and quantitative structure-activity relationship/quantitative structure-property relationship modeling as well as in genomics and proteomics. In this paper, a new similarity/diversity measure is proposed as a new approac...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci060099e

    authors: Todeschini R,Consonni V,Mauri A,Ballabio D

    更新日期:2006-09-01 00:00:00

  • Conformator: A Novel Method for the Generation of Conformer Ensembles.

    abstract::Computer-aided drug design methods such as docking, pharmacophore searching, 3D database searching, and the creation of 3D-QSAR models need conformational ensembles to handle the flexibility of small molecules. Here, we present Conformator, an accurate and effective knowledge-based algorithm for generating conformer e...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.8b00704

    authors: Friedrich NO,Flachsenberg F,Meyder A,Sommer K,Kirchmair J,Rarey M

    更新日期:2019-02-25 00:00:00

  • Discovery and Evaluation of Anti-Fibrinolytic Plasmin Inhibitors Derived from 5-(4-Piperidyl)isoxazol-3-ol (4-PIOL).

    abstract::Inhibition of plasmin has been found to effectively reduce fibrinolysis and to avoid hemorrhage. This can be achieved by addressing its kringle 1 domain with the known drug and lysine analogue tranexamic acid. Guided by shape similarities toward a previously discovered lead compound, 5-(4-piperidyl)isoxazol-3-ol, a se...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00255

    authors: Schmidt TC,Eriksson PO,Gustafsson D,Cosgrove D,Frølund B,Boström J

    更新日期:2017-07-24 00:00:00

  • Parameterization and conformational sampling effects in pharmacophore multiplet searching.

    abstract::Pharmacophore patterns in ligands can be effectively characterized in terms of their constituent pharmacophore multiplets. Bitsets (fingerprints) encoding which particular multiplets are found in a given ligand have been and continue to be used as molecular descriptors in a range of molecular modeling applications, fr...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci800234q

    authors: Fox PC,Wolohan PR,Abrahamian E,Clark RD

    更新日期:2008-12-01 00:00:00

  • Evaluation of different virtual screening programs for docking in a charged binding pocket.

    abstract::Virtual screening of small molecules against a protein target often identifies the correct pose, but the ranking in terms of binding energy remains a difficult problem, resulting in unacceptable numbers of false positives and negatives. To investigate this problem, the performance of three docking programs, FRED, QXP/...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci800154w

    authors: Deng W,Verlinde CL

    更新日期:2008-10-01 00:00:00

  • Combined approach using ligand efficiency, cross-docking, and antitarget hits for wild-type and drug-resistant Y181C HIV-1 reverse transcriptase.

    abstract::New hits against HIV-1 wild-type and Y181C drug-resistant reverse transcriptases were predicted taking into account the possibility of some of the known metabolism interactions. In silico hits against a set of antitargets (i.e., proteins or nucleic acids that are off-targets from the desired pharmaceutical target obje...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci200203h

    authors: García-Sosa AT,Sild S,Takkis K,Maran U

    更新日期:2011-10-24 00:00:00

  • Systematics of high-genus fullerenes.

    abstract::In this article, we present a systematic way to classify a family of high-genus fullerenes (HGFs) by decomposing them into two types of necklike structures, which are the negatively curved parts of parent toroidal carbon nanotubes. By replacing the faces of a uniform polyhedron with these necks, an HGF polyhedron corr...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci9001124

    authors: Chuang C,Jin BY

    更新日期:2009-07-01 00:00:00

  • Time-Domain Analysis of Molecular Dynamics Trajectories Using Deep Neural Networks: Application to Activity Ranking of Tankyrase Inhibitors.

    abstract::Molecular dynamics simulations provide valuable insights into the behavior of molecular systems. Extending the recent trend of using machine learning techniques to predict physicochemical properties from molecular dynamics data, we propose to consider the trajectories as multidimensional time series represented by 2D ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00135

    authors: Berishvili VP,Perkin VO,Voronkov AE,Radchenko EV,Syed R,Venkata Ramana Reddy C,Pillay V,Kumar P,Choonara YE,Kamal A,Palyulin VA

    更新日期:2019-08-26 00:00:00

  • Virtual Screening with Generative Topographic Maps: How Many Maps Are Required?

    abstract::Universal generative topographic maps (GTMs) provide two-dimensional representations of chemical space selected for their "polypharmacological competence", that is, the ability to simultaneously represent meaningful activity and property landscapes, associated with many distinct targets and properties. Several such GT...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.8b00650

    authors: Casciuc I,Zabolotna Y,Horvath D,Marcou G,Bajorath J,Varnek A

    更新日期:2019-01-28 00:00:00

  • Predictive models for cytochrome p450 isozymes based on quantitative high throughput screening data.

    abstract::The human cytochrome P450 (CYP450) isozymes are the most important enzymes in the body to metabolize many endogenous and exogenous substances including environmental toxins and therapeutic drugs. Any unnecessary interactions between a small molecule and CYP450 isozymes may raise a potential to disarm the integrity of ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci200311w

    authors: Sun H,Veith H,Xia M,Austin CP,Huang R

    更新日期:2011-10-24 00:00:00

  • Molecular Dynamics Simulations of Supramolecular Anticancer Nanotubes.

    abstract::We report here on long-time all-atomistic molecular dynamics simulations of functional supramolecular nanotubes composed by the self-assembly of peptide-drug amphiphiles (DAs). These DAs have been shown to possess an inherently high drug loading of the hydrophobic anticancer drug camptothecin. We probe the self-assemb...

    journal_title:Journal of chemical information and modeling

    pub_type: 信件

    doi:10.1021/acs.jcim.8b00193

    authors: Kang M,Chakraborty K,Loverde SM

    更新日期:2018-06-25 00:00:00

  • 3D QSAR methods: Phase and Catalyst compared.

    abstract::The programs Phase and Catalyst HypoGen are compared for their performance in determining three-dimensional quantitative structure-activity relationships. Eight sets of compounds with measured activity were collected from the public literature and partitioned into suitable training and test sets by an automated proced...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci7000082

    authors: Evans DA,Doman TN,Thorner DA,Bodkin MJ

    更新日期:2007-05-01 00:00:00

  • ColBioS-FlavRC: a collection of bioselective flavonoids and related compounds filtered from high-throughput screening outcomes.

    abstract::Flavonoids, the vastest class of natural polyphenols, are extensively investigated for their multiple benefits on human health. Due to their physicochemical or biological properties, many representatives are considered to exhibit low selectivity among various protein targets or to plague high-throughput screening (HTS...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci5002668

    authors: Avram SI,Pacureanu LM,Bora A,Crisan L,Avram S,Kurunczi L

    更新日期:2014-08-25 00:00:00

  • Use of surface charges from DFT calculations to predict intestinal absorption.

    abstract::A model for prediction of percent intestinal absorption (%Abs) of neutral molecules was developed based upon surface charges of the molecule calculated by density functional theory (DFT). The surface charges are decomposed into sigma moments which are correlated to a partition coefficient representing transfer of the ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci049653f

    authors: Jones R,Connolly PC,Klamt A,Diedenhofen M

    更新日期:2005-09-01 00:00:00

  • Chemoinformatics-based classification of prohibited substances employed for doping in sport.

    abstract::Representative molecules from 10 classes of prohibited substances were taken from the World Anti-Doping Agency (WADA) list, augmented by molecules from corresponding activity classes found in the MDDR database. Together with some explicitly allowed compounds, these formed a set of 5245 molecules. Five types of fingerp...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci0601160

    authors: Cannon EO,Bender A,Palmer DS,Mitchell JB

    更新日期:2006-11-01 00:00:00

  • Insights on the facet specific adsorption of amino acids and peptides toward platinum.

    abstract::Engineering shape-controlled bionanomaterials requires comprehensive understanding of interactions between biomolecules and inorganic surfaces. We explore the origin of facet-selective binding of peptides adsorbed onto Pt(100) and Pt(111) crystallographic planes. Using molecular dynamics simulations, we show that upon...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci400630d

    authors: Ramakrishnan SK,Martin M,Cloitre T,Firlej L,Cuisinier FJ,Gergely C

    更新日期:2013-12-23 00:00:00

  • Transplant-insert-constrain-relax-assemble (TICRA): protein-ligand complex structure modeling and application to kinases.

    abstract::We introduce TICRA (transplant-insert-constrain-relax-assemble), a method for modeling the structure of unknown protein-ligand complexes using the X-ray crystal structures of homologous proteins and ligands with known activity. We present results from modeling the structures of protein kinase-inhibitor complexes using...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci100256u

    authors: Meshkat S,Klon AE,Zou J,Wiseman JS,Konteatis Z

    更新日期:2011-01-24 00:00:00

  • Effect of data standardization on chemical clustering and similarity searching.

    abstract::Standardization is used to ensure that the variables in a similarity calculation make an equal contribution to the computed similarity value. This paper compares the use of seven different methods that have been suggested previously for the standardization of integer-valued or real-valued data, comparing the results w...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci800224h

    authors: Chu CW,Holliday JD,Willett P

    更新日期:2009-02-01 00:00:00

  • Viscosity Prediction of Lubricants by a General Feed-Forward Neural Network.

    abstract::Modern industrial lubricants are often blended with an assortment of chemical additives to improve the performance of the base stock. Machine learning-based predictive models allow fast and veracious derivation of material properties and facilitate novel and innovative material designs. In this study, we outline the d...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b01068

    authors: Loh GC,Lee HC,Tee XY,Chow PS,Zheng JW

    更新日期:2020-03-23 00:00:00

  • Scaling predictive modeling in drug development with cloud computing.

    abstract::Growing data sets with increased time for analysis is hampering predictive modeling in drug discovery. Model building can be carried out on high-performance computer clusters, but these can be expensive to purchase and maintain. We have evaluated ligand-based modeling on cloud computing resources where computations ar...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci500580y

    authors: Moghadam BT,Alvarsson J,Holm M,Eklund M,Carlsson L,Spjuth O

    更新日期:2015-01-26 00:00:00

  • Critical Assessment of the Hildebrand and Hansen Solubility Parameters for Polymers.

    abstract::Solubility parameter models are widely used to select suitable solvents/nonsolvents for polymers in a variety of processing and engineering applications. In this study, we focus on two well-established models, namely, the Hildebrand and Hansen solubility parameter models. Both models are built on the basis of the noti...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00656

    authors: Venkatram S,Kim C,Chandrasekaran A,Ramprasad R

    更新日期:2019-10-28 00:00:00

  • PythoMS: A Python Framework To Simplify and Assist in the Processing and Interpretation of Mass Spectrometric Data.

    abstract::Mass spectrometric data are copious and generate a processing burden that is best dealt with programmatically. PythoMS is a collection of tools based on the Python programming language that assist researchers in creating figures and video output that is informative, clear, and visually compelling. The PythoMS framewor...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00055

    authors: Yunker LPE,Donnecke S,Ting M,Yeung D,McIndoe JS

    更新日期:2019-04-22 00:00:00

  • Conformational determinants of the activity of antiproliferative factor glycopeptide.

    abstract::The antiproliferative factor (APF) involved in interstitial cystitis is a glycosylated nonapeptide (TVPAAVVVA) containing a sialylated core 1 α-O-disaccharide linked to the N-terminal threonine. The chemical structure of APF was deduced using spectroscopic techniques and confirmed using total synthesis. The synthetic ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci400147s

    authors: Mallajosyula SS,Adams KM,Barchi JJ,MacKerell AD

    更新日期:2013-05-24 00:00:00

  • Enrichment factor analyses on G-protein coupled receptors with known crystal structure.

    abstract::G-protein coupled receptors (GPCRs) are highly relevant drug targets. Four GPCRs with known crystal structure were analyzed with docking (AutoDock4) and postdocking (MM-PBSA) in order to evaluate the ability to recognize known antagonists from a larger database of molecular decoys and to predict correct binding modes....

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci4000745

    authors: Anighoro A,Rastelli G

    更新日期:2013-04-22 00:00:00