Predicted Biological Activity of Purchasable Chemical Space.

Abstract:

:Whereas 400 million distinct compounds are now purchasable within the span of a few weeks, the biological activities of most are unknown. To facilitate access to new chemistry for biology, we have combined the Similarity Ensemble Approach (SEA) with the maximum Tanimoto similarity to the nearest bioactive to predict activity for every commercially available molecule in ZINC. This method, which we label SEA+TC, outperforms both SEA and a naïve-Bayesian classifier via predictive performance on a 5-fold cross-validation of ChEMBL's bioactivity data set (version 21). Using this method, predictions for over 40% of compounds (>160 million) have either high significance (pSEA ≥ 40), high similarity (ECFP4MaxTc ≥ 0.4), or both, for one or more of 1382 targets well described by ligands in the literature. Using a further 1347 less-well-described targets, we predict activities for an additional 11 million compounds. To gauge whether these predictions are sensible, we investigate 75 predictions for 50 drugs lacking a binding affinity annotation in ChEMBL. The 535 million predictions for over 171 million compounds at 2629 targets are linked to purchasing information and evidence to support each prediction and are freely available via https://zinc15.docking.org and https://files.docking.org .

journal_name

J Chem Inf Model

authors

Irwin JJ,Gaskins G,Sterling T,Mysinger MM,Keiser MJ

doi

10.1021/acs.jcim.7b00316

subject

Has Abstract

pub_date

2018-01-22 00:00:00

pages

148-164

issue

1

eissn

1549-9596

issn

1549-960X

journal_volume

58

pub_type

杂志文章
  • Prediction of molecular solvation free energy based on the optimization of atomic solvation parameters with genetic algorithm.

    abstract::We propose an improved solvent contact model to estimate the solvation free energy of an organic molecule from individual atomic contributions. The modification of the solvation model involves the optimization of three kinds of parameters in the solvation free energy function: atomic fragmental volume, maximum atomic ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci600453b

    authors: Kang H,Choi H,Park H

    更新日期:2007-03-01 00:00:00

  • Molecular modeling of DNA cross-linking analogues based on the azinomycin scaffold.

    abstract::In this work, we present molecular modeling studies carried out using six DNA sequences and six azinomycin analogues, including the naturally occurring compound azinomycin B, selected on the basis of known cell cytotoxicity and structural analogies (epoxide and aziridine alkylating moieties). Among several computation...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci0496595

    authors: Alcaro S,Ortuso F,Coleman RS

    更新日期:2005-05-01 00:00:00

  • Evaluation and Characterization of Trk Kinase Inhibitors for the Treatment of Pain: Reliable Binding Affinity Predictions from Theory and Computation.

    abstract::Optimization of ligand binding affinity to the target protein of interest is a primary objective in small-molecule drug discovery. Until now, the prediction of binding affinities by computational methods has not been widely applied in the drug discovery process, mainly because of its lack of accuracy and reproducibili...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.6b00780

    authors: Wan S,Bhati AP,Skerratt S,Omoto K,Shanmugasundaram V,Bagal SK,Coveney PV

    更新日期:2017-04-24 00:00:00

  • Effect of structural stress on the flexibility and adaptability of HIV-1 protease.

    abstract::Resistance remains a major issue with regards to HIV-1 protease, despite the availability of numerous HIV-1 protease inhibitors and copious amounts of structural and binding data. In an effort to improve our understanding of how HIV-1 protease is able to "outsmart" new drugs, we have investigated the flexibility of HI...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci2000677

    authors: Oehme DP,Wilson DJ,Brownlee RT

    更新日期:2011-05-23 00:00:00

  • Drug effect prediction by polypharmacology-based interaction profiling.

    abstract::Most drugs exert their effects via multitarget interactions, as hypothesized by polypharmacology. While these multitarget interactions are responsible for the clinical effect profiles of drugs, current methods have failed to uncover the complex relationships between them. Here, we introduce an approach which is able t...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci2002022

    authors: Simon Z,Peragovics A,Vigh-Smeller M,Csukly G,Tombor L,Yang Z,Zahoránszky-Kohalmi G,Végner L,Jelinek B,Hári P,Hetényi C,Bitter I,Czobor P,Málnási-Csizmadia A

    更新日期:2012-01-23 00:00:00

  • Molecular Mechanism, Dynamics, and Energetics of Protein-Mediated Dinucleotide Flipping in a Mismatched DNA: A Computational Study of the RAD4-DNA Complex.

    abstract::DNA damage alters genetic information and adversely affects gene expression pathways leading to various complex genetic disorders and cancers. DNA repair proteins recognize and rectify DNA damage and mismatches with high fidelity. A critical molecular event that occurs during most protein-mediated DNA repair processes...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00636

    authors: Pitta K,Krishnan M

    更新日期:2018-03-26 00:00:00

  • Cross-docking of inhibitors into CDK2 structures. 2.

    abstract::In the preceding paper (Duca, J. S.; Madison, V. S.; Voigt, J. H. J. Chem. Inf. Model. 2008, 48, 659-668), the accuracy of docking and affinity predictions of the Gold and Glide programs were investigated using single protein conformations spanning 150 CDK2/inhibitor crystallographic complexes. High docking accuracy w...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci700428d

    authors: Voigt JH,Elkin C,Madison VS,Duca JS

    更新日期:2008-03-01 00:00:00

  • GESSE: Predicting Drug Side Effects from Drug-Target Relationships.

    abstract::The in silico prediction of unwanted side effects (SEs) caused by the promiscuous behavior of drugs and their targets is highly relevant to the pharmaceutical industry. Considerable effort is now being put into computational and experimental screening of several suspected off-target proteins in the hope that SEs might...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.5b00120

    authors: Pérez-Nueno VI,Souchet M,Karaboga AS,Ritchie DW

    更新日期:2015-09-28 00:00:00

  • Plant Metabolite Databases: From Herbal Medicines to Modern Drug Discovery.

    abstract::Traditional herbal medicine has been an inseparable part of the traditional medical science in many countries throughout history. Nowadays, the popularity of using herbal medicines in daily life, as well as clinical practices, has gradually expanded to numerous Western countries with positive impacts and acceptance. T...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00826

    authors: Nguyen-Vo TH,Nguyen L,Do N,Nguyen TN,Trinh K,Cao H,Le L

    更新日期:2020-03-23 00:00:00

  • Cheminformatics Modeling of Adverse Drug Responses by Clinically Relevant Mutants of Human Androgen Receptor.

    abstract::The human androgen receptor (AR) is a ligand-activated transcription factor that plays a pivotal role in the development and progression of prostate cancer (PCa). Many forms of castration-resistant prostate cancer (CRPC) still rely on the AR for survival. Currently used antiandrogens face clinical limitations as drug ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.6b00400

    authors: Paul N,Carabet LA,Lallous N,Yamazaki T,Gleave ME,Rennie PS,Cherkasov A

    更新日期:2016-12-27 00:00:00

  • Estimation of carcinogenicity using molecular fragments tree.

    abstract::Carcinogenicity is an important toxicological endpoint that poses high concern to drug discovery. In this study, we developed a method to extract structural alerts (SAs) and modulating factors of carcinogens on the basis of statistical analyses. First, the Gaston algorithm, a frequent subgraph mining method, was used ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci300266p

    authors: Wang Y,Lu J,Wang F,Shen Q,Zheng M,Luo X,Zhu W,Jiang H,Chen K

    更新日期:2012-08-27 00:00:00

  • Protein kinases: docking and homology modeling reliability.

    abstract::A database of about 700 high-resolution kinase structures was used to test the reliability of 17 docking procedures (using six docking software packages) by means of self- and cross-docking studies. The analysis of about 80 000 docking calculations suggests that the docking of an unknown ligand into a kinase has a pro...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci100161z

    authors: Tuccinardi T,Botta M,Giordano A,Martinelli A

    更新日期:2010-08-23 00:00:00

  • Regioselectivity prediction of CYP1A2-mediated phase I metabolism.

    abstract::A kinetic, reactivity-binding model has been proposed to predict the regioselectivity of substrates meditated by the CYP1A2 enzyme, which is responsible for the metabolism of planar-conjugated compounds such as caffeine. This model consists of a docking simulation for binding energy and a semiempirical molecular orbit...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci800001m

    authors: Jung J,Kim ND,Kim SY,Choi I,Cho KH,Oh WS,Kim DN,No KT

    更新日期:2008-05-01 00:00:00

  • Ligand- and Structure-Based Analysis of Deep Learning-Generated Potential α2a Adrenoceptor Agonists.

    abstract::The α2a adrenoceptor is a medically relevant subtype of the G protein-coupled receptor family. Unfortunately, high-throughput techniques aimed at producing novel drug leads for this receptor have been largely unsuccessful because of the complex pharmacology of adrenergic receptors. As such, cutting-edge in silico liga...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c01019

    authors: Schultz KJ,Colby SM,Lin VS,Wright AT,Renslow RS

    更新日期:2021-01-25 00:00:00

  • Multitarget structure-activity relationships characterized by activity-difference maps and consensus similarity measure.

    abstract::Dual and triple activity-difference (DAD/TAD) maps are tools for the systematic characterization of structure-activity relationships (SAR) of compound data sets screened against two or three targets. DAD and TAD maps are two- and three- dimensional representations of the pairwise activity differences of compound data ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci200281v

    authors: Medina-Franco JL,Yongye AB,Pérez-Villanueva J,Houghten RA,Martínez-Mayorga K

    更新日期:2011-09-26 00:00:00

  • Energetics, Thermodynamics, and Molecular Recognition of Piperine with DNA.

    abstract::Piperine, the bioactive phytochemical from black pepper (Piper nigrum L.), is a nontoxic natural compound exhibiting many physiological and pharmacological properties. They include antioxidant, anti-inflammatory, antimutagenic, antitumor, antiapoptotic, antigenotoxic, antiarthritic, antifungal, antimicrobial, antidepr...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.5b00514

    authors: Haris P,Mary V,Haridas M,Sudarsanakumar C

    更新日期:2015-12-28 00:00:00

  • Structure-based approach for the study of estrogen receptor binding affinity and subtype selectivity.

    abstract::Estrogens exert important physiological effects through the modulation of two human estrogen receptor (hER) subtypes, alpha (hERalpha) and beta (hERbeta). Because the levels and relative proportion of hERalpha and hERbeta differ significantly in different target cells, selective hER ligands could target specific tissu...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci8002182

    authors: Salum LB,Polikarpov I,Andricopulo AD

    更新日期:2008-11-01 00:00:00

  • Exploring Topological Pharmacophore Graphs for Scaffold Hopping.

    abstract::The primary goal of ligand-based virtual screening is to identify active compounds consisting of a core scaffold that is not found in the current active compound pool. Scaffold hopping is the term used for this purpose. In the present study, topological representations of pharmacophore features on chemical graphs were...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00098

    authors: Nakano H,Miyao T,Funatsu K

    更新日期:2020-04-27 00:00:00

  • Protein flexibility in virtual screening: the BACE-1 case study.

    abstract::Simulating protein flexibility is a major issue in the docking-based drug-design process for which a single methodological solution does not exist. In our search of new anti-Alzheimer ligands, we were faced with the challenge of including receptor plasticity in a virtual screening campaign aimed at finding new β-secre...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci300390h

    authors: Cosconati S,Marinelli L,Di Leva FS,La Pietra V,De Simone A,Mancini F,Andrisano V,Novellino E,Goodsell DS,Olson AJ

    更新日期:2012-10-22 00:00:00

  • First Multitarget Chemo-Bioinformatic Model To Enable the Discovery of Antibacterial Peptides against Multiple Gram-Positive Pathogens.

    abstract::Antimicrobial peptides (AMPs) have emerged as promising therapeutic alternatives to fight against the diverse infections caused by different pathogenic microorganisms. In this context, theoretical approaches in bioinformatics have paved the way toward the creation of several in silico models capable of predicting anti...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.5b00630

    authors: Speck-Planche A,Kleandrova VV,Ruso JM,Cordeiro MN

    更新日期:2016-03-28 00:00:00

  • Kinome-wide activity modeling from diverse public high-quality data sets.

    abstract::Large corpora of kinase small molecule inhibitor data are accessible to public sector research from thousands of journal article and patent publications. These data have been generated employing a wide variety of assay methodologies and experimental procedures by numerous laboratories. Here we ask the question how app...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci300403k

    authors: Schürer SC,Muskal SM

    更新日期:2013-01-28 00:00:00

  • Comparison of several molecular docking programs: pose prediction and virtual screening accuracy.

    abstract::Molecular docking programs are widely used modeling tools for predicting ligand binding modes and structure based virtual screening. In this study, six molecular docking programs (DOCK, FlexX, GLIDE, ICM, PhDOCK, and Surflex) were evaluated using metrics intended to assess docking pose and virtual screening accuracy. ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci900056c

    authors: Cross JB,Thompson DC,Rai BK,Baber JC,Fan KY,Hu Y,Humblet C

    更新日期:2009-06-01 00:00:00

  • Large-scale mining for similar protein binding pockets: with RAPMAD retrieval on the fly becomes real.

    abstract::Determination of structural similarities between protein binding pockets is an important challenge in in silico drug design. It can help to understand selectivity considerations, predict unexpected ligand cross-reactivity, and support the putative annotation of function to orphan proteins. To this end, Cavbase was dev...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci5005898

    authors: Krotzky T,Grunwald C,Egerland U,Klebe G

    更新日期:2015-01-26 00:00:00

  • Structure-based discovery of novel non-nucleosidic DNA alkyltransferase inhibitors: virtual screening and in vitro and in vivo activities.

    abstract::The human DNA-repair O (6)-alkylguanine DNA alkyltransferase (MGMT or hAGT) protein protects DNA from environmental alkylating agents and also plays an important role in tumor resistance to chemotherapy treatment. Available inhibitors, based on pseudosubstrate analogs, have been shown to induce substantial bone marrow...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci700447r

    authors: Ruiz FM,Gil-Redondo R,Morreale A,Ortiz AR,Fábrega C,Bravo J

    更新日期:2008-04-01 00:00:00

  • The molecular basis for the selectivity of tadalafil toward phosphodiesterase 5 and 6: a modeling study.

    abstract::Great attention has been paid to the clinical significance of phosphodiesterase 5 (PDE5) inhibitors, such as sildenafil, tadalafil, and vardenafil widely used for erectile dysfunction. However, sildenafil causes side effects on visual functions since it shows similar potencies to inhibit PDE5 and PDE6, whereas tadalaf...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci400458z

    authors: Huang YY,Li Z,Cai YH,Feng LJ,Wu Y,Li X,Luo HB

    更新日期:2013-11-25 00:00:00

  • FORTRAN interface for code interoperability in quantum chemistry: the Q5Cost library.

    abstract::Ab initio quantum-chemistry programs produce and use large amounts of data, which are usually stored on disk in the form of binary files. A FORTRAN library, named Q5Cost, has been designed and implemented in order to allow the storage of these data sets in a special data format built with the HDF5 technology. This dat...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci7000567

    authors: Borini S,Monari A,Rossi E,Tajti A,Angeli C,Bendazzoli GL,Cimiraglia R,Emerson A,Evangelisti S,Maynau D,Sanchez-Marin J,Szalay PG

    更新日期:2007-05-01 00:00:00

  • Support vector regression scoring of receptor-ligand complexes for rank-ordering and virtual screening of chemical libraries.

    abstract::The community structure-activity resource (CSAR) data sets are used to develop and test a support vector machine-based scoring function in regression mode (SVR). Two scoring functions (SVR-KB and SVR-EP) are derived with the objective of reproducing the trend of the experimental binding affinities provided within the ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci200078f

    authors: Li L,Wang B,Meroueh SO

    更新日期:2011-09-26 00:00:00

  • GalaxyDock: protein-ligand docking with flexible protein side-chains.

    abstract::An important issue in developing protein-ligand docking methods is how to incorporate receptor flexibility. Consideration of receptor flexibility using an ensemble of precompiled receptor conformations or by employing an effectively enlarged binding pocket has been reported to be useful. However, direct consideration ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci300342z

    authors: Shin WH,Seok C

    更新日期:2012-12-21 00:00:00

  • SMIfp (SMILES fingerprint) chemical space for virtual screening and visualization of large databases of organic molecules.

    abstract::SMIfp (SMILES fingerprint) is defined here as a scalar fingerprint describing organic molecules by counting the occurrences of 34 different symbols in their SMILES strings, which creates a 34-dimensional chemical space. Ligand-based virtual screening using the city-block distance CBD(SMIfp) as similarity measure provi...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci400206h

    authors: Schwartz J,Awale M,Reymond JL

    更新日期:2013-08-26 00:00:00

  • Torsion Library Reloaded: A New Version of Expert-Derived SMARTS Rules for Assessing Conformations of Small Molecules.

    abstract::The Torsion Library contains hundreds of rules for small molecule conformations which have been derived from the Cambridge Structural Database (CSD) and are curated by molecular design experts. The torsion rules are encoded as SMARTS patterns and categorize rotatable bonds via a traffic light coloring scheme. We have ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.5b00522

    authors: Guba W,Meyder A,Rarey M,Hert J

    更新日期:2016-01-25 00:00:00