Virtual Screening with Generative Topographic Maps: How Many Maps Are Required?


Universal generative topographic maps (GTMs) provide two-dimensional representations of chemical space selected for their "polypharmacological competence", that is, the ability to simultaneously represent meaningful activity and property landscapes associated with many distinct targets and properties. Several such GTMs can be generated, each based on a different initial descriptor vector encoding distinct structural features. While their average polypharmacological competence may indeed be equivalent, they nevertheless diverge significantly with respect to the quality of each property-specific landscape. In this work, we show that distinct universal maps represent complementary and strongly synergistic views of biologically relevant chemical space. Eight universal GTMs were employed as support for predictive classification landscapes, using more than 600 active/inactive ligand series associated with as many targets from the ChEMBL database (v.23). For nine of these targets, it was possible to extract, from the Directory of Useful Decoys (DUD), truly external sets featuring sufficient "actives" and "decoys" not present in the landscape-defining ChEMBL ligand sets. For each such molecule, projected on every class landscape of a particular universal map, a probability of activity was estimated, in analogy to a virtual screening (VS) experiment. Cross-validated (CV) balanced accuracy on landscape-defining ChEMBL data was unable to predict the success of that landscape in VS. Thus, the universal map with the best CV results for a given property should not be prioritized as the implicitly best predictor. For a given map, predictions for many DUD compounds are not trustworthy, according to applicability domain considerations. By contrast, simultaneous application of all universal maps, with the likelihood of activity rated as the mean returned by all applicable maps, significantly improved prediction results. Performance measures in consensus VS using multiple maps were always superior or similar to those of the best individual map.
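The consensus rule described in the abstract (rate each compound by the mean probability of activity over all maps that are applicable to it) can be illustrated with a minimal sketch. The sketch does not model the GTM landscapes themselves. It assumes each map has already returned either a probability or `None` when the compound falls outside that map's applicability domain; the `None` convention and the function names `consensus_probability` and `rank_by_consensus` are hypothetical and do not come from the paper.

```python
from statistics import mean
from typing import Dict, List, Optional, Sequence, Tuple


def consensus_probability(per_map: Sequence[Optional[float]]) -> Optional[float]:
    """Mean activity probability over the maps that are applicable.

    Each entry is one map's predicted probability of activity, or None
    (assumed convention) when the compound lies outside that map's
    applicability domain. Returns None if no map is applicable.
    """
    applicable = [p for p in per_map if p is not None]
    return mean(applicable) if applicable else None


def rank_by_consensus(
    predictions: Dict[str, Sequence[Optional[float]]]
) -> List[Tuple[str, float]]:
    """Rank compounds by consensus probability, highest first.

    Compounds with no applicable map are left out of the ranking.
    """
    scored = []
    for compound_id, per_map in predictions.items():
        score = consensus_probability(per_map)
        if score is not None:
            scored.append((compound_id, score))
    return sorted(scored, key=lambda item: item[1], reverse=True)
```

In this sketch, a compound covered by a single map is still scored, using that map alone. Whether such compounds should instead be set aside is a design choice that the abstract does not settle.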


J Chem Inf Model


Casciuc I,Zabolotna Y,Horvath D,Marcou G,Bajorath J,Varnek A






2019-01-28 00:00:00












  • Molecular Oxygen Binding in the Mitochondrial Electron Transfer Flavoprotein.

    abstract::Reactive oxygen species such as superoxide are potentially harmful byproducts of the aerobic metabolism in the inner mitochondrial membrane, and complexes I, II, III of the electron transport chain have been identified as primary sources. The mitochondrial fatty acid β-oxidation pathway may also play a yet uncharacter...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Husen P,Nielsen C,Martino CF,Solov'yov IA

    update_date: 2019-11-25 00:00:00

  • Efficient Strategy for the Calculation of Solvation Free Energies in Water and Chloroform at the Quantum Mechanical/Molecular Mechanical Level.

    abstract::The partitioning of solute molecules between immiscible solvents with significantly different polarities is of great importance. The polarization between the solute and solvent molecules plays an essential role in determining the solubility of the solute, which makes computational studies utilizing molecular mechanics...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Wang M,Li P,Jia X,Liu W,Shao Y,Hu W,Zheng J,Brooks BR,Mei Y

    update_date: 2017-10-23 00:00:00

  • Impact of template choice on homology model efficiency in virtual screening.

    abstract::Homology modeling is a reliable method of predicting the three-dimensional structures of proteins that lack NMR or X-ray crystallographic data. It employs the assumption that a structural resemblance exists between closely related proteins. Despite the availability of many crystal structures of possible templates, onl...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Rataj K,Witek J,Mordalski S,Kosciolek T,Bojarski AJ

    update_date: 2014-06-23 00:00:00

  • Molecular Dynamics Simulation of the Conformational Preferences of Pseudouridine Derivatives: Improving the Distribution in the Glycosidic Torsion Space.

    abstract::There are only four derivatives of pseudouridine (Ψ) that are known to occur naturally in RNA as post-transcriptional modifications. We have studied the conformational consequences of pseudouridylation and further modifications using replica exchange molecular dynamics simulations at the nucleoside level, and the simu...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Dutta N,Sarzynska J,Lahiri A

    update_date: 2020-10-26 00:00:00

  • Dihedral-based segment identification and classification of biopolymers I: proteins.

    abstract::A new structure classification scheme for biopolymers is introduced, which is solely based on main-chain dihedral angles. It is shown that by dividing a biopolymer into segments containing two central residues, a local classification can be performed. The method is referred to as DISICL, short for Dihedral-based Segme...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Nagy G,Oostenbrink C

    update_date: 2014-01-27 00:00:00

  • Parameterization and conformational sampling effects in pharmacophore multiplet searching.

    abstract::Pharmacophore patterns in ligands can be effectively characterized in terms of their constituent pharmacophore multiplets. Bitsets (fingerprints) encoding which particular multiplets are found in a given ligand have been and continue to be used as molecular descriptors in a range of molecular modeling applications, fr...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Fox PC,Wolohan PR,Abrahamian E,Clark RD

    update_date: 2008-12-01 00:00:00

  • Comparison of several molecular docking programs: pose prediction and virtual screening accuracy.

    abstract::Molecular docking programs are widely used modeling tools for predicting ligand binding modes and structure based virtual screening. In this study, six molecular docking programs (DOCK, FlexX, GLIDE, ICM, PhDOCK, and Surflex) were evaluated using metrics intended to assess docking pose and virtual screening accuracy. ...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Cross JB,Thompson DC,Rai BK,Baber JC,Fan KY,Hu Y,Humblet C

    update_date: 2009-06-01 00:00:00

  • Descriptor Data Bank (DDB): A Cloud Platform for Multiperspective Modeling of Protein-Ligand Interactions.

    abstract::Protein-ligand (PL) interactions play a key role in many life processes such as molecular recognition, molecular binding, signal transmission, and cell metabolism. Examples of interaction forces include hydrogen bonding, hydrophobic effects, steric clashes, electrostatic contacts, and van der Waals attractions. Curren...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Ashtawy HM,Mahapatra NR

    update_date: 2018-01-22 00:00:00

  • An Analysis of Different Components of a High-Throughput Screening Library.

    abstract::Since many projects at pharmaceutical organizations get their start from a high-throughput screening (HTS) campaign, improving the quality of the HTS deck can improve the likelihood of discovering a high-quality lead molecule that can be progressed to a drug candidate. Over the past decade, Janssen has implemented sev...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Saha A,Varghese T,Liu A,Allen SJ,Mirzadegan T,Hack MD

    update_date: 2018-10-22 00:00:00

  • Molecular Dynamics Simulations of Substrate Release from Trypanosoma cruzi UDP-Galactopyranose Mutase.

    abstract::The enzyme UDP-galactopyranose mutase (UGM) represents a promising drug target for the treatment of infections with Trypanosoma cruzi. We have computed the Potential of Mean Force for the release of UDP-galactopyranose from UGM, using Umbrella Sampling simulations. The simulations revealed the conformational changes t...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Cossio-Pérez R,Pierdominici-Sottile G,Sobrado P,Palma J

    update_date: 2019-02-25 00:00:00

  • Heteroaromatic π-stacking energy landscapes.

    abstract::In this study we investigate π-stacking interactions of a variety of aromatic heterocycles with benzene using dispersion corrected density functional theory. We calculate extensive potential energy surfaces for parallel-displaced interaction geometries. We find that dispersion contributes significantly to the interact...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Huber RG,Margreiter MA,Fuchs JE,von Grafenstein S,Tautermann CS,Liedl KR,Fox T

    update_date: 2014-05-27 00:00:00

  • RED: a set of molecular descriptors based on Renyi entropy.

    abstract::New molecular descriptors, RED (Renyi entropy descriptors), based on the generalized entropies introduced by Renyi are presented. Topological descriptors based on molecular features have proven to be useful for describing molecular profiles. Renyi entropy is used as a variability measure to contract a feature-pair dis...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Delgado-Soler L,Toral R,Tomás MS,Rubio-Martinez J

    update_date: 2009-11-01 00:00:00

  • Direct Observation of β-Barrel Intermediates in the Self-Assembly of Toxic SOD1 28-38 and Absence in Nontoxic Glycine Mutants.

    abstract::Soluble low-molecular-weight oligomers formed during the early stage of amyloid aggregation are considered the major toxic species in amyloidosis. The structure-function relationship between oligomeric assemblies and the cytotoxicity in amyloid diseases are still elusive due to the heterogeneous and transient nature o...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Sun Y,Huang J,Duan X,Ding F

    update_date: 2021-01-14 00:00:00

  • Computational and conformational evaluation of FTase alternative substrates: insight into a novel enzyme binding pocket.

    abstract::Protein farnesyltransferase (FTase) is an important anticancer drug target. In an effort to develop isoprenoid diphosphate-based FTase inhibitors, striking variations have been observed in the ability of conservatively modified analogues to bind to the enzyme. For example, 2Z-GGPP is an alternative substrate with high...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Henriksen BS,Zahn TJ,Evanseck JD,Firestine SM,Gibbs RA

    update_date: 2005-07-01 00:00:00

  • Pharmer: efficient and exact pharmacophore search.

    abstract::Pharmacophore search is a key component of many drug discovery efforts. Pharmer is a new computational approach to pharmacophore search that scales with the breadth and complexity of the query, not the size of the compound library being screened. Two novel methods for organizing pharmacophore data, the Pharmer KDB-tre...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Koes DR,Camacho CJ

    update_date: 2011-06-27 00:00:00

  • H274Y's Effect on Oseltamivir Resistance: What Happens Before the Drug Enters the Binding Site.

    abstract::Increased reports of oseltamivir (OTV)-resistant strains of the influenza virus, such as the H274Y mutation on its neuraminidase (NA), have created some cause for concern. Many studies have been conducted in the attempt to uncover the mechanism of OTV resistance in H274Y NA. However, most of the reported studies on H2...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Yusuf M,Mohamed N,Mohamad S,Janezic D,Damodaran KV,Wahab HA

    update_date: 2016-01-25 00:00:00

  • Imputation of Assay Bioactivity Data Using Deep Learning.

    abstract::We describe a novel deep learning neural network method and its application to impute assay pIC50 values. Unlike conventional machine learning approaches, this method is trained on sparse bioactivity data as input, typical of that found in public and commercial databases, enabling it to learn directly from correlation...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Whitehead TM,Irwin BWJ,Hunt P,Segall MD,Conduit GJ

    update_date: 2019-03-25 00:00:00

  • Combinatorial × computational × cheminformatics (C3) approach to characterization of congeneric libraries of organic pollutants.

    abstract::Congeners are molecules based on the same carbon skeleton but are different by the number of substituents and/or a substitution pattern. Examples are 1-chloronaphthalene, 1,4-dichloronaphthalene, and 1,3,8-trichloronaphthalene. Various persistent organic pollutants (POPs) exist in the environment as families of congen...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Haranczyk M,Urbaszek P,Ng EG,Puzyn T

    update_date: 2012-11-26 00:00:00

  • Multitarget structure-activity relationships characterized by activity-difference maps and consensus similarity measure.

    abstract::Dual and triple activity-difference (DAD/TAD) maps are tools for the systematic characterization of structure-activity relationships (SAR) of compound data sets screened against two or three targets. DAD and TAD maps are two- and three- dimensional representations of the pairwise activity differences of compound data ...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Medina-Franco JL,Yongye AB,Pérez-Villanueva J,Houghten RA,Martínez-Mayorga K

    update_date: 2011-09-26 00:00:00

  • Consensus QSAR models: do the benefits outweigh the complexity?

    abstract::This study has assessed the use of consensus regression, as compared to single multiple linear regression, models for the development of quantitative structure-activity relationships (QSARs). To provide a comparison, four data sets of varying size and complexity were analyzed: silastic membrane flux, toxicity of pheno...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Hewitt M,Cronin MT,Madden JC,Rowe PH,Johnson C,Obi A,Enoch SJ

    update_date: 2007-07-01 00:00:00

  • PythoMS: A Python Framework To Simplify and Assist in the Processing and Interpretation of Mass Spectrometric Data.

    abstract::Mass spectrometric data are copious and generate a processing burden that is best dealt with programmatically. PythoMS is a collection of tools based on the Python programming language that assist researchers in creating figures and video output that is informative, clear, and visually compelling. The PythoMS framewor...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Yunker LPE,Donnecke S,Ting M,Yeung D,McIndoe JS

    update_date: 2019-04-22 00:00:00

  • 3D-QSAR and docking studies of selective GSK-3beta inhibitors. Comparison with a thieno[2,3-b]pyrrolizinone derivative, a new potential lead for GSK-3beta ligands.

    abstract::The three-dimensional structures of 3-anilino-4-arylmaleimides, selective GSK-3beta inhibitors, were correlated to their biological affinities by 3D-QSAR studies (CoMFA method). The cocrystallographic data of GSK-3beta vs 3-anilino-4-arylmaleimide allowed us to compare 3D-QSAR results to experimental intermolecular in...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Lescot E,Bureau R,Sopkova-de Oliveira Santos J,Rochais C,Lisowski V,Lancelot JC,Rault S

    update_date: 2005-05-01 00:00:00

  • Ranking chemical structures for drug discovery: a new machine learning approach.

    abstract::With chemical libraries increasingly containing millions of compounds or more, there is a fast-growing need for computational methods that can rank or prioritize compounds for screening. Machine learning methods have shown considerable promise for this task; indeed, classification methods such as support vector machin...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Agarwal S,Dugar D,Sengupta S

    update_date: 2010-05-24 00:00:00

  • Comparative modeling and benchmarking data sets for human histone deacetylases and sirtuin families.

    abstract::Histone deacetylases (HDACs) are an important class of drug targets for the treatment of cancers, neurodegenerative diseases, and other types of diseases. Virtual screening (VS) has become fairly effective approaches for drug discovery of novel and highly selective histone deacetylase inhibitors (HDACIs). To facilitat...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Xia J,Tilahun EL,Kebede EH,Reid TE,Zhang L,Wang XS

    update_date: 2015-02-23 00:00:00

  • SMIfp (SMILES fingerprint) chemical space for virtual screening and visualization of large databases of organic molecules.

    abstract::SMIfp (SMILES fingerprint) is defined here as a scalar fingerprint describing organic molecules by counting the occurrences of 34 different symbols in their SMILES strings, which creates a 34-dimensional chemical space. Ligand-based virtual screening using the city-block distance CBD(SMIfp) as similarity measure provi...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Schwartz J,Awale M,Reymond JL

    update_date: 2013-08-26 00:00:00

  • Holistic Approach to Partial Covalent Interactions in Protein Structure Prediction and Design with Rosetta.

    abstract::Partial covalent interactions (PCIs) in proteins, which include hydrogen bonds, salt bridges, cation-π, and π-π interactions, contribute to thermodynamic stability and facilitate interactions with other biomolecules. Several score functions have been developed within the Rosetta protein modeling framework that identif...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Combs SA,Mueller BK,Meiler J

    update_date: 2018-05-29 00:00:00

  • Viscosity Prediction of Lubricants by a General Feed-Forward Neural Network.

    abstract::Modern industrial lubricants are often blended with an assortment of chemical additives to improve the performance of the base stock. Machine learning-based predictive models allow fast and veracious derivation of material properties and facilitate novel and innovative material designs. In this study, we outline the d...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Loh GC,Lee HC,Tee XY,Chow PS,Zheng JW

    update_date: 2020-03-23 00:00:00

  • 3D QSAR methods: Phase and Catalyst compared.

    abstract::The programs Phase and Catalyst HypoGen are compared for their performance in determining three-dimensional quantitative structure-activity relationships. Eight sets of compounds with measured activity were collected from the public literature and partitioned into suitable training and test sets by an automated proced...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Evans DA,Doman TN,Thorner DA,Bodkin MJ

    update_date: 2007-05-01 00:00:00

  • Comparative Binding Analysis of N-Acetylneuraminic Acid in Bovine Serum Albumin and Human α-1 Acid Glycoprotein.

    abstract::The present study focuses on the determination of the biologically significant N-acetylneuraminic acid (NANA) drug binding interaction mechanism between bovine serum albumin (BSA) and human α-1 acid glycoprotein (HAG) using various optical spectroscopy and computational methods. The steady state fluorescence spectrosc...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Karthikeyan S,Bharanidharan G,Ragavan S,Kandasamy S,Chinnathambi S,Udayakumar K,Mangaiyarkarasi R,Sundaramoorthy A,Aruna P,Ganesan S

    update_date: 2019-01-28 00:00:00

  • Sharing Data from Molecular Simulations.

    abstract::Given the need for modern researchers to produce open, reproducible scientific output, the lack of standards and best practices for sharing data and workflows used to produce and analyze molecular dynamics (MD) simulations has become an important issue in the field. There are now multiple well-established packages to ...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article


    authors: Abraham M,Apostolov R,Barnoud J,Bauer P,Blau C,Bonvin AMJJ,Chavent M,Chodera J,Čondić-Jurkić K,Delemotte L,Grubmüller H,Howard RJ,Jordan EJ,Lindahl E,Ollila OHS,Selent J,Smith DGA,Stansfeld PJ,Tiemann JKS,Trellet M

    update_date: 2019-10-28 00:00:00