Property-Unmatched Decoys in Docking Benchmarks.

Abstract:

:Enrichment of ligands versus property-matched decoys is widely used to test and optimize docking library screens. However, the unconstrained optimization of enrichment alone can mislead, leading to false confidence in prospective performance. This can arise by over-optimizing for enrichment against property-matched decoys, without considering the full spectrum of molecules to be found in a true large library screen. Adding decoys representing charge extrema helps mitigate over-optimizing for electrostatic interactions. Adding decoys that represent the overall characteristics of the library to be docked allows one to sample molecules not represented by ligands and property-matched decoys but that one will encounter in a prospective screen. An optimized version of the DUD-E set (DUDE-Z), as well as Extrema and sets representing broad features of the library (Goldilocks), is developed here. We also explore the variability that one can encounter in enrichment calculations and how that can temper one's confidence in small enrichment differences. The new tools and new decoy sets are freely available at http://tldr.docking.org and http://dudez.docking.org.

journal_name

J Chem Inf Model

authors

Stein RM,Yang Y,Balius TE,O'Meara MJ,Lyu J,Young J,Tang K,Shoichet BK,Irwin JJ

doi

10.1021/acs.jcim.0c00598

subject

Has Abstract

pub_date

2021-01-25 00:00:00

eissn

1549-9596

issn

1549-960X

pub_type

杂志文章
  • Discovery of Inhibitors of Four Bromodomains by Fragment-Anchored Ligand Docking.

    abstract::The high-throughput docking protocol called ALTA-VS (anchor-based library tailoring approach for virtual screening) was developed in 2005 for the efficient in silico screening of large libraries of compounds by preselection of only those molecules that have optimal fragments (anchors) for the protein target. Here we p...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00336

    authors: Marchand JR,Dalle Vedove A,Lolli G,Caflisch A

    更新日期:2017-10-23 00:00:00

  • How do metabolites differ from their parent molecules and how are they excreted?

    abstract::Understanding which physicochemical properties, or property distributions, are favorable for successful design and development of drugs, nutritional supplements, cosmetics, and agrochemicals is of great importance. In this study we have analyzed molecules from three distinct chemical spaces (i) approved drugs, (ii) hu...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci300487z

    authors: Kirchmair J,Howlett A,Peironcely JE,Murrell DS,Williamson MJ,Adams SE,Hankemeier T,van Buren L,Duchateau G,Klaffke W,Glen RC

    更新日期:2013-02-25 00:00:00

  • Facile Solutions to the Problems Associated with Chemical Information and Mathematical Symbolism While Using Machine Translation Tools.

    abstract::Advances in computer-aided translation technology have made tremendous progress in accuracy in the past few years. Chemical Abstracts Service of the American Chemical Society summarizes scientific works from more than 50 languages and allows the users to search papers in nine selected languages. Currently, only the ab...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00274

    authors: Wahab MF,Zulfiqar S,Sarwar MI,Lieberwirth I

    更新日期:2020-07-27 00:00:00

  • Benchmark Sets for Binding Hot Spot Identification in Fragment-Based Ligand Discovery.

    abstract::Binding hot spots are regions of proteins that, due to their potentially high contribution to the binding free energy, have high propensity to bind small molecules. We present benchmark sets for testing computational methods for the identification of binding hot spots with emphasis on fragment-based ligand discovery. ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00877

    authors: Wakefield AE,Yueh C,Beglov D,Castilho MS,Kozakov D,Keserű GM,Whitty A,Vajda S

    更新日期:2020-12-28 00:00:00

  • A Grid Map Based Approach to Identify Nonobvious Ligand Design Opportunities in 3D Protein Structure Ensembles.

    abstract::Three-dimensional protein structures are a key requisite for structure-based drug discovery. For many highly relevant targets, medicinal chemists are confronted with large numbers of target structures in their apo-forms or in complex with a wealth of different ligands. To exploit the full potential of such structure e...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00051

    authors: Schmalhorst PS,Bergner A

    更新日期:2020-04-27 00:00:00

  • Computational simulations of the interactions between acetyl-coenzyme-A carboxylase and clodinafop: resistance mechanism due to active and nonactive site mutations.

    abstract::Grass weed populations resistant to acetyl-CoA carboxylase-inhibiting (ACCase; EC 6.4.1.2) herbicides represent a major problem for the sustainable development of modern agriculture. In the present study, extensive computational simulations, including homology modeling, molecular dynamics (MD) simulations, and molecul...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci900174d

    authors: Zhu XL,Ge-Fei H,Zhan CG,Yang GF

    更新日期:2009-08-01 00:00:00

  • In silico analysis of the thermodynamic stability changes of psychrophilic and mesophilic alpha-amylases upon exhaustive single-site mutations.

    abstract::Identifying sequence modifications that distinguish psychrophilic from mesophilic proteins is important for designing enzymes with different thermodynamic stabilities and to understand the underlying mechanisms. The PoPMuSiC algorithm is used to introduce, in silico, all the single-site mutations in four mesophilic an...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci050473v

    authors: Gilis D

    更新日期:2006-05-01 00:00:00

  • Statistical confidence for variable selection in QSAR models via Monte Carlo cross-validation.

    abstract::A new variable selection wrapper method named the Monte Carlo variable selection (MCVS) method was developed utilizing the framework of the Monte Carlo cross-validation (MCCV) approach. The MCVS method reports the variable selection results in the most conventional and common measure of statistical hypothesis testing,...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci700283s

    authors: Konovalov DA,Sim N,Deconinck E,Vander Heyden Y,Coomans D

    更新日期:2008-02-01 00:00:00

  • Protein structural statistics with PSS.

    abstract::Characterizing the variability within an ensemble of protein structures is a common requirement in structural biology and bioinformatics. With the increasing number of protein structures becoming available, there is a need for new tools capable of automating the structural comparison of large ensemble of structures. W...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci400233j

    authors: Gaillard T,Schwarz BB,Chebaro Y,Stote RH,Dejaegere A

    更新日期:2013-09-23 00:00:00

  • New serotonin 5-HT(6) ligands from common feature pharmacophore hypotheses.

    abstract::Serotonin 5-HT6 receptor antagonists are thought to play an important role in the treatment of psychiatry, Alzheimer's disease, and probably obesity. To find novel and potent 5-HT6 antagonists and to provide a new idea for drug design, we used a ligand-based pharmacophore to perform the virtual screening of a commerci...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci700160t

    authors: Kim HJ,Doddareddy MR,Choo H,Cho YS,No KT,Park WK,Pae AN

    更新日期:2008-01-01 00:00:00

  • Technique for energy decomposition in the study of "receptor-ligand" complexes.

    abstract::A new methodology to describe the interactions in "receptor-ligand" complexes is presented. The methodology is based on a combination of the 3D/4D QSAR BiS/MC and CoCon algorithms. The first algorithm performs the restricted docking of compounds to receptor pockets. The second determines the relationships between the ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci800405n

    authors: Potemkin VA,Pogrebnoy AA,Grishina MA

    更新日期:2009-06-01 00:00:00

  • Computational Insight Into the Mechanism of SARS-CoV-2 Membrane Fusion.

    abstract::Membrane fusion, a key step in the early stages of virus propagation, allows the release of the viral genome in the host cell cytoplasm. The process is initiated by fusion peptides that are small, hydrophobic components of viral membrane-embedded glycoproteins and are typically conserved within virus families. Here, w...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c01231

    authors: Borkotoky S,Dey D,Banerjee M

    更新日期:2021-01-25 00:00:00

  • Two model system of the alpha1A-adrenoceptor docked with selected ligands.

    abstract::In this study, we have developed a two model system to mimic the active and inactive states of a G-protein coupled receptor specifically the alpha1A adrenergic receptor. We have docked two agonists, epinephrine (phenylamine type) and oxymetazoline (imidazoline type), as well as two antagonists, prazosin and 5-methylur...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci700026v

    authors: Asher WB,Hoskins SN,Slasor LA,Morris DH,Cook EM,Bautista DL

    更新日期:2007-09-01 00:00:00

  • Exploration of Interfacial Hydration Networks of Target-Ligand Complexes.

    abstract::Interfacial hydration strongly influences interactions between biomolecules. For example, drug-target complexes are often stabilized by hydration networks formed between hydrophilic residues and water molecules at the interface. Exhaustive exploration of hydration networks is challenging for experimental as well as th...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.5b00638

    authors: Jeszenői N,Bálint M,Horváth I,van der Spoel D,Hetényi C

    更新日期:2016-01-25 00:00:00

  • Systematics of high-genus fullerenes.

    abstract::In this article, we present a systematic way to classify a family of high-genus fullerenes (HGFs) by decomposing them into two types of necklike structures, which are the negatively curved parts of parent toroidal carbon nanotubes. By replacing the faces of a uniform polyhedron with these necks, an HGF polyhedron corr...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci9001124

    authors: Chuang C,Jin BY

    更新日期:2009-07-01 00:00:00

  • Time-Domain Analysis of Molecular Dynamics Trajectories Using Deep Neural Networks: Application to Activity Ranking of Tankyrase Inhibitors.

    abstract::Molecular dynamics simulations provide valuable insights into the behavior of molecular systems. Extending the recent trend of using machine learning techniques to predict physicochemical properties from molecular dynamics data, we propose to consider the trajectories as multidimensional time series represented by 2D ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00135

    authors: Berishvili VP,Perkin VO,Voronkov AE,Radchenko EV,Syed R,Venkata Ramana Reddy C,Pillay V,Kumar P,Choonara YE,Kamal A,Palyulin VA

    更新日期:2019-08-26 00:00:00

  • Heteroaromatic π-stacking energy landscapes.

    abstract::In this study we investigate π-stacking interactions of a variety of aromatic heterocycles with benzene using dispersion corrected density functional theory. We calculate extensive potential energy surfaces for parallel-displaced interaction geometries. We find that dispersion contributes significantly to the interact...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci500183u

    authors: Huber RG,Margreiter MA,Fuchs JE,von Grafenstein S,Tautermann CS,Liedl KR,Fox T

    更新日期:2014-05-27 00:00:00

  • Discovery of wild-type and Y181C mutant non-nucleoside HIV-1 reverse transcriptase inhibitors using virtual screening with multiple protein structures.

    abstract::To discover non-nucleoside inhibitors of HIV-1 reverse transcriptase (NNRTIs) that are effective against both wild-type (WT) virus and variants that encode the clinically troublesome Tyr181Cys (Y181C) RT mutation, virtual screening by docking was carried out using three RT structures and more than 2 million commercial...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci900068k

    authors: Nichols SE,Domaoal RA,Thakur VV,Tirado-Rives J,Anderson KS,Jorgensen WL

    更新日期:2009-05-01 00:00:00

  • Evaluating Unexpectedly Short Non-covalent Distances in X-ray Crystal Structures of Proteins with Electronic Structure Analysis.

    abstract::We investigate unexpectedly short non-covalent distances (<85% of the sum of van der Waals radii) in X-ray crystal structures of proteins. We curate over 11 000 high-quality protein crystal structures and an ultra-high-resolution (1.2 Å or better) subset containing >900 structures. Although our non-covalent distance c...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00144

    authors: Qi HW,Kulik HJ

    更新日期:2019-05-28 00:00:00

  • Equally Weighted Multiscale Elastic Network Model and Its Comparison with Traditional and Parameter-Free Models.

    abstract::Dynamical properties of proteins play an essential role in their function exertion. The elastic network model (ENM) is an effective and efficient tool in characterizing the intrinsic dynamical properties encoded in biomacromolecule structures. The Gaussian network model (GNM) and anisotropic network model (ANM) are th...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c01178

    authors: Gong W,Liu Y,Zhao Y,Wang S,Han Z,Li C

    更新日期:2021-01-26 00:00:00

  • Structure-based CoMFA as a predictive model - CYP2C9 inhibitors as a test case.

    abstract::In this study, we tried to establish a general scheme to create a model that could predict the affinity of small compounds to their target proteins. This scheme consists of a search for ligand-binding sites on a protein, a generation of bound conformations (poses) of ligands in each of the sites by docking, identifica...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci800313h

    authors: Yasuo K,Yamaotsu N,Gouda H,Tsujishita H,Hirono S

    更新日期:2009-04-01 00:00:00

  • BCL::MolAlign: Three-Dimensional Small Molecule Alignment for Pharmacophore Mapping.

    abstract::Small molecule flexible alignment is a critical component of both ligand- and structure-based methods in computer-aided drug discovery. Despite its importance, the availability of high-quality flexible alignment software packages is limited. Here, we present BCL::MolAlign, a freely available property-based molecular a...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00020

    authors: Brown BP,Mendenhall J,Meiler J

    更新日期:2019-02-25 00:00:00

  • Tautomer Standardization in Chemical Databases: Deriving Business Rules from Quantum Chemistry.

    abstract::Databases of small, potentially bioactive molecules are ubiquitous across the industry and academia. Designed such that each unique compound should appear only once, the multiplicity of ways in which many compounds can be represented means that these databases require methods for standardizing the representation of ch...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00232

    authors: Baker CM,Kidley NJ,Papachristos K,Hotson M,Carson R,Gravestock D,Pouliot M,Harrison J,Dowling A

    更新日期:2020-08-24 00:00:00

  • Criterion for evaluating the predictive ability of nonlinear regression models without cross-validation.

    abstract::We propose predictive performance criteria for nonlinear regression models without cross-validation. The proposed criteria are the determination coefficient and the root-mean-square error for the midpoints between k-nearest-neighbor data points. These criteria can be used to evaluate predictive ability after the regre...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci4003766

    authors: Kaneko H,Funatsu K

    更新日期:2013-09-23 00:00:00

  • Regulation of JAK2 activation by Janus homology 2: evidence from molecular dynamics simulations.

    abstract::Janus kinase 2 (JAK2) is a protein tyrosine kinase implicated in signaling by specific members of the cytokine receptor family. Although it has been established that the JAK2 tyrosine kinase is negatively regulated by the JAK homology 2 (JH2) pseudokinase domain, the underlying mechanism of JH2 mediated regulation rem...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci300308g

    authors: Wan S,Coveney PV

    更新日期:2012-11-26 00:00:00

  • Simulation-Based Algorithm for Two-Dimensional Chemical Structure Diagram Generation of Complex Molecules and Ligand-Protein Interactions.

    abstract::Computer programs for structure diagram generation (SDG) are indispensable cheminformatic tools that translate one- or three-dimensional (1D or 3D) chemical structure data stored in electronic formats to human-readable 2D depictions. Although many such programs are known, only a moderate part of chemical space can be ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.6b00391

    authors: Frączek T

    更新日期:2016-12-27 00:00:00

  • Molecular Oxygen Binding in the Mitochondrial Electron Transfer Flavoprotein.

    abstract::Reactive oxygen species such as superoxide are potentially harmful byproducts of the aerobic metabolism in the inner mitochondrial membrane, and complexes I, II, III of the electron transport chain have been identified as primary sources. The mitochondrial fatty acid b-oxidation pathway may also play a yet uncharacter...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00702

    authors: Husen P,Nielsen C,Martino CF,Solov'yov IA

    更新日期:2019-11-25 00:00:00

  • Supervised self-organizing maps in drug discovery. 2. Improvements in descriptor selection and model validation.

    abstract::The modeling of nonlinear descriptor-target relationships is a topic of considerable interest in drug discovery. We, herein, continue reporting the use of the self-organizing map-a nonlinear, topology-preserving pattern recognition technique that exhibits considerable promise in modeling and decoding these relationshi...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci0500841

    authors: Xiao YD,Harris R,Bayram E,Ii PS,Schmitt JD

    更新日期:2006-01-01 00:00:00

  • Prediction of pH-dependent aqueous solubility of druglike molecules.

    abstract::In the present work, the Henderson-Hasselbalch (HH) equation has been employed for the development of a tool for the prediction of pH-dependent aqueous solubility of drugs and drug candidates. A new prediction method for the intrinsic solubility was developed, based on artificial neural networks that have been trained...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci600292q

    authors: Hansen NT,Kouskoumvekaki I,Jørgensen FS,Brunak S,Jónsdóttir SO

    更新日期:2006-11-01 00:00:00

  • ColBioS-FlavRC: a collection of bioselective flavonoids and related compounds filtered from high-throughput screening outcomes.

    abstract::Flavonoids, the vastest class of natural polyphenols, are extensively investigated for their multiple benefits on human health. Due to their physicochemical or biological properties, many representatives are considered to exhibit low selectivity among various protein targets or to plague high-throughput screening (HTS...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci5002668

    authors: Avram SI,Pacureanu LM,Bora A,Crisan L,Avram S,Kurunczi L

    更新日期:2014-08-25 00:00:00