Abstract:
:Enrichment of ligands versus property-matched decoys is widely used to test and optimize docking library screens. However, the unconstrained optimization of enrichment alone can mislead, leading to false confidence in prospective performance. This can arise by over-optimizing for enrichment against property-matched decoys, without considering the full spectrum of molecules to be found in a true large library screen. Adding decoys representing charge extrema helps mitigate over-optimizing for electrostatic interactions. Adding decoys that represent the overall characteristics of the library to be docked allows one to sample molecules not represented by ligands and property-matched decoys but that one will encounter in a prospective screen. An optimized version of the DUD-E set (DUDE-Z), as well as Extrema and sets representing broad features of the library (Goldilocks), is developed here. We also explore the variability that one can encounter in enrichment calculations and how that can temper one's confidence in small enrichment differences. The new tools and new decoy sets are freely available at http://tldr.docking.org and http://dudez.docking.org.
journal_name
J Chem Inf Modeljournal_title
Journal of chemical information and modelingauthors
Stein RM,Yang Y,Balius TE,O'Meara MJ,Lyu J,Young J,Tang K,Shoichet BK,Irwin JJdoi
10.1021/acs.jcim.0c00598subject
Has Abstractpub_date
2021-01-25 00:00:00eissn
1549-9596issn
1549-960Xpub_type
杂志文章abstract::The high-throughput docking protocol called ALTA-VS (anchor-based library tailoring approach for virtual screening) was developed in 2005 for the efficient in silico screening of large libraries of compounds by preselection of only those molecules that have optimal fragments (anchors) for the protein target. Here we p...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.7b00336
更新日期:2017-10-23 00:00:00
abstract::Understanding which physicochemical properties, or property distributions, are favorable for successful design and development of drugs, nutritional supplements, cosmetics, and agrochemicals is of great importance. In this study we have analyzed molecules from three distinct chemical spaces (i) approved drugs, (ii) hu...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci300487z
更新日期:2013-02-25 00:00:00
abstract::Advances in computer-aided translation technology have made tremendous progress in accuracy in the past few years. Chemical Abstracts Service of the American Chemical Society summarizes scientific works from more than 50 languages and allows the users to search papers in nine selected languages. Currently, only the ab...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.0c00274
更新日期:2020-07-27 00:00:00
abstract::Binding hot spots are regions of proteins that, due to their potentially high contribution to the binding free energy, have high propensity to bind small molecules. We present benchmark sets for testing computational methods for the identification of binding hot spots with emphasis on fragment-based ligand discovery. ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.0c00877
更新日期:2020-12-28 00:00:00
abstract::Three-dimensional protein structures are a key requisite for structure-based drug discovery. For many highly relevant targets, medicinal chemists are confronted with large numbers of target structures in their apo-forms or in complex with a wealth of different ligands. To exploit the full potential of such structure e...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.0c00051
更新日期:2020-04-27 00:00:00
abstract::Grass weed populations resistant to acetyl-CoA carboxylase-inhibiting (ACCase; EC 6.4.1.2) herbicides represent a major problem for the sustainable development of modern agriculture. In the present study, extensive computational simulations, including homology modeling, molecular dynamics (MD) simulations, and molecul...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci900174d
更新日期:2009-08-01 00:00:00
abstract::Identifying sequence modifications that distinguish psychrophilic from mesophilic proteins is important for designing enzymes with different thermodynamic stabilities and to understand the underlying mechanisms. The PoPMuSiC algorithm is used to introduce, in silico, all the single-site mutations in four mesophilic an...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci050473v
更新日期:2006-05-01 00:00:00
abstract::A new variable selection wrapper method named the Monte Carlo variable selection (MCVS) method was developed utilizing the framework of the Monte Carlo cross-validation (MCCV) approach. The MCVS method reports the variable selection results in the most conventional and common measure of statistical hypothesis testing,...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci700283s
更新日期:2008-02-01 00:00:00
abstract::Characterizing the variability within an ensemble of protein structures is a common requirement in structural biology and bioinformatics. With the increasing number of protein structures becoming available, there is a need for new tools capable of automating the structural comparison of large ensemble of structures. W...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci400233j
更新日期:2013-09-23 00:00:00
abstract::Serotonin 5-HT6 receptor antagonists are thought to play an important role in the treatment of psychiatry, Alzheimer's disease, and probably obesity. To find novel and potent 5-HT6 antagonists and to provide a new idea for drug design, we used a ligand-based pharmacophore to perform the virtual screening of a commerci...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci700160t
更新日期:2008-01-01 00:00:00
abstract::A new methodology to describe the interactions in "receptor-ligand" complexes is presented. The methodology is based on a combination of the 3D/4D QSAR BiS/MC and CoCon algorithms. The first algorithm performs the restricted docking of compounds to receptor pockets. The second determines the relationships between the ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci800405n
更新日期:2009-06-01 00:00:00
abstract::Membrane fusion, a key step in the early stages of virus propagation, allows the release of the viral genome in the host cell cytoplasm. The process is initiated by fusion peptides that are small, hydrophobic components of viral membrane-embedded glycoproteins and are typically conserved within virus families. Here, w...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.0c01231
更新日期:2021-01-25 00:00:00
abstract::In this study, we have developed a two model system to mimic the active and inactive states of a G-protein coupled receptor specifically the alpha1A adrenergic receptor. We have docked two agonists, epinephrine (phenylamine type) and oxymetazoline (imidazoline type), as well as two antagonists, prazosin and 5-methylur...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci700026v
更新日期:2007-09-01 00:00:00
abstract::Interfacial hydration strongly influences interactions between biomolecules. For example, drug-target complexes are often stabilized by hydration networks formed between hydrophilic residues and water molecules at the interface. Exhaustive exploration of hydration networks is challenging for experimental as well as th...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.5b00638
更新日期:2016-01-25 00:00:00
abstract::In this article, we present a systematic way to classify a family of high-genus fullerenes (HGFs) by decomposing them into two types of necklike structures, which are the negatively curved parts of parent toroidal carbon nanotubes. By replacing the faces of a uniform polyhedron with these necks, an HGF polyhedron corr...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci9001124
更新日期:2009-07-01 00:00:00
abstract::Molecular dynamics simulations provide valuable insights into the behavior of molecular systems. Extending the recent trend of using machine learning techniques to predict physicochemical properties from molecular dynamics data, we propose to consider the trajectories as multidimensional time series represented by 2D ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00135
更新日期:2019-08-26 00:00:00
abstract::In this study we investigate π-stacking interactions of a variety of aromatic heterocycles with benzene using dispersion corrected density functional theory. We calculate extensive potential energy surfaces for parallel-displaced interaction geometries. We find that dispersion contributes significantly to the interact...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci500183u
更新日期:2014-05-27 00:00:00
abstract::To discover non-nucleoside inhibitors of HIV-1 reverse transcriptase (NNRTIs) that are effective against both wild-type (WT) virus and variants that encode the clinically troublesome Tyr181Cys (Y181C) RT mutation, virtual screening by docking was carried out using three RT structures and more than 2 million commercial...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci900068k
更新日期:2009-05-01 00:00:00
abstract::We investigate unexpectedly short non-covalent distances (<85% of the sum of van der Waals radii) in X-ray crystal structures of proteins. We curate over 11 000 high-quality protein crystal structures and an ultra-high-resolution (1.2 Å or better) subset containing >900 structures. Although our non-covalent distance c...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00144
更新日期:2019-05-28 00:00:00
abstract::Dynamical properties of proteins play an essential role in their function exertion. The elastic network model (ENM) is an effective and efficient tool in characterizing the intrinsic dynamical properties encoded in biomacromolecule structures. The Gaussian network model (GNM) and anisotropic network model (ANM) are th...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.0c01178
更新日期:2021-01-26 00:00:00
abstract::In this study, we tried to establish a general scheme to create a model that could predict the affinity of small compounds to their target proteins. This scheme consists of a search for ligand-binding sites on a protein, a generation of bound conformations (poses) of ligands in each of the sites by docking, identifica...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci800313h
更新日期:2009-04-01 00:00:00
abstract::Small molecule flexible alignment is a critical component of both ligand- and structure-based methods in computer-aided drug discovery. Despite its importance, the availability of high-quality flexible alignment software packages is limited. Here, we present BCL::MolAlign, a freely available property-based molecular a...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00020
更新日期:2019-02-25 00:00:00
abstract::Databases of small, potentially bioactive molecules are ubiquitous across the industry and academia. Designed such that each unique compound should appear only once, the multiplicity of ways in which many compounds can be represented means that these databases require methods for standardizing the representation of ch...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.0c00232
更新日期:2020-08-24 00:00:00
abstract::We propose predictive performance criteria for nonlinear regression models without cross-validation. The proposed criteria are the determination coefficient and the root-mean-square error for the midpoints between k-nearest-neighbor data points. These criteria can be used to evaluate predictive ability after the regre...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci4003766
更新日期:2013-09-23 00:00:00
abstract::Janus kinase 2 (JAK2) is a protein tyrosine kinase implicated in signaling by specific members of the cytokine receptor family. Although it has been established that the JAK2 tyrosine kinase is negatively regulated by the JAK homology 2 (JH2) pseudokinase domain, the underlying mechanism of JH2 mediated regulation rem...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci300308g
更新日期:2012-11-26 00:00:00
abstract::Computer programs for structure diagram generation (SDG) are indispensable cheminformatic tools that translate one- or three-dimensional (1D or 3D) chemical structure data stored in electronic formats to human-readable 2D depictions. Although many such programs are known, only a moderate part of chemical space can be ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.6b00391
更新日期:2016-12-27 00:00:00
abstract::Reactive oxygen species such as superoxide are potentially harmful byproducts of the aerobic metabolism in the inner mitochondrial membrane, and complexes I, II, III of the electron transport chain have been identified as primary sources. The mitochondrial fatty acid b-oxidation pathway may also play a yet uncharacter...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00702
更新日期:2019-11-25 00:00:00
abstract::The modeling of nonlinear descriptor-target relationships is a topic of considerable interest in drug discovery. We, herein, continue reporting the use of the self-organizing map-a nonlinear, topology-preserving pattern recognition technique that exhibits considerable promise in modeling and decoding these relationshi...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci0500841
更新日期:2006-01-01 00:00:00
abstract::In the present work, the Henderson-Hasselbalch (HH) equation has been employed for the development of a tool for the prediction of pH-dependent aqueous solubility of drugs and drug candidates. A new prediction method for the intrinsic solubility was developed, based on artificial neural networks that have been trained...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci600292q
更新日期:2006-11-01 00:00:00
abstract::Flavonoids, the vastest class of natural polyphenols, are extensively investigated for their multiple benefits on human health. Due to their physicochemical or biological properties, many representatives are considered to exhibit low selectivity among various protein targets or to plague high-throughput screening (HTS...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci5002668
更新日期:2014-08-25 00:00:00