Abstract:
:This work is aimed at describing the workflow for a methodology that combines chemoinformatics and pharmacoepidemiology methods and at reporting the first predictive model developed with this methodology. The new model is able to predict complex networks of AIDS prevalence in the US counties, taking into consideration the social determinants and activity/structure of anti-HIV drugs in preclinical assays. We trained different Artificial Neural Networks (ANNs) using as input information indices of social networks and molecular graphs. We used a Shannon information index based on the Gini coefficient to quantify the effect of income inequality in the social network. We obtained the data on AIDS prevalence and the Gini coefficient from the AIDSVu database of Emory University. We also used the Balaban information indices to quantify changes in the chemical structure of anti-HIV drugs. We obtained the data on anti-HIV drug activity and structure (SMILE codes) from the ChEMBL database. Last, we used Box-Jenkins moving average operators to quantify information about the deviations of drugs with respect to data subsets of reference (targets, organisms, experimental parameters, protocols). The best model found was a Linear Neural Network (LNN) with values of Accuracy, Specificity, and Sensitivity above 0.76 and AUROC > 0.80 in training and external validation series. This model generates a complex network of AIDS prevalence in the US at county level with respect to the preclinical activity of anti-HIV drugs in preclinical assays. To train/validate the model and predict the complex network we needed to analyze 43,249 data points including values of AIDS prevalence in 2,310 counties in the US vs ChEMBL results for 21,582 unique drugs, 9 viral or human protein targets, 4,856 protocols, and 10 possible experimental measures.
journal_name
J Chem Inf Modeljournal_title
Journal of chemical information and modelingauthors
González-Díaz H,Herrera-Ibatá DM,Duardo-Sánchez A,Munteanu CR,Orbegozo-Medina RA,Pazos Adoi
10.1021/ci400716ysubject
Has Abstractpub_date
2014-03-24 00:00:00pages
744-55issue
3eissn
1549-9596issn
1549-960Xjournal_volume
54pub_type
杂志文章abstract::Due to the importance of hot-spots (HS) detection and the efficiency of computational methodologies, several HS detecting approaches have been developed. The current paper presents new models to predict HS for protein-protein and protein-nucleic acid interactions with better statistics compared with the ones currently...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci500760m
更新日期:2015-05-26 00:00:00
abstract::Most physiological effects of thyroid hormones are mediated by the two thyroid hormone receptor subtypes, TRalpha and TRbeta. Several pharmacological effects mediated by TRbeta might be beneficial in important medical conditions such as obesity, hypercholesterolemia and diabetes, and selective TRbeta activation may el...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci900316e
更新日期:2009-11-01 00:00:00
abstract::A new structure classification scheme for biopolymers is introduced, which is solely based on main-chain dihedral angles. It is shown that by dividing a biopolymer into segments containing two central residues, a local classification can be performed. The method is referred to as DISICL, short for Dihedral-based Segme...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci400541d
更新日期:2014-01-27 00:00:00
abstract::An index of the activation of Class A G-protein-coupled receptors (GPCRs) has been trained using interhelix distances from a series of microsecond molecular-dynamics simulations and tested for 268 published X-ray structures. In a three-class model that includes intermediate structures, 63% of the active structures are...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00604
更新日期:2019-09-23 00:00:00
abstract::Structure-based virtual screening is highly used in the early stages of drug discovery to identify new putative lead compounds for a given target. However, when a small molecule elicits a biological effect, but its target is unknown, or the side effects it causes arise from its undesired interaction with unknown count...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00428
更新日期:2019-11-25 00:00:00
abstract::Saturated acyclic alkanes show steric strain if they are highly branched and, in extreme cases, fall apart rapidly at room temperature. Consequently, attempts to count the number of isomeric forms for a given molecular formula that neglect this physical consideration will inevitably overestimate the size of the availa...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci700246b
更新日期:2007-11-01 00:00:00
abstract::Advances in computer-aided translation technology have made tremendous progress in accuracy in the past few years. Chemical Abstracts Service of the American Chemical Society summarizes scientific works from more than 50 languages and allows the users to search papers in nine selected languages. Currently, only the ab...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.0c00274
更新日期:2020-07-27 00:00:00
abstract::Herein we investigate whether QM/MM could prove useful as a tool to study the often subtle binding phenomena found within pharmaceutical drug discovery programs. The goal of this investigation is to determine whether it is possible to employ high level QM/MM calculations to answer specific questions around a binding e...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci800419j
更新日期:2009-03-01 00:00:00
abstract::Big data is one of the key transformative factors which increasingly influences all aspects of modern life. Although this transformation brings vast opportunities it also generates novel challenges, not the least of which is organizing and searching this data deluge. The field of medicinal chemistry is not different: ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.7b00249
更新日期:2017-08-28 00:00:00
abstract::Determination of structural similarities between protein binding pockets is an important challenge in in silico drug design. It can help to understand selectivity considerations, predict unexpected ligand cross-reactivity, and support the putative annotation of function to orphan proteins. To this end, Cavbase was dev...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci5005898
更新日期:2015-01-26 00:00:00
abstract::Binding affinity prediction with implicit solvent models remains a challenge in virtual screening for drug discovery. In order to assess the predictive power of implicit solvent models in docking techniques with Amber scoring, three generalized Born models (GBHCT, GBOBCI, and GBOBCII) available in Dock 6.7 were utiliz...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.6b00418
更新日期:2016-10-24 00:00:00
abstract::A three-dimensional homology model of the human histamine H 4 receptor was developed to investigate the binding mode of a series of structurally diverse H 4-agonists, i.e. histamine, clozapine, and the recently described selective, nonimidazole agonist VUF 8430. Mutagenesis studies and docking of these ligands in a rh...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci700474a
更新日期:2008-07-01 00:00:00
abstract::In this study, we aimed to develop a new ligand-based virtual screening approach using an effective shape-overlapping procedure and a more robust scoring function (denoted by the HWZ score for convenience). The HWZ score-based virtual screening approach was tested against the compounds for 40 protein targets available...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci200617d
更新日期:2012-04-23 00:00:00
abstract::Maltose-binding protein is a periplasmic binding protein responsible for transport of maltooligosaccarides through the periplasmic space of Gram-negative bacteria, as a part of the ABC transport system. The molecular mechanisms of the initial ligand binding and induced large scale motion of the protein's domains still...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci500520q
更新日期:2015-02-23 00:00:00
abstract::How well do different classification methods perform in selecting the ligands of a protein target out of large compound collections not used to train the model? Support vector machines, random forest, artificial neural networks, k-nearest-neighbor classification with genetic-algorithm-optimized feature selection, tren...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci050519k
更新日期:2006-05-01 00:00:00
abstract::G protein-coupled receptors (GPCRs) are the largest family of cell surface receptors, which is arguably the most important family of drug target. With the technology breakthroughs in X-ray crystallography and cryo-electron microscopy, more than 300 GPCR-ligand complex structures have been publicly reported since 2007,...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00699
更新日期:2020-09-28 00:00:00
abstract::Virtual screening is a powerful methodology to search for new small molecule inhibitors against a desired molecular target. Usually, it involves evaluating thousands of compounds (derived from large databases) in order to select a set of potential binders that will be tested in the wet-lab. The number of tested compou...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.7b00241
更新日期:2017-08-28 00:00:00
abstract::Acetohydroxyacid synthase (AHAS) is a thiamin diphosphate-dependent enzyme involved in the biosynthesis of valine, leucine, isoleucine, and lysine. Experimental evidence has shown that mutation of the Gln202 residue results in a decrease in the enzymatic activity, thus suggesting the main role of the carboligation cat...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00863
更新日期:2020-02-24 00:00:00
abstract::Following the theoretical model by Hann et al. moderately complex structures are preferable lead compounds since they lead to specific binding events involving the complete ligand molecule. To make this concept usable in practice for library design, we studied several complexity measures on the biological activity of ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci0503558
更新日期:2006-03-01 00:00:00
abstract::The efficiency of automated compound screening is heavily influenced by the design and the quality of the screening libraries used. We recently reported on the assembly of one diverse and one target-focused lead-like screening library. Using data from 15 enzyme-based screenings conducted using these libraries, their p...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci300382f
更新日期:2013-03-25 00:00:00
abstract::Given the need for modern researchers to produce open, reproducible scientific output, the lack of standards and best practices for sharing data and workflows used to produce and analyze molecular dynamics (MD) simulations has become an important issue in the field. There are now multiple well-established packages to ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00665
更新日期:2019-10-28 00:00:00
abstract::Up to now, publicly available data sets to build and evaluate Ames mutagenicity prediction tools have been very limited in terms of size and chemical space covered. In this report we describe a new unique public Ames mutagenicity data set comprising about 6500 nonconfidential compounds (available as SMILES strings and...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci900161g
更新日期:2009-09-01 00:00:00
abstract::The partitioning of solute molecules between immiscible solvents with significantly different polarities is of great importance. The polarization between the solute and solvent molecules plays an essential role in determining the solubility of the solute, which makes computational studies utilizing molecular mechanics...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.7b00001
更新日期:2017-10-23 00:00:00
abstract::Different forms of synaptic plasticity in the cerebellum expressed at the synapses onto Purkinje cells (PCs) are mediated by membrane metabotropic glutamate receptors (mGluRs). There are three main mGluR groups with a total of 8 subtypes. Although mGluRs are also found at the climbing fiber (CF) to PC synapses, the di...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci050161s
更新日期:2005-11-01 00:00:00
abstract::This article presents the computation of both inter- and intramolecular hydrogen bond strengths from first-principles. Quantum chemical calculations conducted at the dispersion-corrected density functional theory level including free energy and solvation contributions are conducted for (i) one-to-one hydrogen-bonded c...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00132
更新日期:2019-09-23 00:00:00
abstract::HackaMol is an open source, object-oriented toolkit written in Modern Perl that organizes atoms within molecules and provides chemically intuitive attributes and methods. The library consists of two components: HackaMol, the core that contains classes for storing and manipulating molecular information, and HackaMol::X...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci500359e
更新日期:2015-04-27 00:00:00
abstract::Protein-ligand binding is essential to almost all life processes. The understanding of protein-ligand interactions is fundamentally important to rational drug and protein design. Based on large scale data sets, we show that protein rigidity strengthening or flexibility reduction is a mechanism in protein-ligand bindin...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.7b00226
更新日期:2017-07-24 00:00:00
abstract::The urgent need for new treatments for the chronic lung disease idiopathic pulmonary fibrosis (IPF) motivates research into antagonists of the RGD binding integrin αvβ6, a protein linked to the initiation and progression of the disease. Molecular dynamics (MD) simulations of αvβ6 in complex with its natural ligand, pr...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.0c00254
更新日期:2020-11-23 00:00:00
abstract::Reversible covalent inhibitors have drawn increasing attention in drug design, as they are likely more potent than noncovalent inhibitors and less toxic than covalent inhibitors. Despite those advantages, the computational prediction of reversible covalent binding presents a formidable challenge because the binding pr...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.8b00959
更新日期:2019-05-28 00:00:00
abstract::The partitioning of amino acids between water and apolar environments is of vital importance in protein function and drug delivery. Here we present an extensive benchmark for octanol/water (log Poct), chloroform/water (log Pclf), and cyclohexane/water (log Pchx) partition coefficients of neutral amino acid side chain ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.8b00493
更新日期:2018-08-27 00:00:00