Open Source Bayesian Models. 1. Application to ADME/Tox and Drug Discovery Datasets.

Abstract:

:On the order of hundreds of absorption, distribution, metabolism, excretion, and toxicity (ADME/Tox) models have been described in the literature in the past decade which are more often than not inaccessible to anyone but their authors. Public accessibility is also an issue with computational models for bioactivity, and the ability to share such models still remains a major challenge limiting drug discovery. We describe the creation of a reference implementation of a Bayesian model-building software module, which we have released as an open source component that is now included in the Chemistry Development Kit (CDK) project, as well as implemented in the CDD Vault and in several mobile apps. We use this implementation to build an array of Bayesian models for ADME/Tox, in vitro and in vivo bioactivity, and other physicochemical properties. We show that these models possess cross-validation receiver operator curve values comparable to those generated previously in prior publications using alternative tools. We have now described how the implementation of Bayesian models with FCFP6 descriptors generated in the CDD Vault enables the rapid production of robust machine learning models from public data or the user's own datasets. The current study sets the stage for generating models in proprietary software (such as CDD) and exporting these models in a format that could be run in open source software using CDK components. This work also demonstrates that we can enable biocomputation across distributed private or public datasets to enhance drug discovery.

journal_name

J Chem Inf Model

authors

Clark AM,Dole K,Coulon-Spektor A,McNutt A,Grass G,Freundlich JS,Reynolds RC,Ekins S

doi

10.1021/acs.jcim.5b00143

subject

Has Abstract

pub_date

2015-06-22 00:00:00

pages

1231-45

issue

6

eissn

1549-9596

issn

1549-960X

journal_volume

55

pub_type

杂志文章
  • Annotation of Allosteric Compounds to Enhance Bioactivity Modeling for Class A GPCRs.

    abstract::Proteins often have both orthosteric and allosteric binding sites. Endogenous ligands, such as hormones and neurotransmitters, bind to the orthosteric site, while synthetic ligands may bind to orthosteric or allosteric sites, which has become a focal point in drug discovery. Usually, such allosteric modulators bind to...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00695

    authors: Burggraaff L,van Veen A,Lam CC,van Vlijmen HWT,IJzerman AP,van Westen GJP

    更新日期:2020-10-26 00:00:00

  • Criterion for evaluating the predictive ability of nonlinear regression models without cross-validation.

    abstract::We propose predictive performance criteria for nonlinear regression models without cross-validation. The proposed criteria are the determination coefficient and the root-mean-square error for the midpoints between k-nearest-neighbor data points. These criteria can be used to evaluate predictive ability after the regre...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci4003766

    authors: Kaneko H,Funatsu K

    更新日期:2013-09-23 00:00:00

  • How do metabolites differ from their parent molecules and how are they excreted?

    abstract::Understanding which physicochemical properties, or property distributions, are favorable for successful design and development of drugs, nutritional supplements, cosmetics, and agrochemicals is of great importance. In this study we have analyzed molecules from three distinct chemical spaces (i) approved drugs, (ii) hu...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci300487z

    authors: Kirchmair J,Howlett A,Peironcely JE,Murrell DS,Williamson MJ,Adams SE,Hankemeier T,van Buren L,Duchateau G,Klaffke W,Glen RC

    更新日期:2013-02-25 00:00:00

  • Scores of extended connectivity fingerprint as descriptors in QSPR study of melting point and aqueous solubility.

    abstract::QSPR studies, using scores of SciTegic's extended connectivity fingerprint as raw descriptors, were extended to the prediction of melting points and aqueous solubility of organic compounds. Robust partial least-squares models were developed that perform as well as the best published QSPR models for structurally divers...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci800024c

    authors: Zhou D,Alelyunas Y,Liu R

    更新日期:2008-05-01 00:00:00

  • Delineation of agonist binding to the human histamine H4 receptor using mutational analysis, homology modeling, and ab initio calculations.

    abstract::A three-dimensional homology model of the human histamine H 4 receptor was developed to investigate the binding mode of a series of structurally diverse H 4-agonists, i.e. histamine, clozapine, and the recently described selective, nonimidazole agonist VUF 8430. Mutagenesis studies and docking of these ligands in a rh...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci700474a

    authors: Jongejan A,Lim HD,Smits RA,de Esch IJ,Haaksma E,Leurs R

    更新日期:2008-07-01 00:00:00

  • CoMFA, CoMSIA, and molecular hologram QSAR studies of novel neuronal nAChRs ligands-open ring analogues of 3-pyridyl ether.

    abstract::3-Pyridyl ethers are excellent nAChRs ligands, which show high subtype selectivity and binding affinity to alpha4beta2 nAChR. Although the quantitative structure-activity relationship (QSAR) of nAChRs ligands has been widely investigated using various classes of compounds, the open ring analogues of 3-pyridyl ethers h...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci0498113

    authors: Zhang H,Li H,Liu C

    更新日期:2005-03-01 00:00:00

  • Molecular modeling of DNA cross-linking analogues based on the azinomycin scaffold.

    abstract::In this work, we present molecular modeling studies carried out using six DNA sequences and six azinomycin analogues, including the naturally occurring compound azinomycin B, selected on the basis of known cell cytotoxicity and structural analogies (epoxide and aziridine alkylating moieties). Among several computation...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci0496595

    authors: Alcaro S,Ortuso F,Coleman RS

    更新日期:2005-05-01 00:00:00

  • Whole-molecule calculation of log p based on molar volume, hydrogen bonds, and simulated 13C NMR spectra.

    abstract::The prediction of Log P is usually accomplished using either substructure or whole-molecule approaches. However, these methods are complicated, and previous whole-molecule approaches have not been successful for the prediction of Log P in very complex molecules. The observed chemical shifts in nuclear magnetic resonan...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci049643e

    authors: Schnackenberg LK,Beger RD

    更新日期:2005-03-01 00:00:00

  • "Social" network of isomers based on bond count distance: algorithms.

    abstract::This paper introduces the concept of an isomer network based on the reaction step counts between pairs of isomers as an alternative means to view and analyze isomer space. The computation of isomer networks is computationally expensive with respect to both run time and memory. Accordingly, this paper focuses on the de...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci4005173

    authors: Kouri TM,Awale M,Slyby JK,Reymond JL,Mehta DP

    更新日期:2014-01-27 00:00:00

  • Heuristics from Modeling of Spectral Overlap in Förster Resonance Energy Transfer (FRET).

    abstract::Among the photophysical parameters that underpin Förster resonance energy transfer (FRET), perhaps the least explored is the spectral overlap term ( J). While by definition J increases linearly with acceptor molar absorption coefficient (ε(A) in M-1 cm-1), is proportional to wavelength (λ4), and depends on the degree ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.8b00753

    authors: Qi Q,Taniguchi M,Lindsey JS

    更新日期:2019-02-25 00:00:00

  • Virtual drug screen schema based on multiview similarity integration and ranking aggregation.

    abstract::The current drug virtual screen (VS) methods mainly include two categories. i.e., ligand/target structure-based virtual screen and that, utilizing protein-ligand interaction fingerprint information based on the large number of complex structures. Since the former one focuses on the one-side information while the later...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci200481c

    authors: Kang H,Sheng Z,Zhu R,Huang Q,Liu Q,Cao Z

    更新日期:2012-03-26 00:00:00

  • GalaxyGPCRloop: Template-Based and Ab Initio Structure Sampling of the Extracellular Loops of G-Protein-Coupled Receptors.

    abstract::The second extracellular loops (ECL2s) of G-protein-coupled receptors (GPCRs) are often involved in GPCR functions, and their structures have important implications in drug discovery. However, structure prediction of ECL2 is difficult because of its long length and the structural diversity among different GPCRs. In th...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.8b00148

    authors: Won J,Lee GR,Park H,Seok C

    更新日期:2018-06-25 00:00:00

  • 3D-QSAR and docking studies of selective GSK-3beta inhibitors. Comparison with a thieno[2,3-b]pyrrolizinone derivative, a new potential lead for GSK-3beta ligands.

    abstract::The three-dimensional structures of 3-anilino-4-arylmaleimides, selective GSK-3beta inhibitors, were correlated to their biological affinities by 3D-QSAR studies (CoMFA method). The cocrystallographic data of GSK-3beta vs 3-anilino-4-arylmaleimide allowed us to compare 3D-QSAR results to experimental intermolecular in...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci050008y

    authors: Lescot E,Bureau R,Sopkova-de Oliveira Santos J,Rochais C,Lisowski V,Lancelot JC,Rault S

    更新日期:2005-05-01 00:00:00

  • Computational simulations of the interactions between acetyl-coenzyme-A carboxylase and clodinafop: resistance mechanism due to active and nonactive site mutations.

    abstract::Grass weed populations resistant to acetyl-CoA carboxylase-inhibiting (ACCase; EC 6.4.1.2) herbicides represent a major problem for the sustainable development of modern agriculture. In the present study, extensive computational simulations, including homology modeling, molecular dynamics (MD) simulations, and molecul...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci900174d

    authors: Zhu XL,Ge-Fei H,Zhan CG,Yang GF

    更新日期:2009-08-01 00:00:00

  • Systematics of high-genus fullerenes.

    abstract::In this article, we present a systematic way to classify a family of high-genus fullerenes (HGFs) by decomposing them into two types of necklike structures, which are the negatively curved parts of parent toroidal carbon nanotubes. By replacing the faces of a uniform polyhedron with these necks, an HGF polyhedron corr...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci9001124

    authors: Chuang C,Jin BY

    更新日期:2009-07-01 00:00:00

  • Computational fragment-based approach at PDB scale by protein local similarity.

    abstract::The large volume of protein-ligand structures now available enables innovative and efficient protocols in computational FBDD (Fragment-Based Drug Design) to be proposed based on experimental data. In this work, we build a database of MED-Portions, where a MED-Portion is a new structural object encoding protein-fragmen...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci8003094

    authors: Moriaud F,Doppelt-Azeroual O,Martin L,Oguievetskaia K,Koch K,Vorotyntsev A,Adcock SA,Delfaud F

    更新日期:2009-02-01 00:00:00

  • Discovery and Evaluation of Anti-Fibrinolytic Plasmin Inhibitors Derived from 5-(4-Piperidyl)isoxazol-3-ol (4-PIOL).

    abstract::Inhibition of plasmin has been found to effectively reduce fibrinolysis and to avoid hemorrhage. This can be achieved by addressing its kringle 1 domain with the known drug and lysine analogue tranexamic acid. Guided by shape similarities toward a previously discovered lead compound, 5-(4-piperidyl)isoxazol-3-ol, a se...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00255

    authors: Schmidt TC,Eriksson PO,Gustafsson D,Cosgrove D,Frølund B,Boström J

    更新日期:2017-07-24 00:00:00

  • Ensemble docking into multiple crystallographically derived protein structures: an evaluation based on the statistical analysis of enrichments.

    abstract::Docking into multiple receptor conformations ("ensemble docking") has been proposed, and employed, in the hope that it may account for receptor flexibility in virtual screening and thus provide higher enrichments than docking into single rigid receptor structures. The statistical analyses presented in this paper provi...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci900407c

    authors: Craig IR,Essex JW,Spiegel K

    更新日期:2010-04-26 00:00:00

  • Toward high throughput 3D virtual screening using spherical harmonic surface representations.

    abstract::Searching chemical databases for possible drug leads is often one of the main activities conducted during the early stages of a drug development project. This article shows that spherical harmonic molecular shape representations provide a powerful way to search and cluster small-molecule databases rapidly and accurate...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci7001507

    authors: Mavridis L,Hudson BD,Ritchie DW

    更新日期:2007-09-01 00:00:00

  • Direct Observation of β-Barrel Intermediates in the Self-Assembly of Toxic SOD128-38 and Absence in Nontoxic Glycine Mutants.

    abstract::Soluble low-molecular-weight oligomers formed during the early stage of amyloid aggregation are considered the major toxic species in amyloidosis. The structure-function relationship between oligomeric assemblies and the cytotoxicity in amyloid diseases are still elusive due to the heterogeneous and transient nature o...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c01319

    authors: Sun Y,Huang J,Duan X,Ding F

    更新日期:2021-01-14 00:00:00

  • Improved Prediction of Drug-Target Interactions Using Self-Paced Learning with Collaborative Matrix Factorization.

    abstract::Identifying drug-target interactions (DTIs) plays an important role in the field of drug discovery, drug side-effects, and drug repositioning. However, in vivo or biochemical experimental methods for identifying new DTIs are extremely expensive and time-consuming. Recently, in silico or various computational methods h...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00408

    authors: Xia LY,Yang ZY,Zhang H,Liang Y

    更新日期:2019-07-22 00:00:00

  • Multitarget structure-activity relationships characterized by activity-difference maps and consensus similarity measure.

    abstract::Dual and triple activity-difference (DAD/TAD) maps are tools for the systematic characterization of structure-activity relationships (SAR) of compound data sets screened against two or three targets. DAD and TAD maps are two- and three- dimensional representations of the pairwise activity differences of compound data ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci200281v

    authors: Medina-Franco JL,Yongye AB,Pérez-Villanueva J,Houghten RA,Martínez-Mayorga K

    更新日期:2011-09-26 00:00:00

  • Prediction and Experimental Confirmation of Novel Peripheral Cannabinoid-1 Receptor Antagonists.

    abstract::Small molecules targeting peripheral CB1 receptors have therapeutic potential in a variety of disorders including obesity-related, hormonal, and metabolic abnormalities, while avoiding the psychoactive effects in the central nervous system. We applied our in-house algorithm, iterative stochastic elimination, to produc...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00577

    authors: El-Atawneh S,Hirsch S,Hadar R,Tam J,Goldblum A

    更新日期:2019-09-23 00:00:00

  • Residue preference mapping of ligand fragments in the Protein Data Bank.

    abstract::The interaction between small molecules and proteins is one of the major concerns for structure-based drug design because the principles of protein-ligand interactions and molecular recognition are not thoroughly understood. Fortunately, the analysis of protein-ligand complexes in the Protein Data Bank (PDB) enables u...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci100386y

    authors: Wang L,Xie Z,Wipf P,Xie XQ

    更新日期:2011-04-25 00:00:00

  • Modeling compound-target interaction network of traditional Chinese medicines for type II diabetes mellitus: insight for polypharmacology and drug design.

    abstract::In this study, in order to elucidate the action mechanism of traditional Chinese medicines (TCMs) that exhibit clinical efficacy for type II diabetes mellitus (T2DM), an integrated protocol that combines molecular docking and pharmacophore mapping was employed to find the potential inhibitors from TCM for the T2DM-rel...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci400146u

    authors: Tian S,Li Y,Li D,Xu X,Wang J,Zhang Q,Hou T

    更新日期:2013-07-22 00:00:00

  • Pathway analysis for drug repositioning based on public database mining.

    abstract::Sixteen FDA-approved drugs were investigated to elucidate their mechanisms of action (MOAs) and clinical functions by pathway analysis based on retrieved drug targets interacting with or affected by the investigated drugs. Protein and gene targets and associated pathways were obtained by data-mining of public database...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci4005354

    authors: Pan Y,Cheng T,Wang Y,Bryant SH

    更新日期:2014-02-24 00:00:00

  • Probabilistic models for capturing more physicochemical properties on protein-protein interface.

    abstract::Protein-protein interactions play a key role in a multitude of biological processes, such as signal transduction, de novo drug design, immune responses, and enzymatic activities. It is of great interest to understand how proteins interact with each other. The general approach is to explore all possible poses and ident...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci5002372

    authors: Guo F,Li SC,Du P,Wang L

    更新日期:2014-06-23 00:00:00

  • Combination of Ambiguous and Unambiguous Data in the Restraint-driven Docking of Flexible Peptides with HADDOCK: The Binding of the Spider Toxin PcTx1 to the Acid Sensing Ion Channel (ASIC) 1a.

    abstract::Peptides that bind to ion channels have attracted much interest as potential lead molecules for the development of new drugs and insecticides. However, the structure determination of large peptide-channel complexes using experimental methods is challenging. Thus structural models are often derived from combining exper...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.5b00529

    authors: Deplazes E,Davies J,Bonvin AM,King GF,Mark AE

    更新日期:2016-01-25 00:00:00

  • PIIMS Server: A Web Server for Mutation Hotspot Scanning at the Protein-Protein Interface.

    abstract::Protein-protein interactions (PPIs) play vital roles in regulating biological processes, such as cellular and signaling pathways. Hotspots are certain residues located at protein-protein interfaces that contribute more in protein-protein binding than other residues. Research on the mutational effects of hotspots is im...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00966

    authors: Wu FX,Yang JF,Mei LC,Wang F,Hao GF,Yang GF

    更新日期:2021-01-25 00:00:00

  • Solvent accessible surface area-based hot-spot detection methods for protein-protein and protein-nucleic acid interfaces.

    abstract::Due to the importance of hot-spots (HS) detection and the efficiency of computational methodologies, several HS detecting approaches have been developed. The current paper presents new models to predict HS for protein-protein and protein-nucleic acid interactions with better statistics compared with the ones currently...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci500760m

    authors: Munteanu CR,Pimenta AC,Fernandez-Lozano C,Melo A,Cordeiro MN,Moreira IS

    更新日期:2015-05-26 00:00:00