Abstract:
:On the order of hundreds of absorption, distribution, metabolism, excretion, and toxicity (ADME/Tox) models have been described in the literature in the past decade which are more often than not inaccessible to anyone but their authors. Public accessibility is also an issue with computational models for bioactivity, and the ability to share such models still remains a major challenge limiting drug discovery. We describe the creation of a reference implementation of a Bayesian model-building software module, which we have released as an open source component that is now included in the Chemistry Development Kit (CDK) project, as well as implemented in the CDD Vault and in several mobile apps. We use this implementation to build an array of Bayesian models for ADME/Tox, in vitro and in vivo bioactivity, and other physicochemical properties. We show that these models possess cross-validation receiver operator curve values comparable to those generated previously in prior publications using alternative tools. We have now described how the implementation of Bayesian models with FCFP6 descriptors generated in the CDD Vault enables the rapid production of robust machine learning models from public data or the user's own datasets. The current study sets the stage for generating models in proprietary software (such as CDD) and exporting these models in a format that could be run in open source software using CDK components. This work also demonstrates that we can enable biocomputation across distributed private or public datasets to enhance drug discovery.
journal_name
J Chem Inf Modeljournal_title
Journal of chemical information and modelingauthors
Clark AM,Dole K,Coulon-Spektor A,McNutt A,Grass G,Freundlich JS,Reynolds RC,Ekins Sdoi
10.1021/acs.jcim.5b00143subject
Has Abstractpub_date
2015-06-22 00:00:00pages
1231-45issue
6eissn
1549-9596issn
1549-960Xjournal_volume
55pub_type
杂志文章abstract::Interfacial hydration strongly influences interactions between biomolecules. For example, drug-target complexes are often stabilized by hydration networks formed between hydrophilic residues and water molecules at the interface. Exhaustive exploration of hydration networks is challenging for experimental as well as th...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.5b00638
更新日期:2016-01-25 00:00:00
abstract::Functional diversity of the three-finger-protein domain (TFPD) had been acquired via hypervariability of some sequence positions and extensive insertion/deletion of short AA-segments that caused multidimensional drift of several sequence attributes such as the overall (HI) and local hydrophobicity levels, the isoelect...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.5b00322
更新日期:2015-09-28 00:00:00
abstract::Molecular docking programs are widely used modeling tools for predicting ligand binding modes and structure based virtual screening. In this study, six molecular docking programs (DOCK, FlexX, GLIDE, ICM, PhDOCK, and Surflex) were evaluated using metrics intended to assess docking pose and virtual screening accuracy. ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci900056c
更新日期:2009-06-01 00:00:00
abstract::Protein-ligand (PL) interactions play a key role in many life processes such as molecular recognition, molecular binding, signal transmission, and cell metabolism. Examples of interaction forces include hydrogen bonding, hydrophobic effects, steric clashes, electrostatic contacts, and van der Waals attractions. Curren...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.7b00310
更新日期:2018-01-22 00:00:00
abstract::Human type II topoisomerases, molecular motors that alter the DNA topology, are a major target of modern chemotherapy. Groups of catalytic inhibitors represent a new approach to overcome the known limitations of topoisomerase II poisons such as cardiotoxicity and induction of secondary tumors. Here, we present a class...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.0c00202
更新日期:2020-07-27 00:00:00
abstract::Estrogens exert important physiological effects through the modulation of two human estrogen receptor (hER) subtypes, alpha (hERalpha) and beta (hERbeta). Because the levels and relative proportion of hERalpha and hERbeta differ significantly in different target cells, selective hER ligands could target specific tissu...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci8002182
更新日期:2008-11-01 00:00:00
abstract::We present a succession of structural changes involved in hormone peptide activation of a prototypical GPCR. Microsecond molecular dynamics simulation generated conformational ensembles reveal propagation of structural changes through key "microswitches" within human AT1R bound to native hormone. The endocrine octa-pe...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.8b00583
更新日期:2019-01-28 00:00:00
abstract::The causative agent of African sleeping sickness, Trypanosoma brucei , undergoes an unusual mitochondrial RNA editing process that is essential for its survival. RNA editing terminal uridylyl transferase 2 of T. brucei (TbRET2) is an indispensable component of the editosome machinery that performs this editing. TbR...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci3001327
更新日期:2012-05-25 00:00:00
abstract::Activity cliffs are formed by structurally similar compounds having large potency differences. Coordinated activity cliffs evolve when compounds within groups of structural neighbors form multiple cliffs with different partners, giving rise to local networks of cliffs in a data set. Using particle swarm optimization, ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci3000503
更新日期:2012-04-23 00:00:00
abstract::Membrane fusion, a key step in the early stages of virus propagation, allows the release of the viral genome in the host cell cytoplasm. The process is initiated by fusion peptides that are small, hydrophobic components of viral membrane-embedded glycoproteins and are typically conserved within virus families. Here, w...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.0c01231
更新日期:2021-01-25 00:00:00
abstract::The human DNA-repair O (6)-alkylguanine DNA alkyltransferase (MGMT or hAGT) protein protects DNA from environmental alkylating agents and also plays an important role in tumor resistance to chemotherapy treatment. Available inhibitors, based on pseudosubstrate analogs, have been shown to induce substantial bone marrow...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci700447r
更新日期:2008-04-01 00:00:00
abstract::Fast and accurate predicting of the binding affinities of large sets of diverse protein−ligand complexes is an important, yet extremely challenging, task in drug discovery. The development of knowledge-based scoring functions exploiting structural information of known protein−ligand complexes represents a valuable con...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci100343j
更新日期:2011-02-28 00:00:00
abstract::Class II fructose-1,6-bisphosphate aldolases (FBA-II) are attractive new targets for the discovery of drugs to combat invasive fungal infection, because they are absent in animals and higher plants. Although several FBA-II inhibitors have been reported, none of these inhibitors exhibit antifungal effect so far. In thi...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.6b00763
更新日期:2017-06-26 00:00:00
abstract::The cannabinoid receptor subtype 2 (CB2) is a promising therapeutic target for blood cancer, pain relief, osteoporosis, and immune system disease. The recent withdrawal of Rimonabant, which targets another closely related cannabinoid receptor (CB1), accentuates the importance of selectivity for the development of CB2 ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci3003914
更新日期:2013-01-28 00:00:00
abstract::The appropriate selection of a chemical space represented by the data set, the selection of its chemical data representation, the development of a correct modeling process using a robust and reproducible algorithm, and the performance of an exhaustive training and external validation determine the usability and reprod...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.7b00492
更新日期:2017-11-27 00:00:00
abstract::With the hope of discovering effective analgesics with fewer side effects, attention has recently shifted to allosteric modulators of the opioid receptors. In the past two years, the first chemotypes of positive or silent allosteric modulators (PAMs or SAMs, respectively) of μ- and δ-opioid receptor types have been re...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.5b00388
更新日期:2015-09-28 00:00:00
abstract::Small molecule flexible alignment is a critical component of both ligand- and structure-based methods in computer-aided drug discovery. Despite its importance, the availability of high-quality flexible alignment software packages is limited. Here, we present BCL::MolAlign, a freely available property-based molecular a...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00020
更新日期:2019-02-25 00:00:00
abstract::A phenotypic screen (PS) is used to identify compounds causing a desired phenotype in a complex biological system where mechanisms and targets are largely unknown. Deconvoluting the mechanism of action of actives and identification of relevant targets and pathways remains a formidable challenge. Current methods fail t...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci400245c
更新日期:2014-02-24 00:00:00
abstract::A machine learning enhanced spectrum recognition system called spectrum recognition based on computer vision (SRCV) for data extraction from previously analyzed 13C and 1H NMR spectra has been developed. The intelligent system was designed with four function modules to extract data from three areas of NMR images, incl...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.0c01046
更新日期:2021-01-25 00:00:00
abstract::The d-Ala:d-Lac ligase, VanA, plays a critical role in the resistance of vancomycin. Indeed, it is involved in the synthesis of a peptidoglycan precursor, to which vancomycin cannot bind. The reaction catalyzed by VanA requires the opening of the so-called "ω-loop", so that the substrates can enter the active site. He...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.6b00211
更新日期:2016-09-26 00:00:00
abstract::Understanding the relationship between chemical structure and function is a ubiquitous problem within the fields of chemistry and biology. Simulation approaches attack the problem utilizing physics to understand a given process at the particle level. Unfortunately, these approaches are often too expensive for many pro...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci8001233
更新日期:2008-08-01 00:00:00
abstract::The configuring of a radial basis function network (RBFN) consists of selecting the network parameters (centers and widths in RBF units and weights between the hidden and output layers) and network architecture. The issues of suboptimum and overfitting, however, often occur in RBFN configuring. This paper presented a ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci600218d
更新日期:2006-11-01 00:00:00
abstract::Human telomeric DNA G-quadruplex has been identified as a good therapeutic target in cancer treatment. G-quadruplex-specific ligands that stabilize the G-quadruplex have great potential to be developed as anticancer agents. Two crystal structures (an apo form of parallel stranded human telomeric G-quadruplex and its h...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.7b00287
更新日期:2017-11-27 00:00:00
abstract::Throughout the drug discovery process, discovery teams are compelled to use statistics for making decisions using data from a variety of inputs. For instance, teams are asked to prioritize compounds for subsequent stages of the drug discovery process, given results from multiple screens. To assist in the prioritizatio...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci600556v
更新日期:2007-05-01 00:00:00
abstract::The evaluation of regression QSAR model performance, in fitting, robustness, and external prediction, is of pivotal importance. Over the past decade, different external validation parameters have been proposed: Q(F1)(2), Q(F2)(2), Q(F3)(2), r(m)(2), and the Golbraikh-Tropsha method. Recently, the concordance correlati...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci300084j
更新日期:2012-08-27 00:00:00
abstract::Double-stranded DNA (dsDNA) has been established as an efficient medium for charge migration, bringing it to the forefront of the field of molecular electronics and biological research. The charge migration rate is controlled by the electronic couplings between the two nucleobases of DNA/RNA. These electronic coupling...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.0c01072
更新日期:2021-01-25 00:00:00
abstract::Targeted covalent inhibitors (TCIs) have been successfully developed as high-affinity and selective inhibitors of enzymes of the protein kinase family. These drugs typically act by undergoing an electrophilic addition with an active-site cysteine residue, so design of a TCI begins with the identification of a "druggab...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.8b00454
更新日期:2018-09-24 00:00:00
abstract::We propose predictive performance criteria for nonlinear regression models without cross-validation. The proposed criteria are the determination coefficient and the root-mean-square error for the midpoints between k-nearest-neighbor data points. These criteria can be used to evaluate predictive ability after the regre...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci4003766
更新日期:2013-09-23 00:00:00
abstract::The partitioning of solute molecules between immiscible solvents with significantly different polarities is of great importance. The polarization between the solute and solvent molecules plays an essential role in determining the solubility of the solute, which makes computational studies utilizing molecular mechanics...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.7b00001
更新日期:2017-10-23 00:00:00
abstract::One tactic for cysteine protease inhibition is to form a covalent bond between an electrophilic atom of the inhibitor and the thiol of the catalytic cysteine. In this study, we evaluate the reaction free energy obtained from a hybrid quantum mechanical/molecular mechanical (QM/MM) free energy profile as a predictor of...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00847
更新日期:2020-02-24 00:00:00