Scaling predictive modeling in drug development with cloud computing.

Abstract:

:Growing data sets with increased time for analysis is hampering predictive modeling in drug discovery. Model building can be carried out on high-performance computer clusters, but these can be expensive to purchase and maintain. We have evaluated ligand-based modeling on cloud computing resources where computations are parallelized and run on the Amazon Elastic Cloud. We trained models on open data sets of varying sizes for the end points logP and Ames mutagenicity and compare with model building parallelized on a traditional high-performance computing cluster. We show that while high-performance computing results in faster model building, the use of cloud computing resources is feasible for large data sets and scales well within cloud instances. An additional advantage of cloud computing is that the costs of predictive models can be easily quantified, and a choice can be made between speed and economy. The easy access to computational resources with no up-front investments makes cloud computing an attractive alternative for scientists, especially for those without access to a supercomputer, and our study shows that it enables cost-efficient modeling of large data sets on demand within reasonable time.

journal_name

J Chem Inf Model

authors

Moghadam BT,Alvarsson J,Holm M,Eklund M,Carlsson L,Spjuth O

doi

10.1021/ci500580y

subject

Has Abstract

pub_date

2015-01-26 00:00:00

pages

19-25

issue

1

eissn

1549-9596

issn

1549-960X

journal_volume

55

pub_type

杂志文章
  • L-arginine binding to human inducible nitric oxide synthase: an antisymmetric funnel route toward isoform-specific inhibitors?

    abstract::Nitric oxide (NO) is an important signaling molecule produced by a family of enzymes called nitric oxide synthases (NOS). Because NO is involved in various pathological conditions, the development of potent and isoform-selective NOS inhibitors is an important challenge. In the present study, the dimer of oxygenase dom...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci100422v

    authors: Floquet N,Hernandez JF,Boucher JL,Martinez J

    更新日期:2011-06-27 00:00:00

  • Molecular modeling of potential anticancer agents from African medicinal plants.

    abstract::Naturally occurring anticancer compounds represent about half of the chemotherapeutic drugs which have been put in the market against cancer until date. Computer-based or in silico virtual screening methods are often used in lead/hit discovery protocols. In this study, the "drug-likeness" of ~400 compounds from Africa...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci5003697

    authors: Ntie-Kang F,Nwodo JN,Ibezim A,Simoben CV,Karaman B,Ngwa VF,Sippl W,Adikwu MU,Mbaze LM

    更新日期:2014-09-22 00:00:00

  • Chemical Topic Modeling: Exploring Molecular Data Sets Using a Common Text-Mining Approach.

    abstract::Big data is one of the key transformative factors which increasingly influences all aspects of modern life. Although this transformation brings vast opportunities it also generates novel challenges, not the least of which is organizing and searching this data deluge. The field of medicinal chemistry is not different: ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00249

    authors: Schneider N,Fechner N,Landrum GA,Stiefl N

    更新日期:2017-08-28 00:00:00

  • Machine Learning Enhanced Spectrum Recognition Based on Computer Vision (SRCV) for Intelligent NMR Data Extraction.

    abstract::A machine learning enhanced spectrum recognition system called spectrum recognition based on computer vision (SRCV) for data extraction from previously analyzed 13C and 1H NMR spectra has been developed. The intelligent system was designed with four function modules to extract data from three areas of NMR images, incl...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c01046

    authors: Jia W,Yang Z,Yang M,Cheng L,Lei Z,Wang X

    更新日期:2021-01-25 00:00:00

  • Simulation-Based Algorithm for Two-Dimensional Chemical Structure Diagram Generation of Complex Molecules and Ligand-Protein Interactions.

    abstract::Computer programs for structure diagram generation (SDG) are indispensable cheminformatic tools that translate one- or three-dimensional (1D or 3D) chemical structure data stored in electronic formats to human-readable 2D depictions. Although many such programs are known, only a moderate part of chemical space can be ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.6b00391

    authors: Frączek T

    更新日期:2016-12-27 00:00:00

  • Statistical confidence for variable selection in QSAR models via Monte Carlo cross-validation.

    abstract::A new variable selection wrapper method named the Monte Carlo variable selection (MCVS) method was developed utilizing the framework of the Monte Carlo cross-validation (MCCV) approach. The MCVS method reports the variable selection results in the most conventional and common measure of statistical hypothesis testing,...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci700283s

    authors: Konovalov DA,Sim N,Deconinck E,Vander Heyden Y,Coomans D

    更新日期:2008-02-01 00:00:00

  • Long-range effects of a peripheral mutation on the enzymatic activity of cytochrome P450 1A2.

    abstract::The human cytochrome P450 1A2 is an important drug metabolizing and procarcinogen activating enzyme. An experimental study found that a peripheral mutation, F186L, at ∼26 Å away from the enzyme's active site, caused a significant reduction in the enzymatic activity of 1A2 deethylation reactions. In this paper, we expl...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci200112b

    authors: Zhang T,Liu LA,Lewis DF,Wei DQ

    更新日期:2011-06-27 00:00:00

  • ReFlex3D: Refined Flexible Alignment of Molecules Using Shape and Electrostatics.

    abstract::We present an algorithm, ReFlex3D, for the refinement of flexible molecular alignments based on their three-dimensional shape and electrostatic properties. The algorithm is designed to be used with fast conformer generators to refine an initial overlay between two molecules and thus to obtain improved overlaps as judg...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00618

    authors: Schmidt TC,Cosgrove DA,Boström J

    更新日期:2018-04-23 00:00:00

  • Chemoisosterism in the proteome.

    abstract::The concept of chemoisosterism of protein environments is introduced as the complementary property to bioisosterism of chemical fragments. In the same way that two chemical fragments are considered bioisosteric if they can bind to the same protein environment, two protein environments will be considered chemoisosteric...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci3002974

    authors: Jalencas X,Mestres J

    更新日期:2013-02-25 00:00:00

  • iUmami-SCM: A Novel Sequence-Based Predictor for Prediction and Analysis of Umami Peptides Using a Scoring Card Method with Propensity Scores of Dipeptides.

    abstract::Umami or the taste of monosodium glutamate represents one of the major attractive taste modalities in humans. Therefore, knowledge about biophysical and biochemical properties of the umami taste is important for both scientific research and the food industry. Experimental approaches for predicting umami peptides are l...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00707

    authors: Charoenkwan P,Yana J,Nantasenamat C,Hasan MM,Shoombuatong W

    更新日期:2020-12-28 00:00:00

  • Prediction of molecular solvation free energy based on the optimization of atomic solvation parameters with genetic algorithm.

    abstract::We propose an improved solvent contact model to estimate the solvation free energy of an organic molecule from individual atomic contributions. The modification of the solvation model involves the optimization of three kinds of parameters in the solvation free energy function: atomic fragmental volume, maximum atomic ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci600453b

    authors: Kang H,Choi H,Park H

    更新日期:2007-03-01 00:00:00

  • Probing the Binding Pathway of BRACO19 to a Parallel-Stranded Human Telomeric G-Quadruplex Using Molecular Dynamics Binding Simulation with AMBER DNA OL15 and Ligand GAFF2 Force Fields.

    abstract::Human telomeric DNA G-quadruplex has been identified as a good therapeutic target in cancer treatment. G-quadruplex-specific ligands that stabilize the G-quadruplex have great potential to be developed as anticancer agents. Two crystal structures (an apo form of parallel stranded human telomeric G-quadruplex and its h...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00287

    authors: Machireddy B,Kalra G,Jonnalagadda S,Ramanujachary K,Wu C

    更新日期:2017-11-27 00:00:00

  • Combined approach using ligand efficiency, cross-docking, and antitarget hits for wild-type and drug-resistant Y181C HIV-1 reverse transcriptase.

    abstract::New hits against HIV-1 wild-type and Y181C drug-resistant reverse transcriptases were predicted taking into account the possibility of some of the known metabolism interactions. In silico hits against a set of antitargets (i.e., proteins or nucleic acids that are off-targets from the desired pharmaceutical target obje...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci200203h

    authors: García-Sosa AT,Sild S,Takkis K,Maran U

    更新日期:2011-10-24 00:00:00

  • Probabilistic models for capturing more physicochemical properties on protein-protein interface.

    abstract::Protein-protein interactions play a key role in a multitude of biological processes, such as signal transduction, de novo drug design, immune responses, and enzymatic activities. It is of great interest to understand how proteins interact with each other. The general approach is to explore all possible poses and ident...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci5002372

    authors: Guo F,Li SC,Du P,Wang L

    更新日期:2014-06-23 00:00:00

  • GalaxyDock: protein-ligand docking with flexible protein side-chains.

    abstract::An important issue in developing protein-ligand docking methods is how to incorporate receptor flexibility. Consideration of receptor flexibility using an ensemble of precompiled receptor conformations or by employing an effectively enlarged binding pocket has been reported to be useful. However, direct consideration ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci300342z

    authors: Shin WH,Seok C

    更新日期:2012-12-21 00:00:00

  • Structure-based approach for the study of estrogen receptor binding affinity and subtype selectivity.

    abstract::Estrogens exert important physiological effects through the modulation of two human estrogen receptor (hER) subtypes, alpha (hERalpha) and beta (hERbeta). Because the levels and relative proportion of hERalpha and hERbeta differ significantly in different target cells, selective hER ligands could target specific tissu...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci8002182

    authors: Salum LB,Polikarpov I,Andricopulo AD

    更新日期:2008-11-01 00:00:00

  • Identifying promising compounds in drug discovery: genetic algorithms and some new statistical techniques.

    abstract::Throughout the drug discovery process, discovery teams are compelled to use statistics for making decisions using data from a variety of inputs. For instance, teams are asked to prioritize compounds for subsequent stages of the drug discovery process, given results from multiple screens. To assist in the prioritizatio...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci600556v

    authors: Mandal A,Johnson K,Wu CF,Bornemeier D

    更新日期:2007-05-01 00:00:00

  • Free energy calculations give insight into the stereoselective hydroxylation of α-ionones by engineered cytochrome P450 BM3 mutants.

    abstract::Previously, stereoselective hydroxylation of α-ionone by Cytochrome P450 BM3 mutants M01 A82W and M11 L437N was observed. While both mutants hydroxylate α-ionone in a regioselective manner at the C3 position, M01 A82W catalyzes formation of trans-3-OH-α-ionone products whereas M11 L437N exhibits opposite stereoselecti...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci300243n

    authors: de Beer SB,Venkataraman H,Geerke DP,Oostenbrink C,Vermeulen NP

    更新日期:2012-08-27 00:00:00

  • Pharmer: efficient and exact pharmacophore search.

    abstract::Pharmacophore search is a key component of many drug discovery efforts. Pharmer is a new computational approach to pharmacophore search that scales with the breadth and complexity of the query, not the size of the compound library being screened. Two novel methods for organizing pharmacophore data, the Pharmer KDB-tre...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci200097m

    authors: Koes DR,Camacho CJ

    更新日期:2011-06-27 00:00:00

  • Real external predictivity of QSAR models. Part 2. New intercomparable thresholds for different validation criteria and the need for scatter plot inspection.

    abstract::The evaluation of regression QSAR model performance, in fitting, robustness, and external prediction, is of pivotal importance. Over the past decade, different external validation parameters have been proposed: Q(F1)(2), Q(F2)(2), Q(F3)(2), r(m)(2), and the Golbraikh-Tropsha method. Recently, the concordance correlati...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci300084j

    authors: Chirico N,Gramatica P

    更新日期:2012-08-27 00:00:00

  • GPCR-Bench: A Benchmarking Set and Practitioners' Guide for G Protein-Coupled Receptor Docking.

    abstract::Virtual screening is routinely used to discover new ligands and in particular new ligand chemotypes for G protein-coupled receptors (GPCRs). To prepare for a virtual screen, we often tailor a docking protocol that will enable us to select the best candidates for further screening. To aid this, we created GPCR-Bench, a...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.5b00660

    authors: Weiss DR,Bortolato A,Tehan B,Mason JS

    更新日期:2016-04-25 00:00:00

  • Influence of Descriptor Implementation on Compound Ranking Based on Multiparameter Assessment.

    abstract::Most of the common molecular descriptors have numerous different implementations. This can influence the results of compound prioritization based on the multiparameter assessment (MPA) approach that allows a medicinal chemist to simultaneously analyze and achieve the desired balance of the diverse and often conflictin...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00734

    authors: Sosnina EA,Osolodkin DI,Radchenko EV,Sosnin S,Palyulin VA

    更新日期:2018-05-29 00:00:00

  • Comparison Study of Polar and Nonpolar Contributions to Solvation Free Energy.

    abstract::In this study, we compared the contributions of polar and nonpolar interactions to the solvation free energy of a solute in solvent, which is decomposed into four different terms based on the nature of interactions: (i) electrostatic solvation free energy term counting for the work done to move solute charges from fix...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00368

    authors: Izairi R,Kamberaj H

    更新日期:2017-10-23 00:00:00

  • ANN multiscale model of anti-HIV drugs activity vs AIDS prevalence in the US at county level based on information indices of molecular graphs and social networks.

    abstract::This work is aimed at describing the workflow for a methodology that combines chemoinformatics and pharmacoepidemiology methods and at reporting the first predictive model developed with this methodology. The new model is able to predict complex networks of AIDS prevalence in the US counties, taking into consideration...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci400716y

    authors: González-Díaz H,Herrera-Ibatá DM,Duardo-Sánchez A,Munteanu CR,Orbegozo-Medina RA,Pazos A

    更新日期:2014-03-24 00:00:00

  • Sensitivity of Folding Molecular Dynamics Simulations to Even Minor Force Field Changes.

    abstract::We examine the sensitivity of folding molecular dynamics simulations on the choice between three variants of the same force field (the AMBER99SB force field and its ILDN, NMR-ILDN, and STAR-ILDN variants). Using two different peptide systems (a marginally stable helical peptide and a β-hairpin) and a grand total of mo...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.6b00493

    authors: Serafeim AP,Salamanos G,Patapati KK,Glykos NM

    更新日期:2016-10-24 00:00:00

  • Substituted 4,5'-Bithiazoles as Catalytic Inhibitors of Human DNA Topoisomerase IIα.

    abstract::Human type II topoisomerases, molecular motors that alter the DNA topology, are a major target of modern chemotherapy. Groups of catalytic inhibitors represent a new approach to overcome the known limitations of topoisomerase II poisons such as cardiotoxicity and induction of secondary tumors. Here, we present a class...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00202

    authors: Bergant Loboda K,Janežič M,Štampar M,Žegura B,Filipič M,Perdih A

    更新日期:2020-07-27 00:00:00

  • Flux (2): comparison of molecular mutation and crossover operators for ligand-based de novo design.

    abstract::We implemented a fragment-based de novo design algorithm for a population-based optimization of molecular structures. The concept is grounded on an evolution strategy with mutation and crossover operators for structure breeding. Molecular building blocks were obtained from the pseudo-retrosynthesis of a collection of ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci6005307

    authors: Fechner U,Schneider G

    更新日期:2007-03-01 00:00:00

  • Enrichment factor analyses on G-protein coupled receptors with known crystal structure.

    abstract::G-protein coupled receptors (GPCRs) are highly relevant drug targets. Four GPCRs with known crystal structure were analyzed with docking (AutoDock4) and postdocking (MM-PBSA) in order to evaluate the ability to recognize known antagonists from a larger database of molecular decoys and to predict correct binding modes....

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci4000745

    authors: Anighoro A,Rastelli G

    更新日期:2013-04-22 00:00:00

  • How to Model Inter- and Intramolecular Hydrogen Bond Strengths with Quantum Chemistry.

    abstract::This article presents the computation of both inter- and intramolecular hydrogen bond strengths from first-principles. Quantum chemical calculations conducted at the dispersion-corrected density functional theory level including free energy and solvation contributions are conducted for (i) one-to-one hydrogen-bonded c...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00132

    authors: Bauer CA

    更新日期:2019-09-23 00:00:00

  • Protein flexibility in virtual screening: the BACE-1 case study.

    abstract::Simulating protein flexibility is a major issue in the docking-based drug-design process for which a single methodological solution does not exist. In our search of new anti-Alzheimer ligands, we were faced with the challenge of including receptor plasticity in a virtual screening campaign aimed at finding new β-secre...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci300390h

    authors: Cosconati S,Marinelli L,Di Leva FS,La Pietra V,De Simone A,Mancini F,Andrisano V,Novellino E,Goodsell DS,Olson AJ

    更新日期:2012-10-22 00:00:00