Chemoinformatics-based classification of prohibited substances employed for doping in sport.

Abstract:

:Representative molecules from 10 classes of prohibited substances were taken from the World Anti-Doping Agency (WADA) list, augmented by molecules from corresponding activity classes found in the MDDR database. Together with some explicitly allowed compounds, these formed a set of 5245 molecules. Five types of fingerprints were calculated for these substances. The random forest classification method was used to predict membership of each prohibited class on the basis of each type of fingerprint, using 5-fold cross-validation. We also used a k-nearest neighbors (kNN) approach, which worked well for the smallest values of k. The most successful classifiers are based on Unity 2D fingerprints and give very similar Matthews correlation coefficients of 0.836 (kNN) and 0.829 (random forest). The kNN classifiers tend to give a higher recall of positives at the expense of lower precision. A naïve Bayesian classifier, however, lies much further toward the extreme of high recall and low precision. Our results suggest that it will be possible to produce a reliable and quantitative assignment of membership or otherwise of each class of prohibited substances. This should aid the fight against the use of bioactive novel compounds as doping agents, while also protecting athletes against unjust disqualification.

journal_name

J Chem Inf Model

authors

Cannon EO,Bender A,Palmer DS,Mitchell JB

doi

10.1021/ci0601160

subject

Has Abstract

pub_date

2006-11-01 00:00:00

pages

2369-80

issue

6

eissn

1549-9596

issn

1549-960X

journal_volume

46

pub_type

杂志文章
  • Molecular Dynamics Simulations of Membrane-Bound STIM1 to Investigate Conformational Changes during STIM1 Activation upon Calcium Release.

    abstract::Calcium is involved in important intracellular processes, such as intracellular signaling from cell membrane receptors to the nucleus. Typically, calcium levels are kept at less than 100 nM in the nucleus and cytosol, but some calcium is stored in the endoplasmic reticulum (ER) lumen for rapid release to activate intr...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.6b00475

    authors: Mukherjee S,Karolak A,Debant M,Buscaglia P,Renaudineau Y,Mignen O,Guida WC,Brooks WH

    更新日期:2017-02-27 00:00:00

  • Assessment of the Sampling Performance of Multiple-Copy Dynamics versus a Unique Trajectory.

    abstract::The goal of the present study was to ascertain the differential performance of a long molecular dynamics trajectory versus several shorter ones starting from different points in the phase space and covering the same sampling time. For this purpose, we selected the 16-mer peptide Bak16BH3 as a model for study and carri...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.6b00347

    authors: Perez JJ,Tomas MS,Rubio-Martinez J

    更新日期:2016-10-24 00:00:00

  • Phytochemical informatics of traditional Chinese medicine and therapeutic relevance.

    abstract::Distribution patterns of 8411 compounds from 240 Chinese herbs were analyzed in relation to the herbal categories of traditional Chinese medicine (TCM), using Random Forest (RF) and self-organizing maps (SOM). RF was used first to construct TCM profiles of individual compounds, which describe their affinities for 28 m...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci700155t

    authors: Ehrman TM,Barlow DJ,Hylands PJ

    更新日期:2007-11-01 00:00:00

  • GalaxyDock: protein-ligand docking with flexible protein side-chains.

    abstract::An important issue in developing protein-ligand docking methods is how to incorporate receptor flexibility. Consideration of receptor flexibility using an ensemble of precompiled receptor conformations or by employing an effectively enlarged binding pocket has been reported to be useful. However, direct consideration ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci300342z

    authors: Shin WH,Seok C

    更新日期:2012-12-21 00:00:00

  • Applicability Domain ANalysis (ADAN): a robust method for assessing the reliability of drug property predictions.

    abstract::We report a novel method called ADAN (Applicability Domain ANalysis) for assessing the reliability of drug property predictions obtained by in silico methods. The assessment provided by ADAN is based on the comparison of the query compound with the training set, using six diverse similarity criteria. For every criteri...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci500172z

    authors: Carrió P,Pinto M,Ecker G,Sanz F,Pastor M

    更新日期:2014-05-27 00:00:00

  • Development of a computational tool to rival experts in the prediction of sites of metabolism of xenobiotics by p450s.

    abstract::The metabolism of xenobiotics--and more specifically drugs--in the liver is a critical process controlling their half-life. Although there exist experimental methods, which measure the metabolic stability of xenobiotics and identify their metabolites, developing higher throughput predictive methods is an avenue of res...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci3003073

    authors: Campagna-Slater V,Pottel J,Therrien E,Cantin LD,Moitessier N

    更新日期:2012-09-24 00:00:00

  • Estimation of carcinogenicity using molecular fragments tree.

    abstract::Carcinogenicity is an important toxicological endpoint that poses high concern to drug discovery. In this study, we developed a method to extract structural alerts (SAs) and modulating factors of carcinogens on the basis of statistical analyses. First, the Gaston algorithm, a frequent subgraph mining method, was used ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci300266p

    authors: Wang Y,Lu J,Wang F,Shen Q,Zheng M,Luo X,Zhu W,Jiang H,Chen K

    更新日期:2012-08-27 00:00:00

  • Virtual exploration of the chemical universe up to 11 atoms of C, N, O, F: assembly of 26.4 million structures (110.9 million stereoisomers) and analysis for new ring systems, stereochemistry, physicochemical properties, compound classes, and drug discove

    abstract::All molecules of up to 11 atoms of C, N, O, and F possible under consideration of simple valency, chemical stability, and synthetic feasibility rules were generated and collected in a database (GDB). GDB contains 26.4 million molecules (110.9 million stereoisomers), including three- and four-membered rings and triple ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci600423u

    authors: Fink T,Reymond JL

    更新日期:2007-03-01 00:00:00

  • Computational evidence for the role of Arabidopsis thaliana UVR8 as UV-B photoreceptor and identification of its chromophore amino acids.

    abstract::A homology model of the Arabidopsis thaliana UV resistance locus 8 (UVR8) protein is presented herein, showing a seven-bladed β-propeller conformation similar to the globular structure of RCC1. The UVR8 amino acid sequence contains a very high amount of conserved tryptophans, and the homology model shows that seven of...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci200017f

    authors: Wu M,Grahn E,Eriksson LA,Strid A

    更新日期:2011-06-27 00:00:00

  • Identification of ligand templates using local structure alignment for structure-based drug design.

    abstract::With a rapid increase in the number of high-resolution protein-ligand structures, the known protein-ligand structures can be used to gain insight into ligand-binding modes in a target protein. On the basis of the fact that the structurally similar binding sites share information about their ligands, we have developed ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci300178e

    authors: Lee HS,Im W

    更新日期:2012-10-22 00:00:00

  • Structure-based discovery of novel non-nucleosidic DNA alkyltransferase inhibitors: virtual screening and in vitro and in vivo activities.

    abstract::The human DNA-repair O (6)-alkylguanine DNA alkyltransferase (MGMT or hAGT) protein protects DNA from environmental alkylating agents and also plays an important role in tumor resistance to chemotherapy treatment. Available inhibitors, based on pseudosubstrate analogs, have been shown to induce substantial bone marrow...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci700447r

    authors: Ruiz FM,Gil-Redondo R,Morreale A,Ortiz AR,Fábrega C,Bravo J

    更新日期:2008-04-01 00:00:00

  • Holistic Approach to Partial Covalent Interactions in Protein Structure Prediction and Design with Rosetta.

    abstract::Partial covalent interactions (PCIs) in proteins, which include hydrogen bonds, salt bridges, cation-π, and π-π interactions, contribute to thermodynamic stability and facilitate interactions with other biomolecules. Several score functions have been developed within the Rosetta protein modeling framework that identif...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00398

    authors: Combs SA,Mueller BK,Meiler J

    更新日期:2018-05-29 00:00:00

  • Structural basis for the mutation-induced dysfunction of human CYP2J2: a computational study.

    abstract::Arachidonic acid is an essential fatty acid in cells, acting as a key inflammatory intermediate in inflammatory reactions. In cardiac tissues, CYP2J2 can adopt arachidonic acid as a major substrate to produce epoxyeicosatrienoic acids (EETs), which can protect endothelial cells from ischemic or hypoxic injuries and ha...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci400003p

    authors: Cong S,Ma XT,Li YX,Wang JF

    更新日期:2013-06-24 00:00:00

  • Customizable Generation of Synthetically Accessible, Local Chemical Subspaces.

    abstract::Screening large libraries of chemicals has been an efficient strategy to discover bioactive compounds; however a portion of the potential for success is limited to the available libraries. Synergizing combinatorial and computational chemistries has emerged as a time-efficient strategy to explore the chemical space mor...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.6b00648

    authors: Pottel J,Moitessier N

    更新日期:2017-03-27 00:00:00

  • Molecular modeling of potential anticancer agents from African medicinal plants.

    abstract::Naturally occurring anticancer compounds represent about half of the chemotherapeutic drugs which have been put in the market against cancer until date. Computer-based or in silico virtual screening methods are often used in lead/hit discovery protocols. In this study, the "drug-likeness" of ~400 compounds from Africa...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci5003697

    authors: Ntie-Kang F,Nwodo JN,Ibezim A,Simoben CV,Karaman B,Ngwa VF,Sippl W,Adikwu MU,Mbaze LM

    更新日期:2014-09-22 00:00:00

  • Truncated variants of the GCN4 transcription activator protein bind DNA with dramatically different dynamical motifs.

    abstract::The yeast protein GCN4 is a transcriptional activator in the basic leucine zipper (bZip) family, whose distinguishing feature is the "chopstick-like" homodimer of alpha helices formed at the DNA-binding interface. While experiments have shown that truncated versions of the protein retain biologically relevant DNA-bind...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci500448e

    authors: McHarris DM,Barr DA

    更新日期:2014-10-27 00:00:00

  • Combinatorial × computational × cheminformatics (C3) approach to characterization of congeneric libraries of organic pollutants.

    abstract::Congeners are molecules based on the same carbon skeleton but are different by the number of substituents and/or a substitution pattern. Examples are 1-chloronaphthalene, 1,4-dichloronaphthalene, and 1,3,8-trichloronaphthalene. Various persistent organic pollutants (POPs) exist in the environment as families of congen...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci300289b

    authors: Haranczyk M,Urbaszek P,Ng EG,Puzyn T

    更新日期:2012-11-26 00:00:00

  • Accurate Estimation of pKb Values for Amino Groups from Surface Electrostatic Potential (VS,min) Calculations: The Isoelectric Points of Amino Acids as a Case Study.

    abstract::Theoretical calculation of equilibrium dissociation constants is a very computationally demanding and time-consuming process since it requires an extremely accurate computation of the solvation free energy changes for each of the species involved. By correlating the minimum surface electrostatic potential (VS,min) on ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b01173

    authors: Sandoval-Lira J,Mondragón-Solórzano G,Lugo-Fuentes LI,Barroso-Flores J

    更新日期:2020-03-23 00:00:00

  • LigQ: A Webserver to Select and Prepare Ligands for Virtual Screening.

    abstract::Virtual screening is a powerful methodology to search for new small molecule inhibitors against a desired molecular target. Usually, it involves evaluating thousands of compounds (derived from large databases) in order to select a set of potential binders that will be tested in the wet-lab. The number of tested compou...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00241

    authors: Radusky L,Ruiz-Carmona S,Modenutti C,Barril X,Turjanski AG,Martí MA

    更新日期:2017-08-28 00:00:00

  • Time-Domain Analysis of Molecular Dynamics Trajectories Using Deep Neural Networks: Application to Activity Ranking of Tankyrase Inhibitors.

    abstract::Molecular dynamics simulations provide valuable insights into the behavior of molecular systems. Extending the recent trend of using machine learning techniques to predict physicochemical properties from molecular dynamics data, we propose to consider the trajectories as multidimensional time series represented by 2D ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00135

    authors: Berishvili VP,Perkin VO,Voronkov AE,Radchenko EV,Syed R,Venkata Ramana Reddy C,Pillay V,Kumar P,Choonara YE,Kamal A,Palyulin VA

    更新日期:2019-08-26 00:00:00

  • Retrospect and Prospect of Single Particle Cryo-Electron Microscopy: The Class of Integral Membrane Proteins as an Example.

    abstract::A giant technological leap in the field of cryo-electron microscopy (cryo-EM) has assured the achievement of near-atomic resolution structures of biological macromolecules. As a recognition of this accomplishment, the Nobel Prize in Chemistry was awarded in 2017 to Jacques Dubochet, Joachim Frank, and Richard Henderso...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b01015

    authors: Akbar S,Mozumder S,Sengupta J

    更新日期:2020-05-26 00:00:00

  • Delineation of agonist binding to the human histamine H4 receptor using mutational analysis, homology modeling, and ab initio calculations.

    abstract::A three-dimensional homology model of the human histamine H 4 receptor was developed to investigate the binding mode of a series of structurally diverse H 4-agonists, i.e. histamine, clozapine, and the recently described selective, nonimidazole agonist VUF 8430. Mutagenesis studies and docking of these ligands in a rh...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci700474a

    authors: Jongejan A,Lim HD,Smits RA,de Esch IJ,Haaksma E,Leurs R

    更新日期:2008-07-01 00:00:00

  • Catalytic Role of Gln202 in the Carboligation Reaction Mechanism of Yeast AHAS: A QM/MM Study.

    abstract::Acetohydroxyacid synthase (AHAS) is a thiamin diphosphate-dependent enzyme involved in the biosynthesis of valine, leucine, isoleucine, and lysine. Experimental evidence has shown that mutation of the Gln202 residue results in a decrease in the enzymatic activity, thus suggesting the main role of the carboligation cat...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00863

    authors: Mendoza F,Medina FE,Jiménez VA,Jaña GA

    更新日期:2020-02-24 00:00:00

  • Protein-protein binding site prediction by local structural alignment.

    abstract::Generalization of an earlier algorithm has led to the development of new local structural alignment algorithms for prediction of protein-protein binding sites. The algorithms use maximum cliques on protein graphs to define structurally similar protein regions. The search for structural neighbors in the new algorithms ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci100265x

    authors: Carl N,Konc J,Vehar B,Janezic D

    更新日期:2010-10-25 00:00:00

  • Develop and test a solvent accessible surface area-based model in conformational entropy calculations.

    abstract::It is of great interest in modern drug design to accurately calculate the free energies of protein-ligand or nucleic acid-ligand binding. MM-PBSA (molecular mechanics Poisson-Boltzmann surface area) and MM-GBSA (molecular mechanics generalized Born surface area) have gained popularity in this field. For both methods, ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci300064d

    authors: Wang J,Hou T

    更新日期:2012-05-25 00:00:00

  • Cheminformatics Modeling of Adverse Drug Responses by Clinically Relevant Mutants of Human Androgen Receptor.

    abstract::The human androgen receptor (AR) is a ligand-activated transcription factor that plays a pivotal role in the development and progression of prostate cancer (PCa). Many forms of castration-resistant prostate cancer (CRPC) still rely on the AR for survival. Currently used antiandrogens face clinical limitations as drug ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.6b00400

    authors: Paul N,Carabet LA,Lallous N,Yamazaki T,Gleave ME,Rennie PS,Cherkasov A

    更新日期:2016-12-27 00:00:00

  • Training a scoring function for the alignment of small molecules.

    abstract::A comprehensive data set of aligned ligands with highly similar binding pockets from the Protein Data Bank has been built. Based on this data set, a scoring function for recognizing good alignment poses for small molecules has been developed. This function is based on atoms and hydrogen-bond projected features. The co...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci100227h

    authors: Chan SL,Labute P

    更新日期:2010-09-27 00:00:00

  • Residue preference mapping of ligand fragments in the Protein Data Bank.

    abstract::The interaction between small molecules and proteins is one of the major concerns for structure-based drug design because the principles of protein-ligand interactions and molecular recognition are not thoroughly understood. Fortunately, the analysis of protein-ligand complexes in the Protein Data Bank (PDB) enables u...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci100386y

    authors: Wang L,Xie Z,Wipf P,Xie XQ

    更新日期:2011-04-25 00:00:00

  • VMD Store-A VMD Plugin to Browse, Discover, and Install VMD Extensions.

    abstract::Herein we present the VMD Store, an open-source VMD plugin that simplifies the way that users browse, discover, install, update, and uninstall extensions for the Visual Molecular Dynamics (VMD) software. The VMD Store obtains data about all the indexed VMD extensions hosted on GitHub and presents a one-click mechanism...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00739

    authors: Fernandes HS,Sousa SF,Cerqueira NMFSA

    更新日期:2019-11-25 00:00:00

  • The ensemble performance index: an improved measure for assessing ensemble pose prediction performance.

    abstract::We present a theoretical study on the performance of ensemble docking methodologies considering multiple protein structures. We perform a theoretical analysis of pose prediction experiments which is completely unbiased, as we make no assumptions about specific scoring functions, search paradigms, protein structures, o...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci2002796

    authors: Korb O,McCabe P,Cole J

    更新日期:2011-11-28 00:00:00