Trust, but Verify II: A Practical Guide to Chemogenomics Data Curation.

Abstract:

:There is a growing public concern about the lack of reproducibility of experimental data published in peer-reviewed scientific literature. Herein, we review the most recent alerts regarding experimental data quality and discuss initiatives taken thus far to address this problem, especially in the area of chemical genomics. Going beyond just acknowledging the issue, we propose a chemical and biological data curation workflow that relies on existing cheminformatics approaches to flag, and when appropriate, correct possibly erroneous entries in large chemogenomics data sets. We posit that the adherence to the best practices for data curation is important for both experimental scientists who generate primary data and deposit them in chemical genomics databases and computational researchers who rely on these data for model development.

journal_name

J Chem Inf Model

authors

Fourches D,Muratov E,Tropsha A

doi

10.1021/acs.jcim.6b00129

subject

Has Abstract

pub_date

2016-07-25 00:00:00

pages

1243-52

issue

7

eissn

1549-9596

issn

1549-960X

journal_volume

56

pub_type

杂志文章,评审
  • Efficient Strategy for the Calculation of Solvation Free Energies in Water and Chloroform at the Quantum Mechanical/Molecular Mechanical Level.

    abstract::The partitioning of solute molecules between immiscible solvents with significantly different polarities is of great importance. The polarization between the solute and solvent molecules plays an essential role in determining the solubility of the solute, which makes computational studies utilizing molecular mechanics...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00001

    authors: Wang M,Li P,Jia X,Liu W,Shao Y,Hu W,Zheng J,Brooks BR,Mei Y

    更新日期:2017-10-23 00:00:00

  • PIIMS Server: A Web Server for Mutation Hotspot Scanning at the Protein-Protein Interface.

    abstract::Protein-protein interactions (PPIs) play vital roles in regulating biological processes, such as cellular and signaling pathways. Hotspots are certain residues located at protein-protein interfaces that contribute more in protein-protein binding than other residues. Research on the mutational effects of hotspots is im...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00966

    authors: Wu FX,Yang JF,Mei LC,Wang F,Hao GF,Yang GF

    更新日期:2021-01-25 00:00:00

  • The ensemble performance index: an improved measure for assessing ensemble pose prediction performance.

    abstract::We present a theoretical study on the performance of ensemble docking methodologies considering multiple protein structures. We perform a theoretical analysis of pose prediction experiments which is completely unbiased, as we make no assumptions about specific scoring functions, search paradigms, protein structures, o...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci2002796

    authors: Korb O,McCabe P,Cole J

    更新日期:2011-11-28 00:00:00

  • Dependence of QSAR models on the selection of trial descriptor sets: a demonstration using nanotoxicity endpoints of decorated nanotubes.

    abstract::Little attention has been given to the selection of trial descriptor sets when designing a QSAR analysis even though a great number of descriptor classes, and often a greater number of descriptors within a given class, are now available. This paper reports an effort to explore interrelationships between QSAR models an...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci3005308

    authors: Shao CY,Chen SZ,Su BH,Tseng YJ,Esposito EX,Hopfinger AJ

    更新日期:2013-01-28 00:00:00

  • Reliable and Performant Identification of Low-Energy Conformers in the Gas Phase and Water.

    abstract::Prediction of compound properties from structure via quantitative structure-activity relationship and machine-learning approaches is an important computational chemistry task in small-molecule drug research. Though many such properties are dependent on three-dimensional structures or even conformer ensembles, the majo...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.8b00151

    authors: Cavasin AT,Hillisch A,Uellendahl F,Schneckener S,Göller AH

    更新日期:2018-05-29 00:00:00

  • Adaptive BP-Dock: An Induced Fit Docking Approach for Full Receptor Flexibility.

    abstract::We present an induced fit docking approach called Adaptive BP-Dock that integrates perturbation response scanning (PRS) with the flexible docking protocol of RosettaLigand in an adaptive manner. We first perturb the binding pocket residues of a receptor and obtain a new conformation based on the residue response fluct...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.5b00587

    authors: Bolia A,Ozkan SB

    更新日期:2016-04-25 00:00:00

  • Rigidity Strengthening: A Mechanism for Protein-Ligand Binding.

    abstract::Protein-ligand binding is essential to almost all life processes. The understanding of protein-ligand interactions is fundamentally important to rational drug and protein design. Based on large scale data sets, we show that protein rigidity strengthening or flexibility reduction is a mechanism in protein-ligand bindin...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00226

    authors: Nguyen DD,Xiao T,Wang M,Wei GW

    更新日期:2017-07-24 00:00:00

  • Identification of ligand templates using local structure alignment for structure-based drug design.

    abstract::With a rapid increase in the number of high-resolution protein-ligand structures, the known protein-ligand structures can be used to gain insight into ligand-binding modes in a target protein. On the basis of the fact that the structurally similar binding sites share information about their ligands, we have developed ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci300178e

    authors: Lee HS,Im W

    更新日期:2012-10-22 00:00:00

  • Comparison Study of Polar and Nonpolar Contributions to Solvation Free Energy.

    abstract::In this study, we compared the contributions of polar and nonpolar interactions to the solvation free energy of a solute in solvent, which is decomposed into four different terms based on the nature of interactions: (i) electrostatic solvation free energy term counting for the work done to move solute charges from fix...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00368

    authors: Izairi R,Kamberaj H

    更新日期:2017-10-23 00:00:00

  • Combined 3D-QSAR modeling and molecular docking study on indolinone derivatives as inhibitors of 3-phosphoinositide-dependent protein kinase-1.

    abstract::3-Phosphoinositide-dependent protein kinase-1 (PDK1) is a promising target for developing novel anticancer drugs. In order to understand the structure-activity correlation of indolinone-based PDK1 inhibitors, we have carried out a combined molecular docking and three-dimensional quantitative structure-activity relatio...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci800147v

    authors: AbdulHameed MD,Hamza A,Liu J,Zhan CG

    更新日期:2008-09-01 00:00:00

  • Automated extraction of information on chemical-P-glycoprotein interactions from the literature.

    abstract::Knowledge of the interactions between drugs and transporters is important for drug discovery and development as well as for the evaluation of their clinical safety. We recently developed a text-mining system for the automatic extraction of information on chemical-CYP3A4 interactions from the literature. This system is...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci4003188

    authors: Yoshida S,Yamashita F,Ose A,Maeda K,Sugiyama Y,Hashida M

    更新日期:2013-10-28 00:00:00

  • Comparative modeling and benchmarking data sets for human histone deacetylases and sirtuin families.

    abstract::Histone deacetylases (HDACs) are an important class of drug targets for the treatment of cancers, neurodegenerative diseases, and other types of diseases. Virtual screening (VS) has become fairly effective approaches for drug discovery of novel and highly selective histone deacetylase inhibitors (HDACIs). To facilitat...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci5005515

    authors: Xia J,Tilahun EL,Kebede EH,Reid TE,Zhang L,Wang XS

    更新日期:2015-02-23 00:00:00

  • Toward high throughput 3D virtual screening using spherical harmonic surface representations.

    abstract::Searching chemical databases for possible drug leads is often one of the main activities conducted during the early stages of a drug development project. This article shows that spherical harmonic molecular shape representations provide a powerful way to search and cluster small-molecule databases rapidly and accurate...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci7001507

    authors: Mavridis L,Hudson BD,Ritchie DW

    更新日期:2007-09-01 00:00:00

  • Support vector regression scoring of receptor-ligand complexes for rank-ordering and virtual screening of chemical libraries.

    abstract::The community structure-activity resource (CSAR) data sets are used to develop and test a support vector machine-based scoring function in regression mode (SVR). Two scoring functions (SVR-KB and SVR-EP) are derived with the objective of reproducing the trend of the experimental binding affinities provided within the ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci200078f

    authors: Li L,Wang B,Meroueh SO

    更新日期:2011-09-26 00:00:00

  • Modeling Binding with Large Conformational Changes: Key Points in Ensemble-Docking Approaches.

    abstract::Protein dynamics play a critical role in ligand binding, and different models have been proposed to explain the relationships between protein motion and molecular recognition. Here, we present a study of ligand-binding processes associated with large conformational changes of a protein to elucidate the critical choice...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00125

    authors: Motta S,Bonati L

    更新日期:2017-07-24 00:00:00

  • Ligand coordinate analysis of SC-558 from the active site to the surface of COX-2: a molecular dynamics study.

    abstract::We have performed a ligand coordinate analysis to monitor the movement of the inhibitor SC-558 from the active site of the COX-2 protein to the exterior using molecular dynamics techniques. This study provides an insight into the intermolecular interactions formed by the ligand during this journey. The published cryst...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci050142i

    authors: Sai Ram KV,Rambabu G,Sarma JA,Desiraju GR

    更新日期:2006-07-01 00:00:00

  • Identifying biologically active compound classes using phenotypic screening data and sampling statistics.

    abstract::Scoring the activity of compounds in phenotypic high-throughput assays presents a unique challenge because of the limited resolution and inherent measurement error of these assays. Techniques that leverage the structural similarity of compounds within an assay can be used to improve the hit-recovery rate from screenin...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci050087d

    authors: Klekota J,Brauner E,Schreiber SL

    更新日期:2005-11-01 00:00:00

  • Multifingerprint based similarity searches for targeted class compound selection.

    abstract::Molecular fingerprints are widely used for similarity-based virtual screening in drug discovery projects. In this paper we discuss the performance and the complementarity of nine two-dimensional fingerprints (Daylight, Unity, AlFi, Hologram, CATS, TRUST, Molprint 2D, ChemGPS, and ALOGP) in retrieving active molecules ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci0504723

    authors: Kogej T,Engkvist O,Blomberg N,Muresan S

    更新日期:2006-05-01 00:00:00

  • Allosteric Modulation of Human Hsp90α Conformational Dynamics.

    abstract::Central to Hsp90's biological function is its ability to interconvert between various conformational states. Drug targeting of Hsp90's regulatory mechanisms, including its modulation by cochaperone association, presents as an attractive therapeutic strategy for Hsp90 associated pathologies. In this study, we utilized ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00630

    authors: Penkler DL,Atilgan C,Tastan Bishop Ö

    更新日期:2018-02-26 00:00:00

  • Exploring Topological Pharmacophore Graphs for Scaffold Hopping.

    abstract::The primary goal of ligand-based virtual screening is to identify active compounds consisting of a core scaffold that is not found in the current active compound pool. Scaffold hopping is the term used for this purpose. In the present study, topological representations of pharmacophore features on chemical graphs were...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00098

    authors: Nakano H,Miyao T,Funatsu K

    更新日期:2020-04-27 00:00:00

  • Determination of partition coefficient of spin probe between different lipid membrane phases.

    abstract::Model lipid membranes made from binary mixtures of dimyristoylphosphatidylcholine/dipalmitoylphosphatidylcholine (DMPC/DPPC) and dimyristoylphosphatidylcholine/cholesterol (DMPC/Chol) exhibit coexistence of diverse lipid phases at appropriate temperature and composition. Since lipids in different phases show different...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci0501793

    authors: Arsov Z,Strancar J

    更新日期:2005-11-01 00:00:00

  • FOG: Fragment Optimized Growth algorithm for the de novo generation of molecules occupying druglike chemical space.

    abstract::An essential feature of all practical de novo molecule generating programs is the ability to focus the potential combinatorial explosion of grown molecules on a desired chemical space. It is a daunting task to balance the generation of new molecules with limitations on growth that produce desired features such as stab...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci9000458

    authors: Kutchukian PS,Lou D,Shakhnovich EI

    更新日期:2009-07-01 00:00:00

  • LiCABEDS II. Modeling of ligand selectivity for G-protein-coupled cannabinoid receptors.

    abstract::The cannabinoid receptor subtype 2 (CB2) is a promising therapeutic target for blood cancer, pain relief, osteoporosis, and immune system disease. The recent withdrawal of Rimonabant, which targets another closely related cannabinoid receptor (CB1), accentuates the importance of selectivity for the development of CB2 ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci3003914

    authors: Ma C,Wang L,Yang P,Myint KZ,Xie XQ

    更新日期:2013-01-28 00:00:00

  • Comparative Binding Analysis of N-Acetylneuraminic Acid in Bovine Serum Albumin and Human α-1 Acid Glycoprotein.

    abstract::The present study focuses on the determination of the biologically significant N-acetylneuraminic acid (NANA) drug binding interaction mechanism between bovine serum albumin (BSA) and human α-1 acid glycoprotein (HAG) using various optical spectroscopy and computational methods. The steady state fluorescence spectrosc...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.8b00558

    authors: Karthikeyan S,Bharanidharan G,Ragavan S,Kandasamy S,Chinnathambi S,Udayakumar K,Mangaiyarkarasi R,Sundaramoorthy A,Aruna P,Ganesan S

    更新日期:2019-01-28 00:00:00

  • The assembly-inducing laulimalide/peloruside a binding site on tubulin: molecular modeling and biochemical studies with [³H]peloruside A.

    abstract::We used synthetic peloruside A for the commercial preparation of [³H]peloruside A. The radiolabeled compound bound to preformed tubulin polymer in amounts stoichiometric with the polymer's tubulin content, with an apparent K(d) value of 0.35 μM. A less active peloruside A analogue, (11-R)-peloruside A and laulimalide ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci1002894

    authors: Nguyen TL,Xu X,Gussio R,Ghosh AK,Hamel E

    更新日期:2010-11-22 00:00:00

  • Similarity searching in databases of flexible 3D structures using autocorrelation vectors derived from smoothed bounded distance matrices.

    abstract::This paper presents an exploratory study of a novel method for flexible 3-D similarity searching based on autocorrelation vectors and smoothed bounded distance matrices. Although the new approach is unable to outperform an existing 2-D similarity searching in terms of enrichment factors, it is able to retrieve differe...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci0503863

    authors: Rhodes N,Clark DE,Willett P

    更新日期:2006-03-01 00:00:00

  • Spatial sign preprocessing: a simple way to impart moderate robustness to multivariate estimators.

    abstract::The spatial sign is a multivariate extension of the concept of sign. Recently multivariate estimators of covariance structures based on spatial signs have been examined by various authors. These new estimators are found to be robust to outlying observations. From a computational point of view, estimators based on spat...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci050498u

    authors: Serneels S,De Nolf E,Van Espen PJ

    更新日期:2006-05-01 00:00:00

  • Viscosity Prediction of Lubricants by a General Feed-Forward Neural Network.

    abstract::Modern industrial lubricants are often blended with an assortment of chemical additives to improve the performance of the base stock. Machine learning-based predictive models allow fast and veracious derivation of material properties and facilitate novel and innovative material designs. In this study, we outline the d...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b01068

    authors: Loh GC,Lee HC,Tee XY,Chow PS,Zheng JW

    更新日期:2020-03-23 00:00:00

  • Molecular Structure-Based Large-Scale Prediction of Chemical-Induced Gene Expression Changes.

    abstract::The quantitative structure-activity relationship (QSAR) approach has been used to model a wide range of chemical-induced biological responses. However, it had not been utilized to model chemical-induced genomewide gene expression changes until very recently, owing to the complexity of training and evaluating a very la...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00281

    authors: Liu R,AbdulHameed MDM,Wallqvist A

    更新日期:2017-09-25 00:00:00

  • Assessing the Protective Activity of a Recently Discovered Phenolic Compound against Oxidative Stress Using Computational Chemistry.

    abstract::The protection exerted by 3,5-dihydroxy-4-methoxybenzyl alcohol (DHMBA), a phenolic compound recently isolated from the Pacific oyster, against oxidative stress (OS) is investigated using the density functional theory. Our results indicate that DHMBA is an outstanding peroxyl radical scavenger, being about 15 times an...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.5b00513

    authors: Villuendas-Rey Y,Alvarez-Idaboy JR,Galano A

    更新日期:2015-12-28 00:00:00