Abstract:
:The appropriate selection of a chemical space represented by the data set, the selection of its chemical data representation, the development of a correct modeling process using a robust and reproducible algorithm, and the performance of an exhaustive training and external validation determine the usability and reproducibility of a quantitative structure-activity relationship (QSAR) classification model. In this paper, we show that the use of relative versus absolute data in the representation of the data sets produces better classification models when the other processes are not modified. Relative data considers a reference frame to measure the chemical characteristics involved in the classification model, refining the data set representation and smoothing the lack of chemical information. Three data sets with different characteristics have been used in this study, and classifications models have been built applying the support vector machine algorithm. For randomly selected training and test sets, values of accuracy and area under the receiver operating characteristic curve close to 100% have been obtained for the generation of the models and external validations in all cases.
journal_name
J Chem Inf Modeljournal_title
Journal of chemical information and modelingauthors
Ruiz IL,Gómez-Nieto MÁdoi
10.1021/acs.jcim.7b00492subject
Has Abstractpub_date
2017-11-27 00:00:00pages
2776-2788issue
11eissn
1549-9596issn
1549-960Xjournal_volume
57pub_type
杂志文章abstract::The COSMO surface polarization charge density σ resulting from quantum chemical calculations combined with a virtual conductor embedding has been widely proven to be a very suitable descriptor for the quantification of interactions of molecules in liquids. In a preceding paper, grid-based local histograms of σ have be...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci300231t
更新日期:2012-08-27 00:00:00
abstract::Large ring cyclodextrins have become increasingly important for drug delivery applications. In this work, we have performed replica-exchange molecular dynamics simulations using both implicit and explicit water solvation models to study the conformational diversity of iota-cyclodextrin containing 14 α-1,4 glycosidic l...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.6b00595
更新日期:2017-04-24 00:00:00
abstract::Big data is one of the key transformative factors which increasingly influences all aspects of modern life. Although this transformation brings vast opportunities it also generates novel challenges, not the least of which is organizing and searching this data deluge. The field of medicinal chemistry is not different: ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.7b00249
更新日期:2017-08-28 00:00:00
abstract::The goal of the present study was to ascertain the differential performance of a long molecular dynamics trajectory versus several shorter ones starting from different points in the phase space and covering the same sampling time. For this purpose, we selected the 16-mer peptide Bak16BH3 as a model for study and carri...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.6b00347
更新日期:2016-10-24 00:00:00
abstract::Flavonoids, the vastest class of natural polyphenols, are extensively investigated for their multiple benefits on human health. Due to their physicochemical or biological properties, many representatives are considered to exhibit low selectivity among various protein targets or to plague high-throughput screening (HTS...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci5002668
更新日期:2014-08-25 00:00:00
abstract::Growing data sets with increased time for analysis is hampering predictive modeling in drug discovery. Model building can be carried out on high-performance computer clusters, but these can be expensive to purchase and maintain. We have evaluated ligand-based modeling on cloud computing resources where computations ar...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci500580y
更新日期:2015-01-26 00:00:00
abstract::Aβ25-35 is a short, cytotoxic, and naturally occurring fragment of the Alzheimer's Aβ peptide. To map the molecular mechanism of Aβ25-35 binding to the zwitterionic dimyristoylphosphatidylcholine (DMPC) bilayer, we have performed replica exchange with solute tempering molecular dynamics simulations using all-atom expl...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.8b00045
更新日期:2018-05-29 00:00:00
abstract::We present a theoretical study on the performance of ensemble docking methodologies considering multiple protein structures. We perform a theoretical analysis of pose prediction experiments which is completely unbiased, as we make no assumptions about specific scoring functions, search paradigms, protein structures, o...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci2002796
更新日期:2011-11-28 00:00:00
abstract::Engineering shape-controlled bionanomaterials requires comprehensive understanding of interactions between biomolecules and inorganic surfaces. We explore the origin of facet-selective binding of peptides adsorbed onto Pt(100) and Pt(111) crystallographic planes. Using molecular dynamics simulations, we show that upon...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci400630d
更新日期:2013-12-23 00:00:00
abstract::Searching chemical databases for possible drug leads is often one of the main activities conducted during the early stages of a drug development project. This article shows that spherical harmonic molecular shape representations provide a powerful way to search and cluster small-molecule databases rapidly and accurate...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci7001507
更新日期:2007-09-01 00:00:00
abstract::All molecules of up to 11 atoms of C, N, O, and F possible under consideration of simple valency, chemical stability, and synthetic feasibility rules were generated and collected in a database (GDB). GDB contains 26.4 million molecules (110.9 million stereoisomers), including three- and four-membered rings and triple ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci600423u
更新日期:2007-03-01 00:00:00
abstract::The number of journal articles in the scientific domain has grown to the point where it has become impossible for researchers to capitalize on all findings in their relevant discipline. Information is stored in these articles in a number of ways, including figures that describe important results. In organic chemistry,...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.0c00042
更新日期:2020-04-27 00:00:00
abstract::We describe a novel deep learning neural network method and its application to impute assay pIC50 values. Unlike conventional machine learning approaches, this method is trained on sparse bioactivity data as input, typical of that found in public and commercial databases, enabling it to learn directly from correlation...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.8b00768
更新日期:2019-03-25 00:00:00
abstract::Due to the recent availability of high quality small molecule databases, such as ZINC and PubChem,1,2 virtual screening is playing an even more important role in identifying biologically relevant molecules in drug discovery campaigns. The success of pharmacophore-based virtual screening (PBVS) relies largely on the ac...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci700089w
更新日期:2007-07-01 00:00:00
abstract::In this DFT study, activities of 11 different N2O4, N2O3, and NO2 core containing Zr(IV) complexes, 4,13-diaza-18-crown-6 (I'N2O4), 1,4,10-trioxa-7,13-diazacyclopentadecane (I'N2O3), and 2-(2-methoxy)ethanol (I'NO2), respectively, and their analogues in peptide hydrolysis have been investigated. Based on the experimen...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.6b00781
更新日期:2017-05-22 00:00:00
abstract::Template CoMFA, a novel alignment methodology for training or test set structures in 3D-QSAR, is introduced. Its two most significant advantages are its complete automation and its ability to derive a single combined model from multiple structural series affecting a biological target. Its only two inputs are one or mo...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci400696v
更新日期:2014-02-24 00:00:00
abstract::We describe a general methodology for designing an empirical scoring function and provide smina, a version of AutoDock Vina specially optimized to support high-throughput scoring and user-specified custom scoring functions. Using our general method, the unique capabilities of smina, a set of default interaction terms ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci300604z
更新日期:2013-08-26 00:00:00
abstract::We present our newly developed and highly efficient lossless compression algorithm for trajectories of atom positions and volumetric data. The algorithm is designed as a two-step approach. In the first step, efficient polynomial extrapolation schemes reduce the information entropy of the data by exploiting both spatia...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.8b00501
更新日期:2018-10-22 00:00:00
abstract::In the present work, the Henderson-Hasselbalch (HH) equation has been employed for the development of a tool for the prediction of pH-dependent aqueous solubility of drugs and drug candidates. A new prediction method for the intrinsic solubility was developed, based on artificial neural networks that have been trained...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci600292q
更新日期:2006-11-01 00:00:00
abstract::Umami or the taste of monosodium glutamate represents one of the major attractive taste modalities in humans. Therefore, knowledge about biophysical and biochemical properties of the umami taste is important for both scientific research and the food industry. Experimental approaches for predicting umami peptides are l...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.0c00707
更新日期:2020-12-28 00:00:00
abstract::Accurate hydrogen placement in molecular modeling is crucial for studying the interactions and dynamics of biomolecular systems. The carboxyl functional group is a prototypical example of a functional group that requires protonation during structure preparation. To our knowledge, when in their neutral form, carboxylic...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.8b00835
更新日期:2019-05-28 00:00:00
abstract::New hits against HIV-1 wild-type and Y181C drug-resistant reverse transcriptases were predicted taking into account the possibility of some of the known metabolism interactions. In silico hits against a set of antitargets (i.e., proteins or nucleic acids that are off-targets from the desired pharmaceutical target obje...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci200203h
更新日期:2011-10-24 00:00:00
abstract::The causative agent of African sleeping sickness, Trypanosoma brucei , undergoes an unusual mitochondrial RNA editing process that is essential for its survival. RNA editing terminal uridylyl transferase 2 of T. brucei (TbRET2) is an indispensable component of the editosome machinery that performs this editing. TbR...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci3001327
更新日期:2012-05-25 00:00:00
abstract::Ionotropic glutamate receptors (iGluRs) are synaptic proteins that facilitate signal transmission in the central nervous system. Extracellular iGluR cleft closure is linked to receptor activation; however, the mechanism underlying partial agonism is not entirely understood. Full agonists close the bilobed ligand-bindi...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci2000055
更新日期:2011-05-23 00:00:00
abstract::The second extracellular loops (ECL2s) of G-protein-coupled receptors (GPCRs) are often involved in GPCR functions, and their structures have important implications in drug discovery. However, structure prediction of ECL2 is difficult because of its long length and the structural diversity among different GPCRs. In th...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.8b00148
更新日期:2018-06-25 00:00:00
abstract::Fragment-based methods have emerged in the last two decades as alternatives to traditional high throughput screenings for the identification of chemical starting points in drug discovery. One arguable yet popular assumption about fragment-based design is that the fragment binding mode remains conserved upon chemical e...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci300355p
更新日期:2012-12-21 00:00:00
abstract::We propose an improved solvent contact model to estimate the solvation free energy of an organic molecule from individual atomic contributions. The modification of the solvation model involves the optimization of three kinds of parameters in the solvation free energy function: atomic fragmental volume, maximum atomic ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci600453b
更新日期:2007-03-01 00:00:00
abstract::In this study, two probabilistic machine-learning algorithms were compared for in silico target prediction of bioactive molecules, namely the well-established Laplacian-modified Naïve Bayes classifier (NB) and the more recently introduced (to Cheminformatics) Parzen-Rosenblatt Window. Both classifiers were trained in ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci300435j
更新日期:2013-08-26 00:00:00
abstract::In this work we present the third generation of FAst MEtabolizer (FAME 3), a collection of extra trees classifiers for the prediction of sites of metabolism (SoMs) in small molecules such as drugs, druglike compounds, natural products, agrochemicals, and cosmetics. FAME 3 was derived from the MetaQSAR database ( Pedre...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00376
更新日期:2019-08-26 00:00:00
abstract::It is of great interest in modern drug design to accurately calculate the free energies of protein-ligand or nucleic acid-ligand binding. MM-PBSA (molecular mechanics Poisson-Boltzmann surface area) and MM-GBSA (molecular mechanics generalized Born surface area) have gained popularity in this field. For both methods, ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci300064d
更新日期:2012-05-25 00:00:00