Abstract:
With chemical libraries increasingly containing millions of compounds or more, there is a fast-growing need for computational methods that can rank or prioritize compounds for screening. Machine learning methods have shown considerable promise for this task; indeed, classification methods such as support vector machines (SVMs), together with their variants, have been used in virtual screening to distinguish active compounds from inactive ones, while regression methods such as partial least-squares (PLS) and support vector regression (SVR) have been used in quantitative structure-activity relationship (QSAR) analysis for predicting biological activities of compounds. Recently, a new class of machine learning methods - namely, ranking methods, which are designed to directly optimize ranking performance - have been developed for ranking tasks such as web search that arise in information retrieval (IR) and other applications. Here we report the application of these new ranking methods in machine learning to the task of ranking chemical structures. Our experiments show that the new ranking methods give better ranking performance than both classification based methods in virtual screening and regression methods in QSAR analysis. We also make some interesting connections between ranking performance measures used in cheminformatics and those used in IR studies.
journal_name: J Chem Inf Model
journal_title: Journal of chemical information and modeling
authors: Agarwal S, Dugar D, Sengupta S
doi: 10.1021/ci9003865
subject: Has Abstract
pub_date: 2010-05-24 00:00:00
pages: 716-31
issue: 5
eissn: 1549-9596
issn: 1549-960X
journal_volume: 50
pub_type: Journal Article

abstract::Telomere maintenance is a universal cancer hallmark, and small molecules that disrupt telomere maintenance generally have anticancer properties. Since the vast majority of cancer cells utilize telomerase activity for telomere maintenance, the enzyme has been considered as an anticancer drug target. Recently, rational ...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.5b00336
Update date: 2015-12-28 00:00:00
abstract::In this study we investigate π-stacking interactions of a variety of aromatic heterocycles with benzene using dispersion corrected density functional theory. We calculate extensive potential energy surfaces for parallel-displaced interaction geometries. We find that dispersion contributes significantly to the interact...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci500183u
Update date: 2014-05-27 00:00:00
abstract::With a rapid increase in the number of high-resolution protein-ligand structures, the known protein-ligand structures can be used to gain insight into ligand-binding modes in a target protein. On the basis of the fact that the structurally similar binding sites share information about their ligands, we have developed ...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci300178e
Update date: 2012-10-22 00:00:00
abstract::Identifying drug-target interactions (DTIs) plays an important role in the field of drug discovery, drug side-effects, and drug repositioning. However, in vivo or biochemical experimental methods for identifying new DTIs are extremely expensive and time-consuming. Recently, in silico or various computational methods h...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.9b00408
Update date: 2019-07-22 00:00:00
abstract::The COSMO surface polarization charge density σ resulting from quantum chemical calculations combined with a virtual conductor embedding has been widely proven to be a very suitable descriptor for the quantification of interactions of molecules in liquids. In a preceding paper, grid-based local histograms of σ have be...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci300231t
Update date: 2012-08-27 00:00:00
abstract::Engineered nanomaterials (ENMs) are increasingly infiltrating our lives as a result of their applications across multiple fields. However, ENM formulations may result in the modulation of pathways and mechanisms of toxic action that endanger human health and the environment. Alternative testing methods such as in sili...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.7b00223
Update date: 2017-09-25 00:00:00
abstract::The ongoing search for fast Li-ion conducting solid electrolytes has driven the deployment surge on density functional theory (DFT) computation and materials informatics for exploring novel chemistries before actual experimental testing. Existing structure prototypes can now be readily evaluated beforehand not only to...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci500752n
Update date: 2015-06-22 00:00:00
abstract::Up to now, publicly available data sets to build and evaluate Ames mutagenicity prediction tools have been very limited in terms of size and chemical space covered. In this report we describe a new unique public Ames mutagenicity data set comprising about 6500 nonconfidential compounds (available as SMILES strings and...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci900161g
Update date: 2009-09-01 00:00:00
abstract::In the present contribution, we have developed a database, called the FAR-database, where the acronym FAR stands for Fused Aromatic Rings, which presents the results of nuclear independent chemical shifts calculations, NICS(0), NICS(1), NICS(0)ZZ, and NICS(1)ZZ, of 660 neutral benzenoid-PAHs and cyclopenta-fused PAHs....
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.9b00909
Update date: 2020-02-24 00:00:00
abstract::Aminoglycosides are antibiotics targeting the 16S RNA A site of the bacterial ribosome. There have been many efforts directed toward design of their synthetic derivatives, however with only few successes. As RNA binders, aminoglycosides are also a difficult target for computational drug design, since most of the exist...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci800361a
Update date: 2009-02-01 00:00:00
abstract::Computer programs for structure diagram generation (SDG) are indispensable cheminformatic tools that translate one- or three-dimensional (1D or 3D) chemical structure data stored in electronic formats to human-readable 2D depictions. Although many such programs are known, only a moderate part of chemical space can be ...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.6b00391
Update date: 2016-12-27 00:00:00
abstract::G-protein-coupled receptors (GPCRs) transmit signals into the cell in response to ligand binding at its extracellular domain, which is characterized by the coupling of agonist-induced receptor conformational change to guanine nucleotide (GDP) exchange with guanosine triphosphate on a heterotrimeric (αβγ) guanine nucle...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.0c00432
Update date: 2020-08-24 00:00:00
abstract::3-Pyridyl ethers are excellent nAChRs ligands, which show high subtype selectivity and binding affinity to alpha4beta2 nAChR. Although the quantitative structure-activity relationship (QSAR) of nAChRs ligands has been widely investigated using various classes of compounds, the open ring analogues of 3-pyridyl ethers h...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci0498113
Update date: 2005-03-01 00:00:00
abstract::Searching chemical databases for possible drug leads is often one of the main activities conducted during the early stages of a drug development project. This article shows that spherical harmonic molecular shape representations provide a powerful way to search and cluster small-molecule databases rapidly and accurate...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci7001507
Update date: 2007-09-01 00:00:00
abstract::The solvation layer surrounding a protein is clearly an intrinsic part of protein structure-dynamics-function, and our understanding of how the hydration dynamics influences protein function is emerging. We have recently reported simulations indicating a correlation between regional hydration dynamics and the structur...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.9b00009
Update date: 2019-05-28 00:00:00
abstract::Blockade of human ether-à-go-go related gene (hERG) channel prolongs the duration of the cardiac action potential and is a common reason for drug failure in preclinical safety trials. Therefore, it is of great importance to develop robust in silico tools to predict potential hERG blockers in the early stages of drug d...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci200271d
Update date: 2011-11-28 00:00:00
abstract::Metal-ligand (M-L) bond lengths for a range of ligands (carboxylates, chlorides, pyridines, water, tertiary phosphines, and alkenes) and a variety of metals have been retrieved from the Cambridge Structural Database, CSD. Analysis of the factors which affect M-L bond lengths (for example, ligand coordination mode, oxi...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci0500785
Update date: 2005-11-01 00:00:00
abstract::Random Forest regression (RF), Partial-Least-Squares (PLS) regression, Support Vector Machines (SVM), and Artificial Neural Networks (ANN) were used to develop QSPR models for the prediction of aqueous solubility, based on experimental data for 988 organic molecules. The Random Forest regression model predicted aqueou...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci060164k
Update date: 2007-01-01 00:00:00
abstract::Throughout the drug discovery process, discovery teams are compelled to use statistics for making decisions using data from a variety of inputs. For instance, teams are asked to prioritize compounds for subsequent stages of the drug discovery process, given results from multiple screens. To assist in the prioritizatio...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci600556v
Update date: 2007-05-01 00:00:00
abstract::Resonance structures of polycyclic aromatic hydrocarbons can be associated with numerical formulas by assigning pi-electrons of C=C double bonds to individual benzenoid rings. Each C=C double bond in a resonance structure assigns two pi-electrons to a ring in a fused-benzenoid system if it is not shared by adjacent ri...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci050196s
Update date: 2006-01-01 00:00:00
abstract::Partial covalent interactions (PCIs) in proteins, which include hydrogen bonds, salt bridges, cation-π, and π-π interactions, contribute to thermodynamic stability and facilitate interactions with other biomolecules. Several score functions have been developed within the Rosetta protein modeling framework that identif...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.7b00398
Update date: 2018-05-29 00:00:00
abstract::The appropriate selection of a chemical space represented by the data set, the selection of its chemical data representation, the development of a correct modeling process using a robust and reproducible algorithm, and the performance of an exhaustive training and external validation determine the usability and reprod...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.7b00492
Update date: 2017-11-27 00:00:00
abstract::In the preceding paper (Duca, J. S.; Madison, V. S.; Voigt, J. H. J. Chem. Inf. Model. 2008, 48, 659-668), the accuracy of docking and affinity predictions of the Gold and Glide programs were investigated using single protein conformations spanning 150 CDK2/inhibitor crystallographic complexes. High docking accuracy w...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci700428d
Update date: 2008-03-01 00:00:00
abstract::The current drug virtual screen (VS) methods mainly include two categories. i.e., ligand/target structure-based virtual screen and that, utilizing protein-ligand interaction fingerprint information based on the large number of complex structures. Since the former one focuses on the one-side information while the later...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci200481c
Update date: 2012-03-26 00:00:00
abstract::Aβ25-35 is a short, cytotoxic, and naturally occurring fragment of the Alzheimer's Aβ peptide. To map the molecular mechanism of Aβ25-35 binding to the zwitterionic dimyristoylphosphatidylcholine (DMPC) bilayer, we have performed replica exchange with solute tempering molecular dynamics simulations using all-atom expl...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.8b00045
Update date: 2018-05-29 00:00:00
abstract::CDC25 phosphatases play critical roles in cell cycle regulation and are attractive targets for anticancer therapies. Several small non-peptide molecules are known to inhibit CDC25, but many of them appear to form a covalent bond with the enzyme or act through oxidation of the thiolate group of the catalytic cysteine. ...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci700313e
Update date: 2008-01-01 00:00:00
abstract::Protein arginine methyltransferases (PRMTs) catalyze the posttranslational methylation of arginine, which is important in a range of biological processes, including epigenetic regulation, signal transduction, and cancer progression. Although previous studies of PRMT1 mutants suggest that the dimerization arm and the N...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/acs.jcim.5b00454
Update date: 2015-12-28 00:00:00
abstract::In this study, we have developed a two model system to mimic the active and inactive states of a G-protein coupled receptor specifically the alpha1A adrenergic receptor. We have docked two agonists, epinephrine (phenylamine type) and oxymetazoline (imidazoline type), as well as two antagonists, prazosin and 5-methylur...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci700026v
Update date: 2007-09-01 00:00:00
abstract::Cyanobacterial fructose-1,6-/sedoheptulose-1,7-bisphosphatase (cy-FBP/SBPase) is a potential enzymatic target for screening of novel inhibitors that can combat harmful algal blooms. In the present study, we targeted the substrate binding pocket of cy-FBP/SBPase. A series of novel hit compounds from the SPECs database w...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci4007529
Update date: 2014-03-24 00:00:00
abstract::We present a theoretical study on the performance of ensemble docking methodologies considering multiple protein structures. We perform a theoretical analysis of pose prediction experiments which is completely unbiased, as we make no assumptions about specific scoring functions, search paradigms, protein structures, o...
journal_title:Journal of chemical information and modeling
pub_type: Journal Article
doi:10.1021/ci2002796
Update date: 2011-11-28 00:00:00