Spatial sign preprocessing: a simple way to impart moderate robustness to multivariate estimators.


The spatial sign is a multivariate extension of the concept of sign. Recently, multivariate estimators of covariance structures based on spatial signs have been examined by various authors. These new estimators are found to be robust to outlying observations. From a computational point of view, estimators based on spatial signs are very easy to implement, as they boil down to a transformation of the data to their spatial signs, from which the classical estimator is then computed. Hence, one can also consider the transformation to spatial signs to be a preprocessing technique, which ensures that the calibration procedure as a whole is robust. In this paper, we examine the special case of spatial sign preprocessing in combination with partial least squares regression, as the latter technique is frequently applied in the context of chemical data analysis. In a simulation study, we compare the performance of the spatial sign transformation to nontransformed data as well as to two robust counterparts of partial least squares regression. It turns out that the spatial sign transform is fairly efficient but has some undesirable bias properties. The method is applied to a recently published data set in the field of quantitative structure-activity relationships, where it is seen to perform as well as the previously described best linear model for these data.
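The preprocessing step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `spatial_sign`, the choice of the coordinatewise median as the robust center, and the toy data are all assumptions made for illustration. Each centered observation is projected onto the unit sphere, so a far outlier can contribute no more than any other point to whatever classical estimator is computed afterwards.

```python
import numpy as np


def spatial_sign(X, center=None, eps=1e-12):
    """Map each row of X to its spatial sign (x - center) / ||x - center||.

    Assumption: if no center is given, the coordinatewise median is used
    as a simple robust location estimate. Rows that coincide with the
    center are mapped to the zero vector.
    """
    X = np.asarray(X, dtype=float)
    if center is None:
        center = np.median(X, axis=0)
    Z = X - center
    norms = np.linalg.norm(Z, axis=1, keepdims=True)
    nonzero = norms > eps
    safe_norms = np.where(nonzero, norms, 1.0)  # avoid division by zero
    return np.where(nonzero, Z / safe_norms, 0.0)


# Toy data (hypothetical): three regular points and one gross outlier.
X = np.array([[1.0, 2.0], [2.0, 1.0], [3.0, 3.0], [1000.0, -500.0]])
S = spatial_sign(X)

# Every transformed row has unit length, including the outlier.
print(np.linalg.norm(S, axis=1))

# A classical estimator computed on the signs, here the sample covariance.
print(np.cov(S, rowvar=False))
```

The transformed matrix `S` can then be passed to any classical estimator in place of `X`, which is the sense in which the abstract calls the transform a preprocessing technique. In a regression setting such as partial least squares, only the predictor block would be transformed this way in this sketch.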


J Chem Inf Model


Serneels S,De Nolf E,Van Espen PJ






2006-05-01 00:00:00












  • Assessing different classification methods for virtual screening.

    abstract::How well do different classification methods perform in selecting the ligands of a protein target out of large compound collections not used to train the model? Support vector machines, random forest, artificial neural networks, k-nearest-neighbor classification with genetic-algorithm-optimized feature selection, tren...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Plewczynski D,Spieser SA,Koch U

    updated: 2006-05-01 00:00:00

  • Evaluation of different virtual screening programs for docking in a charged binding pocket.

    abstract::Virtual screening of small molecules against a protein target often identifies the correct pose, but the ranking in terms of binding energy remains a difficult problem, resulting in unacceptable numbers of false positives and negatives. To investigate this problem, the performance of three docking programs, FRED, QXP/...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Deng W,Verlinde CL

    updated: 2008-10-01 00:00:00

  • Improved Computation of Protein-Protein Relative Binding Energies with the Nwat-MMGBSA Method.

    abstract::A MMGBSA variant (here referred to as Nwat-MMGBSA), based on the inclusion of a certain number of explicit water molecules (Nwat) during the calculations, has been tested on a set of 20 protein-protein complexes, using the correlation between predicted and experimental binding energy as the evaluation metric. Besides ...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Maffucci I,Contini A

    updated: 2016-09-26 00:00:00

  • Molecular simulations of aromatase reveal new insights into the mechanism of ligand binding.

    abstract::CYP19A1, also known as aromatase or estrogen synthetase, is the rate-limiting enzyme in the biosynthesis of estrogens from their corresponding androgens. Several clinically used breast cancer therapies target aromatase. In this work, explicitly solvated all-atom molecular dynamics simulations of aromatase with a model...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Park J,Czapla L,Amaro RE

    updated: 2013-08-26 00:00:00

  • Estimation of ligand efficacies of metabotropic glutamate receptors from conformational forces obtained from molecular dynamics simulations.

    abstract::Group 1 metabotropic glutamate receptors (mGluR) are G-protein coupled receptors with a large bilobate extracellular ligand binding region (LBR) that resembles a Venus fly trap. Closing of this LBR in the presence of a ligand is associated with the activation of the receptor. From conformational sampling of the LBR-li...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Lakkaraju SK,Xue F,Faden AI,MacKerell AD Jr

    updated: 2013-06-24 00:00:00

  • Adaptive configuring of radial basis function network by hybrid particle swarm algorithm for QSAR studies of organic compounds.

    abstract::The configuring of a radial basis function network (RBFN) consists of selecting the network parameters (centers and widths in RBF units and weights between the hidden and output layers) and network architecture. The issues of suboptimum and overfitting, however, often occur in RBFN configuring. This paper presented a ...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Zhou YP,Jiang JH,Lin WQ,Zou HY,Wu HL,Shen GL,Yu RQ

    updated: 2006-11-01 00:00:00

  • Pharmacophore Model for Wnt/Porcupine Inhibitors and Its Use in Drug Design.

    abstract::Porcupine is a component of the Wnt pathway which regulates cell proliferation, migration, stem cell self-renewal, and differentiation. The Wnt pathway has been shown to be dysregulated in a variety of cancers. Porcupine is a membrane bound O-acyltransferase that palmitoylates Wnt. Inhibiting porcupine blocks the secr...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Poulsen A,Ho SY,Wang W,Alam J,Jeyaraj DA,Ang SH,Tan ES,Lin GR,Cheong VW,Ke Z,Lee MA,Keller TH

    updated: 2015-07-27 00:00:00

  • FragPELE: Dynamic Ligand Growing within a Binding Site. A Novel Tool for Hit-To-Lead Drug Design.

    abstract::The early stages of drug discovery rely on hit-to-lead programs, where initial hits undergo partial optimization to improve binding affinities for their biological target. This is an expensive and time-consuming process, requiring multiple iterations of trial and error designs, an ideal scenario for applying computer ...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Perez C,Soler D,Soliva R,Guallar V

    updated: 2020-03-23 00:00:00

  • Improving protocols for protein mapping through proper comparison to crystallography data.

    abstract::Computational approaches to fragment-based drug design (FBDD) can complement experiments and facilitate the identification of potential hot spots along the protein surface. However, the evaluation of computational methods for mapping binding sites frequently focuses upon the ability to reproduce crystallographic coord...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Lexa KW,Carlson HA

    updated: 2013-02-25 00:00:00

  • Mechanisms for Flavin-Mediated Oxidation: Hydride or Hydrogen-Atom Transfer?

    abstract::Flavins are versatile biological cofactors which catalyze proton-coupled electron transfers (PCET) with varying number and coupling of electrons. Flavin-mediated oxidations of nicotinamide adenine dinucleotide (NADH) and of succinate, initial redox reactions in cellular respiration, were examined here with multiconfig...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Curtolo F,Arantes GM

    updated: 2020-12-28 00:00:00

  • Exploring Topological Pharmacophore Graphs for Scaffold Hopping.

    abstract::The primary goal of ligand-based virtual screening is to identify active compounds consisting of a core scaffold that is not found in the current active compound pool. Scaffold hopping is the term used for this purpose. In the present study, topological representations of pharmacophore features on chemical graphs were...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Nakano H,Miyao T,Funatsu K

    updated: 2020-04-27 00:00:00

  • Computational evidence for the role of Arabidopsis thaliana UVR8 as UV-B photoreceptor and identification of its chromophore amino acids.

    abstract::A homology model of the Arabidopsis thaliana UV resistance locus 8 (UVR8) protein is presented herein, showing a seven-bladed β-propeller conformation similar to the globular structure of RCC1. The UVR8 amino acid sequence contains a very high amount of conserved tryptophans, and the homology model shows that seven of...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Wu M,Grahn E,Eriksson LA,Strid A

    updated: 2011-06-27 00:00:00

  • Predicted Biological Activity of Purchasable Chemical Space.

    abstract::Whereas 400 million distinct compounds are now purchasable within the span of a few weeks, the biological activities of most are unknown. To facilitate access to new chemistry for biology, we have combined the Similarity Ensemble Approach (SEA) with the maximum Tanimoto similarity to the nearest bioactive to predict a...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Irwin JJ,Gaskins G,Sterling T,Mysinger MM,Keiser MJ

    updated: 2018-01-22 00:00:00

  • Study of chromatographic retention of natural terpenoids by chemoinformatic tools.

    abstract::The study of chromatographic retention of natural products can be used to increase their identification speed in complex biological matrices. In this work, six variables were used to study the retention behavior in reversed phase liquid chromatography of 39 sesquiterpene lactones (SL) from an in-house database using c...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Oliveira TB,Gobbo-Neto L,Schmidt TJ,Da Costa FB

    updated: 2015-01-26 00:00:00

  • Computational Prediction and Biochemical Analyses of New Inverse Agonists for the CB1 Receptor.

    abstract::Human cannabinoid type 1 (CB1) G-protein coupled receptor is a potential therapeutic target for obesity. The previously predicted and experimentally validated ensemble of ligand-free conformations of CB1 [Scott, C. E. et al. Protein Sci. 2013 , 22 , 101 - 113 ; Ahn, K. H. et al. Proteins 2013 , 81 , 1304 - 1317] are u...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Scott CE,Ahn KH,Graf ST,Goddard WA 3rd,Kendall DA,Abrol R

    updated: 2016-01-25 00:00:00

  • Structure-Based Rational Design of Novel Inhibitors Against Fructose-1,6-Bisphosphate Aldolase from Candida albicans.

    abstract::Class II fructose-1,6-bisphosphate aldolases (FBA-II) are attractive new targets for the discovery of drugs to combat invasive fungal infection, because they are absent in animals and higher plants. Although several FBA-II inhibitors have been reported, none of these inhibitors exhibit antifungal effect so far. In thi...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Han X,Zhu X,Hong Z,Wei L,Ren Y,Wan F,Zhu S,Peng H,Guo L,Rao L,Feng L,Wan J

    updated: 2017-06-26 00:00:00

  • PyCGTOOL: Automated Generation of Coarse-Grained Molecular Dynamics Models from Atomistic Trajectories.

    abstract::Development of coarse-grained (CG) molecular dynamics models is often a laborious process which commonly relies upon approximations to similar models, rather than systematic parametrization. PyCGTOOL automates much of the construction of CG models via calculation of both equilibrium values and force constants of inter...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Graham JA,Essex JW,Khalid S

    updated: 2017-04-24 00:00:00

  • Kinetic Models of Cyclosporin A in Polar and Apolar Environments Reveal Multiple Congruent Conformational States.

    abstract::The membrane permeability of cyclic peptides and peptidomimetics, which are generally larger and more complex than typical drug molecules, is likely strongly influenced by the conformational behavior of these compounds in polar and apolar environments. The size and complexity of peptides often limit their bioavailabil...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Witek J,Keller BG,Blatter M,Meissner A,Wagner T,Riniker S

    updated: 2016-08-22 00:00:00

  • Prediction of synthetic accessibility based on commercially available compound databases.

    abstract::A compound's synthetic accessibility (SA) is an important aspect of drug design, since in some cases computer-designed compounds cannot be synthesized. There have been several reports on SA prediction, most of which have focused on the difficulties of synthetic reactions based on retro-synthesis analyses, reaction dat...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Fukunishi Y,Kurosawa T,Mikami Y,Nakamura H

    updated: 2014-12-22 00:00:00

  • Structural protein-ligand interaction fingerprints (SPLIF) for structure-based virtual screening: method and benchmark study.

    abstract::Accurate and affordable assessment of ligand-protein affinity for structure-based virtual screening (SB-VS) is a standing challenge. Hence, empirical postdocking filters making use of various types of structure-activity information may prove useful. Here, we introduce one such filter based upon three-dimensional struc...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Da C,Kireev D

    updated: 2014-09-22 00:00:00

  • Systematic analysis of enzyme-catalyzed reaction patterns and prediction of microbial biodegradation pathways.

    abstract::The roles of chemical compounds in biological systems are now systematically analyzed by high-throughput experimental technologies. To automate the processing and interpretation of large-scale data it is necessary to develop bioinformatics methods to extract information from the chemical structures of these small mole...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Oh M,Yamada T,Hattori M,Goto S,Kanehisa M

    updated: 2007-07-01 00:00:00

  • In silico target predictions: defining a benchmarking data set and comparison of performance of the multiclass Naïve Bayes and Parzen-Rosenblatt window.

    abstract::In this study, two probabilistic machine-learning algorithms were compared for in silico target prediction of bioactive molecules, namely the well-established Laplacian-modified Naïve Bayes classifier (NB) and the more recently introduced (to Cheminformatics) Parzen-Rosenblatt Window. Both classifiers were trained in ...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Koutsoukas A,Lowe R,Kalantarmotamedi Y,Mussa HY,Klaffke W,Mitchell JB,Glen RC,Bender A

    updated: 2013-08-26 00:00:00

  • Coordination of Na(+) by monoamine ligands in dopamine, norepinephrine, and serotonin transporters.

    abstract::The reuptake of neurotransmitters by dopamine, norepinephrine, and serotonin transporters during neuronal transmission requires a sodium gradient. An "ionic mode" of binding proposes that aspartate anchors the ligand's positive charge but ignores the direct role of sodium in ligand binding seen in the only representat...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Xhaard H,Backström V,Denessiouk K,Johnson MS

    updated: 2008-07-01 00:00:00

  • Statistical Analysis on the Performance of Molecular Mechanics Poisson-Boltzmann Surface Area versus Absolute Binding Free Energy Calculations: Bromodomains as a Case Study.

    abstract::Binding free energy calculations that make use of alchemical pathways are becoming increasingly feasible thanks to advances in hardware and algorithms. Although relative binding free energy (RBFE) calculations are starting to find widespread use, absolute binding free energy (ABFE) calculations are still being explore...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Aldeghi M,Bodkin MJ,Knapp S,Biggin PC

    updated: 2017-09-25 00:00:00

  • Selective Fusion of Heterogeneous Classifiers for Predicting Substrates of Membrane Transporters.

    abstract::Membrane transporters play a crucial role in determining fate of administered drugs in a biological system. Early identification of plausible transporters for a drug molecule can provide insights into its therapeutic, pharmacokinetic, and toxicological profiles. In the present study, predictive models for classifying ...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Shaikh N,Sharma M,Garg P

    updated: 2017-03-27 00:00:00

  • A Polarization-Consistent Model for Alcohols to Predict Solvation Free Energies.

    abstract::Classical nonpolarizable models, normally based on a combination of Lennard-Jones sites and point charges, are extensively used to model thermodynamic properties of fluids, including solvation. An important shortcoming of these models is that they do not explicitly account for polarization effects, i.e., a description...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Barrera MC,Jorge M

    updated: 2020-03-23 00:00:00

  • Binding Residence Time through Scaled Molecular Dynamics: A Prospective Application to hDAAO Inhibitors.

    abstract::Traditionally, a drug potency is expressed in terms of thermodynamic quantities, mostly Kd, and empirical IC50 values. Although binding affinity as an estimate of drug activity remains relevant, it is increasingly clear that it is also important to include (un)binding kinetic parameters in the characterization of pote...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Bernetti M,Rosini E,Mollica L,Masetti M,Pollegioni L,Recanatini M,Cavalli A

    updated: 2018-11-26 00:00:00

  • An Analysis of Different Components of a High-Throughput Screening Library.

    abstract::Since many projects at pharmaceutical organizations get their start from a high-throughput screening (HTS) campaign, improving the quality of the HTS deck can improve the likelihood of discovering a high-quality lead molecule that can be progressed to a drug candidate. Over the past decade, Janssen has implemented sev...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Saha A,Varghese T,Liu A,Allen SJ,Mirzadegan T,Hack MD

    updated: 2018-10-22 00:00:00

  • LiCABEDS II. Modeling of ligand selectivity for G-protein-coupled cannabinoid receptors.

    abstract::The cannabinoid receptor subtype 2 (CB2) is a promising therapeutic target for blood cancer, pain relief, osteoporosis, and immune system disease. The recent withdrawal of Rimonabant, which targets another closely related cannabinoid receptor (CB1), accentuates the importance of selectivity for the development of CB2 ...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Ma C,Wang L,Yang P,Myint KZ,Xie XQ

    updated: 2013-01-28 00:00:00

  • Energetics, Thermodynamics, and Molecular Recognition of Piperine with DNA.

    abstract::Piperine, the bioactive phytochemical from black pepper (Piper nigrum L.), is a nontoxic natural compound exhibiting many physiological and pharmacological properties. They include antioxidant, anti-inflammatory, antimutagenic, antitumor, antiapoptotic, antigenotoxic, antiarthritic, antifungal, antimicrobial, antidepr...

    journal_title:Journal of chemical information and modeling

    pub_type: journal article

    authors: Haris P,Mary V,Haridas M,Sudarsanakumar C

    updated: 2015-12-28 00:00:00