OPUS-Rota3: Improving Protein Side-Chain Modeling by Deep Neural Networks and Ensemble Methods.

Abstract:

:Side-chain modeling is critical for protein structure prediction since the uniqueness of the protein structure is largely determined by its side-chain packing conformation. In this paper, differing from most approaches that rely on rotamer library sampling, we first propose a novel side-chain rotamer prediction method based on deep neural networks, named OPUS-RotaNN. Then, on the basis of our previous work OPUS-Rota2, we propose an open-source side-chain modeling framework, OPUS-Rota3, which integrates the results of different methods into its rotamer library as the sampling candidates. By including OPUS-RotaNN into OPUS-Rota3, we conduct our experiments on three native backbone test sets and one non-native backbone test set. On the native backbone test set, CAMEO-Hard61 for example, OPUS-Rota3 successfully predicts 51.14% of all side-chain dihedral angles with a tolerance criterion of 20° and outperforms OSCAR-star (50.87%), SCWRL4 (50.40%), and FASPR (49.85%). On the non-native backbone test set DB379-ITASSER, the accuracy of OPUS-Rota3 is 52.49%, better than OSCAR-star (48.95%), FASPR (48.69%), and SCWRL4 (48.29%). All the source codes including the training codes and the data we used are available at https://github.com/thuxugang/opus_rota3.

journal_name

J Chem Inf Model

authors

Xu G,Wang Q,Ma J

doi

10.1021/acs.jcim.0c00951

subject

Has Abstract

pub_date

2020-12-28 00:00:00

pages

6691-6697

issue

12

eissn

1549-9596

issn

1549-960X

journal_volume

60

pub_type

杂志文章
  • Equally Weighted Multiscale Elastic Network Model and Its Comparison with Traditional and Parameter-Free Models.

    abstract::Dynamical properties of proteins play an essential role in their function exertion. The elastic network model (ENM) is an effective and efficient tool in characterizing the intrinsic dynamical properties encoded in biomacromolecule structures. The Gaussian network model (GNM) and anisotropic network model (ANM) are th...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c01178

    authors: Gong W,Liu Y,Zhao Y,Wang S,Han Z,Li C

    更新日期:2021-01-26 00:00:00

  • GalaxyDock: protein-ligand docking with flexible protein side-chains.

    abstract::An important issue in developing protein-ligand docking methods is how to incorporate receptor flexibility. Consideration of receptor flexibility using an ensemble of precompiled receptor conformations or by employing an effectively enlarged binding pocket has been reported to be useful. However, direct consideration ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci300342z

    authors: Shin WH,Seok C

    更新日期:2012-12-21 00:00:00

  • Molecular Dynamics Simulations of Substrate Release from Trypanosoma cruzi UDP-Galactopyranose Mutase.

    abstract::The enzyme UDP-galactopyranose mutase (UGM) represents a promising drug target for the treatment of infections with Trypanosoma cruzi. We have computed the Potential of Mean Force for the release of UDP-galactopyranose from UGM, using Umbrella Sampling simulations. The simulations revealed the conformational changes t...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.8b00675

    authors: Cossio-Pérez R,Pierdominici-Sottile G,Sobrado P,Palma J

    更新日期:2019-02-25 00:00:00

  • Trust, but Verify II: A Practical Guide to Chemogenomics Data Curation.

    abstract::There is a growing public concern about the lack of reproducibility of experimental data published in peer-reviewed scientific literature. Herein, we review the most recent alerts regarding experimental data quality and discuss initiatives taken thus far to address this problem, especially in the area of chemical geno...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章,评审

    doi:10.1021/acs.jcim.6b00129

    authors: Fourches D,Muratov E,Tropsha A

    更新日期:2016-07-25 00:00:00

  • Predicting the DNA Conductance Using a Deep Feedforward Neural Network Model.

    abstract::Double-stranded DNA (dsDNA) has been established as an efficient medium for charge migration, bringing it to the forefront of the field of molecular electronics and biological research. The charge migration rate is controlled by the electronic couplings between the two nucleobases of DNA/RNA. These electronic coupling...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c01072

    authors: Aggarwal A,Vinayak V,Bag S,Bhattacharyya C,Waghmare UV,Maiti PK

    更新日期:2021-01-25 00:00:00

  • Loop Grafting between Similar Local Environments for Fc-Silent Antibodies.

    abstract::Reduction of the affinity of the fragment crystallizable (Fc) region with immune receptors by substitution of one or a few amino acids, known as Fc-silencing, is an established approach to reduce the immune effector functions of monoclonal antibody therapeutics. This approach to Fc-silencing, however, is problematic a...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b01198

    authors: Lešnik S,Hodošček M,Podobnik B,Konc J

    更新日期:2020-11-23 00:00:00

  • DNA minor groove pharmacophores describing sequence specific properties.

    abstract::The more that is known about human and other genome sequences and the correlation between gene expression and the course of a disease, the more evident it seems to be that DNA is chosen as a drug target instead of proteins which are built with the information encoded by DNA. According to this approach, small minor gro...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci600500v

    authors: Spitzer GM,Wellenzohn B,Laggner C,Langer T,Liedl KR

    更新日期:2007-07-01 00:00:00

  • Molecular Dynamics Simulations of Membrane-Bound STIM1 to Investigate Conformational Changes during STIM1 Activation upon Calcium Release.

    abstract::Calcium is involved in important intracellular processes, such as intracellular signaling from cell membrane receptors to the nucleus. Typically, calcium levels are kept at less than 100 nM in the nucleus and cytosol, but some calcium is stored in the endoplasmic reticulum (ER) lumen for rapid release to activate intr...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.6b00475

    authors: Mukherjee S,Karolak A,Debant M,Buscaglia P,Renaudineau Y,Mignen O,Guida WC,Brooks WH

    更新日期:2017-02-27 00:00:00

  • Assessment of the Cruzain Cysteine Protease Reversible and Irreversible Covalent Inhibition Mechanism.

    abstract::Reversible and irreversible covalent ligands are advanced cysteine protease inhibitors in the drug development pipeline. K777 is an irreversible inhibitor of cruzain, a necessary enzyme for the survival of the Trypanosoma cruzi (T. cruzi) parasite, the causative agent of Chagas disease. Despite their importance, irrev...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b01138

    authors: Silva JRA,Cianni L,Araujo D,Batista PHJ,de Vita D,Rosini F,Leitão A,Lameira J,Montanari CA

    更新日期:2020-03-23 00:00:00

  • PyPLIF HIPPOS: A Molecular Interaction Fingerprinting Tool for Docking Results of AutoDock Vina and PLANTS.

    abstract::We describe here our tool named PyPLIF HIPPOS, which was newly developed to analyze the docking results of AutoDock Vina and PLANTS. Its predecessor, PyPLIF (https://github.com/radifar/pyplif), is a molecular interaction fingerprinting tool for the docking results of PLANTS, exclusively. Unlike its predecessor, PyPLIF...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00305

    authors: Istyastono EP,Radifar M,Yuniarti N,Prasasty VD,Mungkasi S

    更新日期:2020-08-24 00:00:00

  • GESSE: Predicting Drug Side Effects from Drug-Target Relationships.

    abstract::The in silico prediction of unwanted side effects (SEs) caused by the promiscuous behavior of drugs and their targets is highly relevant to the pharmaceutical industry. Considerable effort is now being put into computational and experimental screening of several suspected off-target proteins in the hope that SEs might...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.5b00120

    authors: Pérez-Nueno VI,Souchet M,Karaboga AS,Ritchie DW

    更新日期:2015-09-28 00:00:00

  • Growth of ligand-target interaction data in ChEMBL is associated with increasing and activity measurement-dependent compound promiscuity.

    abstract::Compounds with high-confidence target annotations and activity measurements in the original and current release of the ChEMBL database have been compared to better understand how the growth of compound activity data might influence the spectrum of ligand-target interactions and the degree of target promiscuity among a...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci3003304

    authors: Hu Y,Bajorath J

    更新日期:2012-10-22 00:00:00

  • Prediction of the Favorable Hydration Sites in a Protein Binding Pocket and Its Application to Scoring Function Formulation.

    abstract::The important role of water molecules in protein-ligand binding energetics has attracted wide attention in recent years. A range of computational methods has been developed to predict the favorable locations of water molecules in a protein binding pocket. Most of the current methods are based on extensive molecular dy...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00619

    authors: Li Y,Gao Y,Holloway MK,Wang R

    更新日期:2020-09-28 00:00:00

  • Open Source Bayesian Models. 1. Application to ADME/Tox and Drug Discovery Datasets.

    abstract::On the order of hundreds of absorption, distribution, metabolism, excretion, and toxicity (ADME/Tox) models have been described in the literature in the past decade which are more often than not inaccessible to anyone but their authors. Public accessibility is also an issue with computational models for bioactivity, a...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.5b00143

    authors: Clark AM,Dole K,Coulon-Spektor A,McNutt A,Grass G,Freundlich JS,Reynolds RC,Ekins S

    更新日期:2015-06-22 00:00:00

  • Development of an informatics platform for therapeutic protein and peptide analytics.

    abstract::The momentum gained by research on biologics has not been met yet with equal thrust on the informatics side. There is a noticeable lack of software for data management that empowers the bench scientists working on the development of biologic therapeutics. SARvision|Biologics is a tool to analyze data associated with b...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci400333x

    authors: Hansen MR,Villar HO,Feyfant E

    更新日期:2013-10-28 00:00:00

  • The valence state combination model: a generic framework for handling tautomers and protonation states.

    abstract::The consistent handling of molecules is probably the most basic and important requirement in the field of cheminformatics. Reliable results can only be obtained if the underlying calculations are independent of the specific way molecules are represented in the input data. However, ensuring consistency is a complex tas...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci400724v

    authors: Urbaczek S,Kolodzik A,Rarey M

    更新日期:2014-03-24 00:00:00

  • The molecular basis for the selectivity of tadalafil toward phosphodiesterase 5 and 6: a modeling study.

    abstract::Great attention has been paid to the clinical significance of phosphodiesterase 5 (PDE5) inhibitors, such as sildenafil, tadalafil, and vardenafil widely used for erectile dysfunction. However, sildenafil causes side effects on visual functions since it shows similar potencies to inhibit PDE5 and PDE6, whereas tadalaf...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci400458z

    authors: Huang YY,Li Z,Cai YH,Feng LJ,Wu Y,Li X,Luo HB

    更新日期:2013-11-25 00:00:00

  • What do we know about C28H14 and C30H14 benzenoid hydrocarbons and their evolution to related polymer strips?

    abstract::While critically reviewing the current status of what is known about C28H14 and C30H14 benzenoid isomers, which are ubiquitous pyrolytic constituents, some new insights will be presented. Representative isomers belonging to these benzenoid hydrocarbons are at the crossroads to homologous series that extend to infinite...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci050298i

    authors: Dias JR

    更新日期:2006-03-01 00:00:00

  • Relationships between Molecular Complexity, Biological Activity, and Structural Diversity.

    abstract::Following the theoretical model by Hann et al. moderately complex structures are preferable lead compounds since they lead to specific binding events involving the complete ligand molecule. To make this concept usable in practice for library design, we studied several complexity measures on the biological activity of ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci0503558

    authors: Schuffenhauer A,Brown N,Selzer P,Ertl P,Jacoby E

    更新日期:2006-03-01 00:00:00

  • Supervised self-organizing maps in drug discovery. 2. Improvements in descriptor selection and model validation.

    abstract::The modeling of nonlinear descriptor-target relationships is a topic of considerable interest in drug discovery. We, herein, continue reporting the use of the self-organizing map-a nonlinear, topology-preserving pattern recognition technique that exhibits considerable promise in modeling and decoding these relationshi...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci0500841

    authors: Xiao YD,Harris R,Bayram E,Ii PS,Schmitt JD

    更新日期:2006-01-01 00:00:00

  • GalaxyGPCRloop: Template-Based and Ab Initio Structure Sampling of the Extracellular Loops of G-Protein-Coupled Receptors.

    abstract::The second extracellular loops (ECL2s) of G-protein-coupled receptors (GPCRs) are often involved in GPCR functions, and their structures have important implications in drug discovery. However, structure prediction of ECL2 is difficult because of its long length and the structural diversity among different GPCRs. In th...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.8b00148

    authors: Won J,Lee GR,Park H,Seok C

    更新日期:2018-06-25 00:00:00

  • Including explicit water molecules as part of the protein structure in MM/PBSA calculations.

    abstract::Water is the natural medium of molecules in the cell and plays an important role in protein structure, function and interaction with small molecule ligands. However, the widely used molecular mechanics Poisson-Boltzmann surface area (MM/PBSA) method for binding energy calculation does not explicitly take account of wa...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci4001794

    authors: Zhu YL,Beroza P,Artis DR

    更新日期:2014-02-24 00:00:00

  • Trans and Cis Conformations of the Antihypertensive Drug Valsartan Respectively Lock the Inactive and Active-like States of Angiotensin II Type 1 Receptor: A Molecular Dynamics Study.

    abstract::Angiotensin II type 1 receptor (AT1R) is the principal regulator of blood pressure in humans. The overactivation of AT1R by the stimulation of angiotensin II would result in high blood pressure. To prevent hypertension, nonpeptide "sartan" drugs, such as valsartan (VST), have been developed to competitively block the ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.8b00364

    authors: Wang L,Yan F

    更新日期:2018-10-22 00:00:00

  • Searching for recursively defined generic chemical patterns in nonenumerated fragment spaces.

    abstract::Retrieving molecules with specific structural features is a fundamental requirement of today's molecular database technologies. Estimates claim the chemical space relevant for drug discovery to be around 10⁶⁰ molecules. This figure is many orders of magnitude larger than the amount of molecules conventional databases ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci400107k

    authors: Ehrlich HC,Henzler AM,Rarey M

    更新日期:2013-07-22 00:00:00

  • Enrichment analysis for discovering biological associations in phenotypic screens.

    abstract::A phenotypic screen (PS) is used to identify compounds causing a desired phenotype in a complex biological system where mechanisms and targets are largely unknown. Deconvoluting the mechanism of action of actives and identification of relevant targets and pathways remains a formidable challenge. Current methods fail t...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci400245c

    authors: Polyakov VR,Moorcroft ND,Drawid A

    更新日期:2014-02-24 00:00:00

  • FORTRAN interface for code interoperability in quantum chemistry: the Q5Cost library.

    abstract::Ab initio quantum-chemistry programs produce and use large amounts of data, which are usually stored on disk in the form of binary files. A FORTRAN library, named Q5Cost, has been designed and implemented in order to allow the storage of these data sets in a special data format built with the HDF5 technology. This dat...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci7000567

    authors: Borini S,Monari A,Rossi E,Tajti A,Angeli C,Bendazzoli GL,Cimiraglia R,Emerson A,Evangelisti S,Maynau D,Sanchez-Marin J,Szalay PG

    更新日期:2007-05-01 00:00:00

  • Modeling Binding with Large Conformational Changes: Key Points in Ensemble-Docking Approaches.

    abstract::Protein dynamics play a critical role in ligand binding, and different models have been proposed to explain the relationships between protein motion and molecular recognition. Here, we present a study of ligand-binding processes associated with large conformational changes of a protein to elucidate the critical choice...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00125

    authors: Motta S,Bonati L

    更新日期:2017-07-24 00:00:00

  • New serotonin 5-HT(6) ligands from common feature pharmacophore hypotheses.

    abstract::Serotonin 5-HT6 receptor antagonists are thought to play an important role in the treatment of psychiatry, Alzheimer's disease, and probably obesity. To find novel and potent 5-HT6 antagonists and to provide a new idea for drug design, we used a ligand-based pharmacophore to perform the virtual screening of a commerci...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci700160t

    authors: Kim HJ,Doddareddy MR,Choo H,Cho YS,No KT,Park WK,Pae AN

    更新日期:2008-01-01 00:00:00

  • Molecular Structure Extraction from Documents Using Deep Learning.

    abstract::Chemical structure extraction from documents remains a hard problem because of both false positive identification of structures during segmentation and errors in the predicted structures. Current approaches rely on handcrafted rules and subroutines that perform reasonably well generally but still routinely encounter s...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.8b00669

    authors: Staker J,Marshall K,Abel R,McQuaw CM

    更新日期:2019-03-25 00:00:00

  • Antihypertensive drug valsartan in solution and at the AT1 receptor: conformational analysis, dynamic NMR spectroscopy, in silico docking, and molecular dynamics simulations.

    abstract::The conformational properties of AT1 antagonist valsartan have been analyzed both in solution and at the binding site of the receptor. Low energy conformations of valsartan in solution were explored by NMR spectroscopy and molecular modeling studies. The NMR results showed the existence of two distinct and almost isoe...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci800427s

    authors: Potamitis C,Zervou M,Katsiaras V,Zoumpoulakis P,Durdagi S,Papadopoulos MG,Hayes JM,Grdadolnik SG,Kyrikou I,Argyropoulos D,Vatougia G,Mavromoustakos T

    更新日期:2009-03-01 00:00:00