ChemSchematicResolver: A Toolkit to Decode 2D Chemical Diagrams with Labels and R-Groups into Annotated Chemical Named Entities.

Abstract:

:The number of journal articles in the scientific domain has grown to the point where it has become impossible for researchers to capitalize on all findings in their relevant discipline. Information is stored in these articles in a number of ways, including figures that describe important results. In organic chemistry, these figures often present chemical schematic diagrams that graphically define the structures of carbon-based compounds. These diagrams are intuitive for an expert to comprehend, but they are not designed for machines. This work presents ChemSchematicResolver, a software tool that can be used to identify chemical schematic diagrams within the figure of a document, resolve any R-group substituents within them, and convert the resulting diagrams to a machine-readable format in a high-throughput, autonomous fashion. The tool includes a new algorithm that is used to identify relevant diagrams and a mechanism that combines these data with contextual information from the rest of the document for the creation of highly relational databases. It includes support for a variety of general R-group structures, the first time this is available in any open-source chemical schematic diagram extraction tool. It is presented alongside a self-generated evaluation set, on which the most important assessment metric, precision, achieved 83-100% for all assessed areas. The ChemSchematicResolver tool is released under the MIT license and is available to download from www.chemschematicresolver.org.

journal_name

J Chem Inf Model

authors

Beard EJ,Cole JM

doi

10.1021/acs.jcim.0c00042

subject

Has Abstract

pub_date

2020-04-27 00:00:00

pages

2059-2072

issue

4

eissn

1549-9596

issn

1549-960X

journal_volume

60

pub_type

杂志文章
  • Machine Learning Enhanced Spectrum Recognition Based on Computer Vision (SRCV) for Intelligent NMR Data Extraction.

    abstract::A machine learning enhanced spectrum recognition system called spectrum recognition based on computer vision (SRCV) for data extraction from previously analyzed 13C and 1H NMR spectra has been developed. The intelligent system was designed with four function modules to extract data from three areas of NMR images, incl...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c01046

    authors: Jia W,Yang Z,Yang M,Cheng L,Lei Z,Wang X

    更新日期:2021-01-25 00:00:00

  • Drug effect prediction by polypharmacology-based interaction profiling.

    abstract::Most drugs exert their effects via multitarget interactions, as hypothesized by polypharmacology. While these multitarget interactions are responsible for the clinical effect profiles of drugs, current methods have failed to uncover the complex relationships between them. Here, we introduce an approach which is able t...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci2002022

    authors: Simon Z,Peragovics A,Vigh-Smeller M,Csukly G,Tombor L,Yang Z,Zahoránszky-Kohalmi G,Végner L,Jelinek B,Hári P,Hetényi C,Bitter I,Czobor P,Málnási-Csizmadia A

    更新日期:2012-01-23 00:00:00

  • Assessment of the Sampling Performance of Multiple-Copy Dynamics versus a Unique Trajectory.

    abstract::The goal of the present study was to ascertain the differential performance of a long molecular dynamics trajectory versus several shorter ones starting from different points in the phase space and covering the same sampling time. For this purpose, we selected the 16-mer peptide Bak16BH3 as a model for study and carri...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.6b00347

    authors: Perez JJ,Tomas MS,Rubio-Martinez J

    更新日期:2016-10-24 00:00:00

  • Critical Assessment of the Hildebrand and Hansen Solubility Parameters for Polymers.

    abstract::Solubility parameter models are widely used to select suitable solvents/nonsolvents for polymers in a variety of processing and engineering applications. In this study, we focus on two well-established models, namely, the Hildebrand and Hansen solubility parameter models. Both models are built on the basis of the noti...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00656

    authors: Venkatram S,Kim C,Chandrasekaran A,Ramprasad R

    更新日期:2019-10-28 00:00:00

  • Conformator: A Novel Method for the Generation of Conformer Ensembles.

    abstract::Computer-aided drug design methods such as docking, pharmacophore searching, 3D database searching, and the creation of 3D-QSAR models need conformational ensembles to handle the flexibility of small molecules. Here, we present Conformator, an accurate and effective knowledge-based algorithm for generating conformer e...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.8b00704

    authors: Friedrich NO,Flachsenberg F,Meyder A,Sommer K,Kirchmair J,Rarey M

    更新日期:2019-02-25 00:00:00

  • Mechanisms for Flavin-Mediated Oxidation: Hydride or Hydrogen-Atom Transfer?

    abstract::Flavins are versatile biological cofactors which catalyze proton-coupled electron transfers (PCET) with varying number and coupling of electrons. Flavin-mediated oxidations of nicotinamide adenine dinucleotide (NADH) and of succinate, initial redox reactions in cellular respiration, were examined here with multiconfig...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00945

    authors: Curtolo F,Arantes GM

    更新日期:2020-12-28 00:00:00

  • Automated extraction of information on chemical-P-glycoprotein interactions from the literature.

    abstract::Knowledge of the interactions between drugs and transporters is important for drug discovery and development as well as for the evaluation of their clinical safety. We recently developed a text-mining system for the automatic extraction of information on chemical-CYP3A4 interactions from the literature. This system is...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci4003188

    authors: Yoshida S,Yamashita F,Ose A,Maeda K,Sugiyama Y,Hashida M

    更新日期:2013-10-28 00:00:00

  • Conformational analysis of macrocycles: finding what common search methods miss.

    abstract::As computational drug design becomes increasingly reliant on virtual screening and on high-throughput 3D modeling, the need for fast, robust, and reliable methods for sampling molecular conformations has become greater than ever. Furthermore, chemical novelty is at a premium, forcing medicinal chemists to explore more...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci900238a

    authors: Bonnet P,Agrafiotis DK,Zhu F,Martin E

    更新日期:2009-10-01 00:00:00

  • BFMP: a method for discretizing and visualizing pyranose conformations.

    abstract::We report a new classification method for pyranose ring conformations called Best-fit, Four-Membered Plane (BFMP), which describes pyranose ring conformations based on reference planes defined by four atoms. The method is able to characterize all asymmetrical and symmetrical shapes of a pyran ring, is readily automate...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci500325b

    authors: Makeneni S,Foley BL,Woods RJ

    更新日期:2014-10-27 00:00:00

  • Large-scale mining for similar protein binding pockets: with RAPMAD retrieval on the fly becomes real.

    abstract::Determination of structural similarities between protein binding pockets is an important challenge in in silico drug design. It can help to understand selectivity considerations, predict unexpected ligand cross-reactivity, and support the putative annotation of function to orphan proteins. To this end, Cavbase was dev...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci5005898

    authors: Krotzky T,Grunwald C,Egerland U,Klebe G

    更新日期:2015-01-26 00:00:00

  • Assessing the Conformational Equilibrium of Carboxylic Acid via Quantum Mechanical and Molecular Dynamics Studies on Acetic Acid.

    abstract::Accurate hydrogen placement in molecular modeling is crucial for studying the interactions and dynamics of biomolecular systems. The carboxyl functional group is a prototypical example of a functional group that requires protonation during structure preparation. To our knowledge, when in their neutral form, carboxylic...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.8b00835

    authors: Lim VT,Bayly CI,Fusti-Molnar L,Mobley DL

    更新日期:2019-05-28 00:00:00

  • ColBioS-FlavRC: a collection of bioselective flavonoids and related compounds filtered from high-throughput screening outcomes.

    abstract::Flavonoids, the vastest class of natural polyphenols, are extensively investigated for their multiple benefits on human health. Due to their physicochemical or biological properties, many representatives are considered to exhibit low selectivity among various protein targets or to plague high-throughput screening (HTS...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci5002668

    authors: Avram SI,Pacureanu LM,Bora A,Crisan L,Avram S,Kurunczi L

    更新日期:2014-08-25 00:00:00

  • A Polarization-Consistent Model for Alcohols to Predict Solvation Free Energies.

    abstract::Classical nonpolarizable models, normally based on a combination of Lennard-Jones sites and point charges, are extensively used to model thermodynamic properties of fluids, including solvation. An important shortcoming of these models is that they do not explicitly account for polarization effects, i.e., a description...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b01005

    authors: Barrera MC,Jorge M

    更新日期:2020-03-23 00:00:00

  • How do metabolites differ from their parent molecules and how are they excreted?

    abstract::Understanding which physicochemical properties, or property distributions, are favorable for successful design and development of drugs, nutritional supplements, cosmetics, and agrochemicals is of great importance. In this study we have analyzed molecules from three distinct chemical spaces (i) approved drugs, (ii) hu...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci300487z

    authors: Kirchmair J,Howlett A,Peironcely JE,Murrell DS,Williamson MJ,Adams SE,Hankemeier T,van Buren L,Duchateau G,Klaffke W,Glen RC

    更新日期:2013-02-25 00:00:00

  • 3D QSAR methods: Phase and Catalyst compared.

    abstract::The programs Phase and Catalyst HypoGen are compared for their performance in determining three-dimensional quantitative structure-activity relationships. Eight sets of compounds with measured activity were collected from the public literature and partitioned into suitable training and test sets by an automated proced...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci7000082

    authors: Evans DA,Doman TN,Thorner DA,Bodkin MJ

    更新日期:2007-05-01 00:00:00

  • Generalized topological indices. Modeling gas-phase rate coefficients of atmospheric relevance.

    abstract::We develop the idea that the use of ad hoc molecular descriptors in QSAR/QSPR studies is not an optimal solution. Instead, we propose to optimize these descriptors for the specific properties under study. In the case of topological indices (TIs) we propose the use of the generalized topological indices (GTIs), which a...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci600448b

    authors: Estrada E,Matamala AR

    更新日期:2007-05-01 00:00:00

  • Insights on the facet specific adsorption of amino acids and peptides toward platinum.

    abstract::Engineering shape-controlled bionanomaterials requires comprehensive understanding of interactions between biomolecules and inorganic surfaces. We explore the origin of facet-selective binding of peptides adsorbed onto Pt(100) and Pt(111) crystallographic planes. Using molecular dynamics simulations, we show that upon...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci400630d

    authors: Ramakrishnan SK,Martin M,Cloitre T,Firlej L,Cuisinier FJ,Gergely C

    更新日期:2013-12-23 00:00:00

  • Symplectic molecular dynamics simulations on specially designed parallel computers.

    abstract::We have developed a computer program for molecular dynamics (MD) simulation that implements the Split Integration Symplectic Method (SISM) and is designed to run on specialized parallel computers. The MD integration is performed by the SISM, which analytically treats high-frequency vibrational motion and thus enables ...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci050216q

    authors: Borstnik U,Janezic D

    更新日期:2005-11-01 00:00:00

  • Influence of Descriptor Implementation on Compound Ranking Based on Multiparameter Assessment.

    abstract::Most of the common molecular descriptors have numerous different implementations. This can influence the results of compound prioritization based on the multiparameter assessment (MPA) approach that allows a medicinal chemist to simultaneously analyze and achieve the desired balance of the diverse and often conflictin...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00734

    authors: Sosnina EA,Osolodkin DI,Radchenko EV,Sosnin S,Palyulin VA

    更新日期:2018-05-29 00:00:00

  • BiKi Life Sciences: A New Suite for Molecular Dynamics and Related Methods in Drug Discovery.

    abstract::In this paper, we introduce the BiKi Life Sciences suite. This software makes it easy for computational medicinal chemists to run ad hoc molecular dynamics protocols in a novel and task-oriented environment; as a notebook, BiKi (acronym of Binding Kinetics) keeps memory of any activity together with dependencies among...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.7b00680

    authors: Decherchi S,Bottegoni G,Spitaleri A,Rocchia W,Cavalli A

    更新日期:2018-02-26 00:00:00

  • Ligand-based molecular modeling study on a chemically diverse series of cholecystokinin-B/gastrin receptor antagonists: generation of predictive model.

    abstract::Pharmacophore hypotheses were developed for six structurally diverse series of cholecystokinin-B/gastrin receptor (CCK-BR) antagonists. A training set consisting of 33 compounds was carefully selected. The activity spread of the training set molecules was from 0.1 to 2100 nM. The most predictive pharmacophore model (h...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci050257m

    authors: Chopra M,Mishra AK

    更新日期:2005-11-01 00:00:00

  • Ligand- and Structure-Based Analysis of Deep Learning-Generated Potential α2a Adrenoceptor Agonists.

    abstract::The α2a adrenoceptor is a medically relevant subtype of the G protein-coupled receptor family. Unfortunately, high-throughput techniques aimed at producing novel drug leads for this receptor have been largely unsuccessful because of the complex pharmacology of adrenergic receptors. As such, cutting-edge in silico liga...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c01019

    authors: Schultz KJ,Colby SM,Lin VS,Wright AT,Renslow RS

    更新日期:2021-01-25 00:00:00

  • Isomerization and Decomposition of 2-Methylfuran with External Forces.

    abstract::The primary goal of this project was to evaluate the performance of the Standard and Enforced Geometry Optimization (SEGO) method which we have recently developed. The SEGO method has been designed for an automatic location of multiple minima on the molecular Potential Energy Surface (PES), and its usefulness has been...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.9b00352

    authors: Brzyska A,Woliński K

    更新日期:2019-08-26 00:00:00

  • Structural protein-ligand interaction fingerprints (SPLIF) for structure-based virtual screening: method and benchmark study.

    abstract::Accurate and affordable assessment of ligand-protein affinity for structure-based virtual screening (SB-VS) is a standing challenge. Hence, empirical postdocking filters making use of various types of structure-activity information may prove useful. Here, we introduce one such filter based upon three-dimensional struc...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci500319f

    authors: Da C,Kireev D

    更新日期:2014-09-22 00:00:00

  • Multidimensional Drift of Sequence Attributes and Functional Profiles in the Superfamily of the Three-Finger Proteins and Their Structural Homologues.

    abstract::Functional diversity of the three-finger-protein domain (TFPD) had been acquired via hypervariability of some sequence positions and extensive insertion/deletion of short AA-segments that caused multidimensional drift of several sequence attributes such as the overall (HI) and local hydrophobicity levels, the isoelect...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.5b00322

    authors: Galat A

    更新日期:2015-09-28 00:00:00

  • Linear and nonlinear 3D-QSAR approaches in tandem with ligand-based homology modeling as a computational strategy to depict the pyrazolo-triazolo-pyrimidine antagonists binding site of the human adenosine A2A receptor.

    abstract::The integration of ligand- and structure-based strategies might sensitively increase the success of drug discovery process. We have recently described the application of Molecular Electrostatic Potential autocorrelated vectors (autoMEPs) in generating both linear (Partial Least-Square, PLS) and nonlinear (Response Sur...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci700300w

    authors: Michielan L,Bacilieri M,Schiesaro A,Bolcato C,Pastorin G,Spalluto G,Cacciari B,Klotz KN,Kaseda C,Moro S

    更新日期:2008-02-01 00:00:00

  • Improving Protein-Ligand Docking Results with High-Throughput Molecular Dynamics Simulations.

    abstract::Structure-based virtual screening relies on classical scoring functions that often fail to reliably discriminate binders from nonbinders. In this work, we present a high-throughput protein-ligand complex molecular dynamics (MD) simulation that uses the output from AutoDock Vina to improve docking results in distinguis...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00057

    authors: Guterres H,Im W

    更新日期:2020-04-27 00:00:00

  • Cross-docking of inhibitors into CDK2 structures. 2.

    abstract::In the preceding paper (Duca, J. S.; Madison, V. S.; Voigt, J. H. J. Chem. Inf. Model. 2008, 48, 659-668), the accuracy of docking and affinity predictions of the Gold and Glide programs were investigated using single protein conformations spanning 150 CDK2/inhibitor crystallographic complexes. High docking accuracy w...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci700428d

    authors: Voigt JH,Elkin C,Madison VS,Duca JS

    更新日期:2008-03-01 00:00:00

  • Comparative Dynamics and Functional Mechanisms of the CYP17A1 Tunnels Regulated by Ligand Binding.

    abstract::As an important member of cytochrome P450 (CYP) enzymes, CYP17A1 is a dual-function monooxygenase with a critical role in the synthesis of many human steroid hormones, making it an attractive therapeutic target. The emerging structural information about CYP17A1 and the growing number of inhibitors for these enzymes ca...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/acs.jcim.0c00447

    authors: Xiao F,Song X,Tian P,Gan M,Verkhivker GM,Hu G

    更新日期:2020-07-27 00:00:00

  • Flux (1): a virtual synthesis scheme for fragment-based de novo design.

    abstract::It is demonstrated that the fragmentation of druglike molecules by applying simplistic pseudo-retrosynthesis results in a stock of chemically meaningful building blocks for de novo molecule generation. A stochastic search algorithm in conjunction with ligand-based similarity scoring (Flux: fragment-based ligand builde...

    journal_title:Journal of chemical information and modeling

    pub_type: 杂志文章

    doi:10.1021/ci0503560

    authors: Fechner U,Schneider G

    更新日期:2006-03-01 00:00:00