Combining automatic table classification and relationship extraction in extracting anticancer drug-side effect pairs from full-text articles.

Abstract:

:Anticancer drug-associated side effect knowledge often exists in multiple heterogeneous and complementary data sources. A comprehensive anticancer drug-side effect (drug-SE) relationship knowledge base is important for computation-based drug target discovery, drug toxicity predication and drug repositioning. In this study, we present a two-step approach by combining table classification and relationship extraction to extract drug-SE pairs from a large number of high-profile oncological full-text articles. The data consists of 31,255 tables downloaded from the Journal of Oncology (JCO). We first trained a statistical classifier to classify tables into SE-related and -unrelated categories. We then extracted drug-SE pairs from SE-related tables. We compared drug side effect knowledge extracted from JCO tables to that derived from FDA drug labels. Finally, we systematically analyzed relationships between anti-cancer drug-associated side effects and drug-associated gene targets, metabolism genes, and disease indications. The statistical table classifier is effective in classifying tables into SE-related and -unrelated (precision: 0.711; recall: 0.941; F1: 0.810). We extracted a total of 26,918 drug-SE pairs from SE-related tables with a precision of 0.605, a recall of 0.460, and a F1 of 0.520. Drug-SE pairs extracted from JCO tables is largely complementary to those derived from FDA drug labels; as many as 84.7% of the pairs extracted from JCO tables have not been included a side effect database constructed from FDA drug labels. Side effects associated with anticancer drugs positively correlate with drug target genes, drug metabolism genes, and disease indications.

journal_name

J Biomed Inform

authors

Xu R,Wang Q

doi

10.1016/j.jbi.2014.10.002

subject

Has Abstract

pub_date

2015-02-01 00:00:00

pages

128-35

eissn

1532-0464

issn

1532-0480

pii

S1532-0464(14)00221-4

journal_volume

53

pub_type

杂志文章
  • A model-driven methodology for exploring complex disease comorbidities applied to autism spectrum disorder and inflammatory bowel disease.

    abstract::We propose a model-driven methodology aimed to shed light on complex disorders. Our approach enables exploring shared etiologies of comorbid diseases at the molecular pathway level. The method, Comparative Comorbidities Simulation (CCS), uses stochastic Petri net simulation for examining the phenotypic effects of pert...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2016.08.008

    authors: Somekh J,Peleg M,Eran A,Koren I,Feiglin A,Demishtein A,Shiloh R,Heiner M,Kong SW,Elazar Z,Kohane I

    更新日期:2016-10-01 00:00:00

  • Homology assessment and molecular sequence alignment.

    abstract::Hypotheses of homology are the basis of phylogenetic analysis. All character data are considered to be equivalent regardless of the source of those characters. Putative homology statements are designated based on observations of similarity. Pairwise sequence alignment using the Needleman-Wunsch algorithm is the basis ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章,评审

    doi:10.1016/j.jbi.2005.11.005

    authors: Phillips AJ

    更新日期:2006-02-01 00:00:00

  • Induction of comprehensible models for gene expression datasets by subgroup discovery methodology.

    abstract::Finding disease markers (classifiers) from gene expression data by machine learning algorithms is characterized by a high risk of overfitting the data due the abundance of attributes (simultaneously measured gene expression values) and shortage of available examples (observations). To avoid this pitfall and achieve pr...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2004.07.007

    authors: Gamberger D,Lavrac N,Zelezný F,Tolar J

    更新日期:2004-08-01 00:00:00

  • Toward national comparable nurse practitioner data: proposed data elements, rationale, and methods.

    abstract::Federal funds have supported Nurse Practitioner (NP) education and the establishment of nurse-managed centers. Yet, important questions are raised about the quality and appropriate scope of NP care. Few NP-patient encounters are documented in the largest national surveys of ambulatory care, sponsored by the National C...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2003.09.016

    authors: Jenkins ML

    更新日期:2003-08-01 00:00:00

  • Knowledge-based personalized search engine for the Web-based Human Musculoskeletal System Resources (HMSR) in biomechanics.

    abstract::Human musculoskeletal system resources of the human body are valuable for the learning and medical purposes. Internet-based information from conventional search engines such as Google or Yahoo cannot response to the need of useful, accurate, reliable and good-quality human musculoskeletal resources related to medical ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2012.11.001

    authors: Dao TT,Hoang TN,Ta XH,Tho MC

    更新日期:2013-02-01 00:00:00

  • A novel web informatics approach for automated surveillance of cancer mortality trends.

    abstract::Cancer surveillance data are collected every year in the United States via the National Program of Cancer Registries (NPCR) and the Surveillance, Epidemiology and End Results (SEER) Program of the National Cancer Institute (NCI). General trends are closely monitored to measure the nation's progress against cancer. The...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2016.03.027

    authors: Tourassi G,Yoon HJ,Xu S

    更新日期:2016-06-01 00:00:00

  • Challenges in clinical natural language processing for automated disorder normalization.

    abstract:BACKGROUND:Identifying key variables such as disorders within the clinical narratives in electronic health records has wide-ranging applications within clinical practice and biomedical research. Previous research has demonstrated reduced performance of disorder named entity recognition (NER) and normalization (or groun...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2015.07.010

    authors: Leaman R,Khare R,Lu Z

    更新日期:2015-10-01 00:00:00

  • The European cross-border health data exchange roadmap: Case study in the Italian setting.

    abstract::Health data exchange is a major challenge due to the sensitive information and the privacy issues entailed. Considering the European context, in which health data must be exchanged between different European Union (EU) Member States, each having a different national regulatory framework as well as different national h...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2019.103183

    authors: Nalin M,Baroni I,Faiella G,Romano M,Matrisciano F,Gelenbe E,Martinez DM,Dumortier J,Natsiavas P,Votis K,Koutkias V,Tzovaras D,Clemente F

    更新日期:2019-06-01 00:00:00

  • Balancing volume and duration of information consumption by physicians: The case of health information exchange in critical care.

    abstract:BACKGROUND:The realization of the potential benefits of health information exchange systems (HIEs) for emergency departments (EDs) depends on the way these systems are actually used. The attributes of volume of information and duration of information processing are important for the study of HIE use patterns in the ED,...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2017.05.007

    authors: Politi L,Codish S,Sagy I,Fink L

    更新日期:2017-07-01 00:00:00

  • A framework for modeling health behavior protocols and their linkage to behavioral theory.

    abstract::With the rise in chronic, behavior-related disease, computerized behavioral protocols (CBPs) that help individuals improve behaviors have the potential to play an increasing role in the future health of society. To be effective and widely used CBPs should be based on accepted behavioral theory. However, designing CBPs...

    journal_title:Journal of biomedical informatics

    pub_type: 临床试验,杂志文章

    doi:10.1016/j.jbi.2004.12.001

    authors: Lenert L,Norman GJ,Mailhot M,Patrick K

    更新日期:2005-08-01 00:00:00

  • Multi-step ahead meningitis case forecasting based on decomposition and multi-objective optimization methods.

    abstract::Epidemiological time series forecasting plays an important role in health public systems, due to its ability to allow managers to develop strategic planning to avoid possible epidemics. In this paper, a hybrid learning framework is developed to forecast multi-step-ahead (one, two, and three-month-ahead) meningitis cas...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2020.103575

    authors: Ribeiro MHDM,Mariani VC,Coelho LDS

    更新日期:2020-11-01 00:00:00

  • Extending the Fellegi-Sunter probabilistic record linkage method for approximate field comparators.

    abstract::Probabilistic record linkage is a method commonly used to determine whether demographic records refer to the same person. The Fellegi-Sunter method is a probabilistic approach that uses field weights based on log likelihood ratios to determine record similarity. This paper introduces an extension of the Fellegi-Sunter...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2009.08.004

    authors: DuVall SL,Kerber RA,Thomas A

    更新日期:2010-02-01 00:00:00

  • All-IP wireless sensor networks for real-time patient monitoring.

    abstract::This paper proposes the all-IP WSNs (wireless sensor networks) for real-time patient monitoring. In this paper, the all-IP WSN architecture based on gateway trees is proposed and the hierarchical address structure is presented. Based on this architecture, the all-IP WSN can perform routing without route discovery. Mor...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2014.08.002

    authors: Wang X,Le D,Cheng H,Xie C

    更新日期:2014-12-01 00:00:00

  • Benchmarking deep learning models on large healthcare datasets.

    abstract::Deep learning models (aka Deep Neural Networks) have revolutionized many fields including computer vision, natural language processing, speech recognition, and is being increasingly used in clinical healthcare applications. However, few works exist which have benchmarked the performance of the deep learning models wit...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2018.04.007

    authors: Purushotham S,Meng C,Che Z,Liu Y

    更新日期:2018-07-01 00:00:00

  • Task definition, annotated dataset, and supervised natural language processing models for symptom extraction from unstructured clinical notes.

    abstract:INTRODUCTION:Machine learning (ML) and natural language processing have great potential to improve information extraction (IE) within electronic medical records (EMRs) for a wide variety of clinical search and summarization tools. Despite ML advancements, clinical adoption of real time IE tools for patient care remains...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2019.103354

    authors: Steinkamp JM,Bala W,Sharma A,Kantrowitz JJ

    更新日期:2020-02-01 00:00:00

  • An automated reasoning framework for translational research.

    abstract::In this paper we propose a novel approach to the design and implementation of knowledge-based decision support systems for translational research, specifically tailored to the analysis and interpretation of data from high-throughput experiments. Our approach is based on a general epistemological model of the scientifi...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2009.11.005

    authors: Riva A,Nuzzo A,Stefanelli M,Bellazzi R

    更新日期:2010-06-01 00:00:00

  • Systematic comparison of the protein-protein interaction databases from a user's perspective.

    abstract::In absence of periodic systematic comparisons, biologists/bioinformaticians may be forced to make a subjective selection among the many protein-protein interaction (PPI) databases and tools. We conducted a comprehensive compilation and comparison of such resources. We compiled 375 PPI resources, short-listed 125 impor...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2020.103380

    authors: Bajpai AK,Davuluri S,Tiwary K,Narayanan S,Oguru S,Basavaraju K,Dayalan D,Thirumurugan K,Acharya KK

    更新日期:2020-03-01 00:00:00

  • TRAK ontology: defining standard care for the rehabilitation of knee conditions.

    abstract::In this paper we discuss the design and development of TRAK (Taxonomy for RehAbilitation of Knee conditions), an ontology that formally models information relevant for the rehabilitation of knee conditions. TRAK provides the framework that can be used to collect coded data in sufficient detail to support epidemiologic...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2013.04.009

    authors: Button K,van Deursen RW,Soldatova L,Spasić I

    更新日期:2013-08-01 00:00:00

  • Patient similarity for precision medicine: A systematic review.

    abstract::Evidence-based medicine is the most prevalent paradigm adopted by physicians. Clinical practice guidelines typically define a set of recommendations together with eligibility criteria that restrict their applicability to a specific group of patients. The ever-growing size and availability of health-related data is cur...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2018.06.001

    authors: Parimbelli E,Marini S,Sacchi L,Bellazzi R

    更新日期:2018-07-01 00:00:00

  • Scenario-based design: a method for connecting information system design with public health operations and emergency management.

    abstract:UNLABELLED:Responding to public health emergencies requires rapid and accurate assessment of workforce availability under adverse and changing circumstances. However, public health information systems to support resource management during both routine and emergency operations are currently lacking. We applied scenario-...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2011.07.004

    authors: Reeder B,Turner AM

    更新日期:2011-12-01 00:00:00

  • Inductive creation of an annotation schema for manually indexing clinical conditions from emergency department reports.

    abstract::Evaluating automated indexing applications requires comparing automatically indexed terms against manual reference standard annotations. However, there are no standard guidelines for determining which words from a textual document to include in manual annotations, and the vague task can result in substantial variation...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2005.06.004

    authors: Chapman WW,Dowling JN

    更新日期:2006-04-01 00:00:00

  • Natural language processing systems for capturing and standardizing unstructured clinical information: A systematic review.

    abstract::We followed a systematic approach based on the Preferred Reporting Items for Systematic Reviews and Meta-Analyses to identify existing clinical natural language processing (NLP) systems that generate structured information from unstructured free text. Seven literature databases were searched with a query combining the...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章,评审

    doi:10.1016/j.jbi.2017.07.012

    authors: Kreimeyer K,Foster M,Pandey A,Arya N,Halford G,Jones SF,Forshee R,Walderhaug M,Botsis T

    更新日期:2017-09-01 00:00:00

  • Relevance feedback for enhancing content based image retrieval and automatic prediction of semantic image features: Application to bone tumor radiographs.

    abstract:BACKGROUND:The majority of current medical CBIR systems perform retrieval based only on "imaging signatures" generated by extracting pixel-level quantitative features, and only rarely has a feedback mechanism been incorporated to improve retrieval performance. In addition, current medical CBIR approaches do not routine...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2018.07.002

    authors: Banerjee I,Kurtz C,Devorah AE,Do B,Rubin DL,Beaulieu CF

    更新日期:2018-08-01 00:00:00

  • Interestingness measures and strategies for mining multi-ontology multi-level association rules from gene ontology annotations for the discovery of new GO relationships.

    abstract::The Gene Ontology (GO), a set of three sub-ontologies, is one of the most popular bio-ontologies used for describing gene product characteristics. GO annotation data containing terms from multiple sub-ontologies and at different levels in the ontologies is an important source of implicit relationships between terms fr...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2013.06.012

    authors: Manda P,McCarthy F,Bridges SM

    更新日期:2013-10-01 00:00:00

  • A reference ontology for biomedical informatics: the Foundational Model of Anatomy.

    abstract::The Foundational Model of Anatomy (FMA), initially developed as an enhancement of the anatomical content of UMLS, is a domain ontology of the concepts and relationships that pertain to the structural organization of the human body. It encompasses the material objects from the molecular to the macroscopic levels that c...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2003.11.007

    authors: Rosse C,Mejino JL Jr

    更新日期:2003-12-01 00:00:00

  • An evaluation of clinical order patterns machine-learned from clinician cohorts stratified by patient mortality outcomes.

    abstract:OBJECTIVE:Evaluate the quality of clinical order practice patterns machine-learned from clinician cohorts stratified by patient mortality outcomes. MATERIALS AND METHODS:Inpatient electronic health records from 2010 to 2013 were extracted from a tertiary academic hospital. Clinicians (n = 1822) were stratified into lo...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2018.09.005

    authors: Wang JK,Hom J,Balasubramanian S,Schuler A,Shah NH,Goldstein MK,Baiocchi MTM,Chen JH

    更新日期:2018-10-01 00:00:00

  • glUCModel: a monitoring and modeling system for chronic diseases applied to diabetes.

    abstract::Chronic patients must carry out a rigorous control of diverse factors in their lives. Diet, sport activity, medical analysis or blood glucose levels are some of them. This is a hard task, because some of these controls are performed very often, for instance some diabetics measure their glucose levels several times eve...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2013.12.015

    authors: Hidalgo JI,Maqueda E,Risco-Martín JL,Cuesta-Infante A,Colmenar JM,Nobel J

    更新日期:2014-04-01 00:00:00

  • Reflective Random Indexing and indirect inference: a scalable method for discovery of implicit connections.

    abstract::The discovery of implicit connections between terms that do not occur together in any scientific document underlies the model of literature-based knowledge discovery first proposed by Swanson. Corpus-derived statistical models of semantic distance such as Latent Semantic Analysis (LSA) have been evaluated previously a...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2009.09.003

    authors: Cohen T,Schvaneveldt R,Widdows D

    更新日期:2010-04-01 00:00:00

  • Prediction of influenza vaccination outcome by neural networks and logistic regression.

    abstract::The major challenge in influenza vaccination is to predict vaccine efficacy. The purpose of this study was to design a model to enable successful prediction of the outcome of influenza vaccination based on real historical medical data. A non-linear neural network approach was used, and its performance compared to logi...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2010.04.011

    authors: Trtica-Majnaric L,Zekic-Susac M,Sarlija N,Vitale B

    更新日期:2010-10-01 00:00:00

  • Heterogeneous database integration in biomedicine.

    abstract::The rapid expansion of biomedical knowledge, reduction in computing costs, and spread of internet access have created an ocean of electronic data. The decentralized nature of our scientific community and healthcare system, however, has resulted in a patchwork of diverse, or heterogeneous, database implementations, mak...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1006/jbin.2001.1024

    authors: Sujansky W

    更新日期:2001-08-01 00:00:00