A survey on literature based discovery approaches in biomedical domain.

Abstract:

:Literature Based Discovery (LBD) refers to the problem of inferring new and interesting knowledge by logically connecting independent fragments of information units through explicit or implicit means. This area of research, which incorporates techniques from Natural Language Processing (NLP), Information Retrieval and Artificial Intelligence, has significant potential to reduce discovery time in biomedical research fields. Formally introduced in 1986, LBD has grown to be a significant and a core task for text mining practitioners in the biomedical domain. Together with its inter-disciplinary nature, this has led researchers across domains to contribute in advancing this field of study. This survey attempts to consolidate and present the evolution of techniques in this area. We cover a variety of techniques and provide a detailed description of the problem setting, the intuition, the advantages and limitations of various influential papers. We also list the current bottlenecks in this field and provide a general direction of research activities for the future. In an effort to be comprehensive and for ease of reference for off-the-shelf users, we also list many publicly available tools for LBD. We hope this survey will act as a guide to both academic and industry (bio)-informaticians, introduce the various methodologies currently employed and also the challenges yet to be tackled.

journal_name

J Biomed Inform

authors

Gopalakrishnan V,Jha K,Jin W,Zhang A

doi

10.1016/j.jbi.2019.103141

subject

Has Abstract

pub_date

2019-05-01 00:00:00

pages

103141

eissn

1532-0464

issn

1532-0480

pii

S1532-0464(19)30059-0

journal_volume

93

pub_type

杂志文章,评审
  • Interacting agents through a web-based health serviceflow management system.

    abstract::The management of chronic and out-patients is a complex process which requires the cooperation of different agents belonging to several organizational units. Patients have to move to different locations to access the necessary services and to communicate their health status data. From their point of view there should ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2006.12.002

    authors: Leonardi G,Panzarasa S,Quaglini S,Stefanelli M,van der Aalst WM

    更新日期:2007-10-01 00:00:00

  • Automatic signal extraction, prioritizing and filtering approaches in detecting post-marketing cardiovascular events associated with targeted cancer drugs from the FDA Adverse Event Reporting System (FAERS).

    abstract:OBJECTIVE:Targeted drugs dramatically improve the treatment outcomes in cancer patients; however, these innovative drugs are often associated with unexpectedly high cardiovascular toxicity. Currently, cardiovascular safety represents both a challenging issue for drug developers, regulators, researchers, and clinicians ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2013.10.008

    authors: Xu R,Wang Q

    更新日期:2014-02-01 00:00:00

  • Unleashing genotypes in epidemiology - A novel method for managing high throughput information.

    abstract::The large amounts of data generated when high-throughput genotyping methods are used in large-scale epidemiological studies (>10,000 participants) present an enormous challenge to researchers in terms of structured data management. In order to face these challenges, a system has been designed and implemented where gen...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2009.07.005

    authors: Olund G,Brinne A,Lindqvist P,Litton JE

    更新日期:2009-12-01 00:00:00

  • Building a robust, scalable and standards-driven infrastructure for secondary use of EHR data: the SHARPn project.

    abstract::The Strategic Health IT Advanced Research Projects (SHARP) Program, established by the Office of the National Coordinator for Health Information Technology in 2010 supports research findings that remove barriers for increased adoption of health IT. The improvements envisioned by the SHARP Area 4 Consortium (SHARPn) wi...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2012.01.009

    authors: Rea S,Pathak J,Savova G,Oniki TA,Westberg L,Beebe CE,Tao C,Parker CG,Haug PJ,Huff SM,Chute CG

    更新日期:2012-08-01 00:00:00

  • A new framework for the selection of tag SNPs by multimarker haplotypes.

    abstract::This paper proposes a new framework for the selection of tag SNPs based on haplotypes instead of on a single SNP. The tag SNPs found by this framework form a set of haplotypes completely predictive of the alleles of all untyped SNPs. We refer to this problem as MTMH, which is defined as follows: given a set of SNPs, f...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2008.04.003

    authors: Huang YT,Chao KM

    更新日期:2008-12-01 00:00:00

  • Classification of forensic autopsy reports through conceptual graph-based document representation model.

    abstract::Text categorization has been used extensively in recent years to classify plain-text clinical reports. This study employs text categorization techniques for the classification of open narrative forensic autopsy reports. One of the key steps in text classification is document representation. In document representation,...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2018.04.013

    authors: Mujtaba G,Shuib L,Raj RG,Rajandram R,Shaikh K,Al-Garadi MA

    更新日期:2018-06-01 00:00:00

  • Task definition, annotated dataset, and supervised natural language processing models for symptom extraction from unstructured clinical notes.

    abstract:INTRODUCTION:Machine learning (ML) and natural language processing have great potential to improve information extraction (IE) within electronic medical records (EMRs) for a wide variety of clinical search and summarization tools. Despite ML advancements, clinical adoption of real time IE tools for patient care remains...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2019.103354

    authors: Steinkamp JM,Bala W,Sharma A,Kantrowitz JJ

    更新日期:2020-02-01 00:00:00

  • Decision-making model for early diagnosis of congestive heart failure using rough set and decision tree approaches.

    abstract::The accurate diagnosis of heart failure in emergency room patients is quite important, but can also be quite difficult due to our insufficient understanding of the characteristics of heart failure. The purpose of this study is to design a decision-making model that provides critical factors and knowledge associated wi...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2012.04.013

    authors: Son CS,Kim YN,Kim HS,Park HS,Kim MS

    更新日期:2012-10-01 00:00:00

  • Deep learning with wearable based heart rate variability for prediction of mental and general health.

    abstract::The ubiquity and commoditisation of wearable biosensors (fitness bands) has led to a deluge of personal healthcare data, but with limited analytics typically fed back to the user. The feasibility of feeding back more complex, seemingly unrelated measures to users was investigated, by assessing whether increased levels...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2020.103610

    authors: Coutts LV,Plans D,Brown AW,Collomosse J

    更新日期:2020-12-01 00:00:00

  • Inductive creation of an annotation schema for manually indexing clinical conditions from emergency department reports.

    abstract::Evaluating automated indexing applications requires comparing automatically indexed terms against manual reference standard annotations. However, there are no standard guidelines for determining which words from a textual document to include in manual annotations, and the vague task can result in substantial variation...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2005.06.004

    authors: Chapman WW,Dowling JN

    更新日期:2006-04-01 00:00:00

  • Clinical decision support models and frameworks: Seeking to address research issues underlying implementation successes and failures.

    abstract::Computer-based clinical decision support (CDS) has been pursued for more than five decades. Despite notable accomplishments and successes, wide adoption and broad use of CDS in clinical practice has not been achieved. Many issues have been identified as being partially responsible for the relatively slow adoption and ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章,评审

    doi:10.1016/j.jbi.2017.12.005

    authors: Greenes RA,Bates DW,Kawamoto K,Middleton B,Osheroff J,Shahar Y

    更新日期:2018-02-01 00:00:00

  • Developing, implementing, and evaluating decision support systems for shared decision making in patient care: a conceptual model and case illustration.

    abstract::The importance of including patient preferences in decisions regarding their care has received increased emphasis over recent years. Medical informatics can play an important role in improving patient-centered care by developing decision support systems to support the inclusion of patient preferences in clinical decis...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/s1532-0464(03)00037-6

    authors: Ruland CM,Bakken S

    更新日期:2002-10-01 00:00:00

  • Wisdom of artificial crowds feature selection in untargeted metabolomics: An application to the development of a blood-based diagnostic test for thrombotic myocardial infarction.

    abstract:INTRODUCTION:Heart disease remains a leading cause of global mortality. While acute myocardial infarction (colloquially: heart attack), has multiple proximate causes, proximate etiology cannot be determined by a blood-based diagnostic test. We enrolled a suitable patient cohort and conducted a non-targeted quantificati...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章,多中心研究

    doi:10.1016/j.jbi.2018.03.007

    authors: Trainor PJ,Yampolskiy RV,DeFilippis AP

    更新日期:2018-05-01 00:00:00

  • Modelling and analysing the dynamics of disease progression from cross-sectional studies.

    abstract::Clinical trials are typically conducted over a population within a defined time period in order to illuminate certain characteristics of a health issue or disease process. These cross-sectional studies give us a 'snapshot' of this disease process over a large number of people but do not allow us to model the temporal ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2012.11.003

    authors: Li Y,Swift S,Tucker A

    更新日期:2013-04-01 00:00:00

  • Information extraction from biomedical text.

    abstract::Information extraction is the process of scanning text for information relevant to some interest, including extracting entities, relations, and events. It requires deeper analysis than key word searches, but its aims fall short of the very hard and long-term problem of full text understanding. Information extraction r...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/s1532-0464(03)00015-7

    authors: Hobbs JR

    更新日期:2002-08-01 00:00:00

  • Patient empowerment for cancer patients through a novel ICT infrastructure.

    abstract::As a result of recent advances in cancer research and "precision medicine" approaches, i.e. the idea of treating each patient with the right drug at the right time, more and more cancer patients are being cured, or might have to cope with a life with cancer. For many people, cancer survival today means living with a c...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2019.103342

    authors: Kondylakis H,Bucur A,Crico C,Dong F,Graf N,Hoffman S,Koumakis L,Manenti A,Marias K,Mazzocco K,Pravettoni G,Renzi C,Schera F,Triberti S,Tsiknakis M,Kiefer S

    更新日期:2020-01-01 00:00:00

  • Semi-supervised medical entity recognition: A study on Spanish and Swedish clinical corpora.

    abstract:OBJECTIVE:The goal of this study is to investigate entity recognition within Electronic Health Records (EHRs) focusing on Spanish and Swedish. Of particular importance is a robust representation of the entities. In our case, we utilized unsupervised methods to generate such representations. METHODS:The significance of...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2017.05.009

    authors: Pérez A,Weegar R,Casillas A,Gojenola K,Oronoz M,Dalianis H

    更新日期:2017-07-01 00:00:00

  • A graph-based approach to auditing RxNorm.

    abstract:OBJECTIVES:RxNorm is a standardized nomenclature for clinical drug entities developed by the National Library of Medicine. In this paper, we audit relations in RxNorm for consistency and completeness through the systematic analysis of the graph of its concepts and relationships. METHODS:The representation of multi-ing...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2009.04.004

    authors: Bodenreider O,Peters LB

    更新日期:2009-06-01 00:00:00

  • GLIF3: a representation format for sharable computer-interpretable clinical practice guidelines.

    abstract::The Guideline Interchange Format (GLIF) is a model for representation of sharable computer-interpretable guidelines. The current version of GLIF (GLIF3) is a substantial update and enhancement of the model since the previous version (GLIF2). GLIF3 enables encoding of a guideline at three levels: a conceptual flowchart...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2004.04.002

    authors: Boxwala AA,Peleg M,Tu S,Ogunyemi O,Zeng QT,Wang D,Patel VL,Greenes RA,Shortliffe EH

    更新日期:2004-06-01 00:00:00

  • A tiered approach is more cost effective than traditional pharmacist-based review for classifying computer-detected signals as adverse drug events.

    abstract:OBJECTIVE:To develop a cost-efficient method for identifying adverse drug events (ADEs) and medication errors (MEs) identified using outpatient electronic medical records within ambulatory settings. DESIGN:Comparison of sensitivity and cost of "traditional" pharmacist based approach to identifying ADEs and MEs during ...

    journal_title:Journal of biomedical informatics

    pub_type: 临床试验,杂志文章,多中心研究

    doi:10.1016/s1532-0464(03)00059-5

    authors: Hope C,Overhage JM,Seger A,Teal E,Mills V,Fiskio J,Gandhi TK,Bates DW,Murray MD

    更新日期:2003-02-01 00:00:00

  • Evaluating warfarin dosing models on multiple datasets with a novel software framework and evolutionary optimisation.

    abstract::Warfarin is an effective preventative treatment for arterial and venous thromboembolism, but requires individualised dosing due to its narrow therapeutic range and high individual variation. Many machine learning techniques have been demonstrated in this domain. This study evaluated the accuracy of the most promising ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2020.103634

    authors: Truda G,Marais P

    更新日期:2021-01-01 00:00:00

  • Collaborative text-annotation resource for disease-centered relation extraction from biomedical text.

    abstract::Agglomerating results from studies of individual biological components has shown the potential to produce biomedical discovery and the promise of therapeutic development. Such knowledge integration could be tremendously facilitated by automated text mining for relation extraction in the biomedical literature. Relation...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2009.02.001

    authors: Cano C,Monaghan T,Blanco A,Wall DP,Peshkin L

    更新日期:2009-10-01 00:00:00

  • Neural network-based approaches for biomedical relation classification: A review.

    abstract::The explosive growth of biomedical literature has created a rich source of knowledge, such as that on protein-protein interactions (PPIs) and drug-drug interactions (DDIs), locked in unstructured free text. Biomedical relation classification aims to automatically detect and classify biomedical relations, which has gre...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章,评审

    doi:10.1016/j.jbi.2019.103294

    authors: Zhang Y,Lin H,Yang Z,Wang J,Sun Y,Xu B,Zhao Z

    更新日期:2019-11-01 00:00:00

  • Enhancing reuse of structured eligibility criteria and supporting their relaxation.

    abstract::Patient recruitment is one of the most important barriers to successful completion of clinical trials and thus to obtaining evidence about new methods for prevention, diagnostics and treatment. The reason is that recruitment is effort consuming. It requires the identification of candidate patients for the trial (the p...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2015.05.005

    authors: Milian K,Hoekstra R,Bucur A,Ten Teije A,van Harmelen F,Paulissen J

    更新日期:2015-08-01 00:00:00

  • NCI Thesaurus: a semantic model integrating cancer-related clinical and molecular information.

    abstract::Over the last 8 years, the National Cancer Institute (NCI) has launched a major effort to integrate molecular and clinical cancer-related information within a unified biomedical informatics framework, with controlled terminology as its foundational layer. The NCI Thesaurus is the reference terminology underpinning the...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2006.02.013

    authors: Sioutos N,de Coronado S,Haber MW,Hartel FW,Shaiu WL,Wright LW

    更新日期:2007-02-01 00:00:00

  • LGscore: A method to identify disease-related genes using biological literature and Google data.

    abstract::Since the genome project in 1990s, a number of studies associated with genes have been conducted and researchers have confirmed that genes are involved in disease. For this reason, the identification of the relationships between diseases and genes is important in biology. We propose a method called LGscore, which iden...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2015.01.003

    authors: Kim J,Kim H,Yoon Y,Park S

    更新日期:2015-04-01 00:00:00

  • Predicting the function of transplanted kidney in long-term care processes: Application of a hybrid model.

    abstract:BACKGROUND:A tool that can predict the estimated glomerular filtration rate (eGFR) in routine daily care can help clinicians to make better decisions for kidney transplant patients and to improve transplantation outcome. In this paper, we proposed a hybrid prediction model for predicting a future value for eGFR during ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2019.103116

    authors: Rashidi Khazaee P,Bagherzadeh M J,Niazkhani Z,Pirnejad H

    更新日期:2019-03-01 00:00:00

  • Use of morphological analysis in protein name recognition.

    abstract::Protein name recognition aims to detect each and every protein names appearing in a PubMed abstract. The task is not simple, as the graphic word boundary (space separator) assumed in conventional preprocessing does not necessarily coincide with the protein name boundary. Such boundary disagreement caused by tokenizati...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2004.08.001

    authors: Yamamoto K,Kudo T,Konagaya A,Matsumoto Y

    更新日期:2004-12-01 00:00:00

  • Role of OpenEHR as an open source solution for the regional modelling of patient data in obstetrics.

    abstract::This work investigates, whether openEHR with its reference model, archetypes and templates is suitable for the digital representation of demographic as well as clinical data. Moreover, it elaborates openEHR as a tool for modelling Hospital Information Systems on a regional level based on a national logical infrastruct...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2015.04.004

    authors: Pahl C,Zare M,Nilashi M,de Faria Borges MA,Weingaertner D,Detschew V,Supriyanto E,Ibrahim O

    更新日期:2015-06-01 00:00:00

  • Chester: towards a personal medication advisor.

    abstract::Dialogue systems for health communication hold out the promise of providing intelligent assistance to patients through natural interfaces that require no training to use. But in order to make the development of such systems cost effective, we must be able to use generic techniques and components which are then special...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2006.02.004

    authors: Allen J,Ferguson G,Blaylock N,Byron D,Chambers N,Dzikovska M,Galescu L,Swift M

    更新日期:2006-10-01 00:00:00