Matching patients to clinical trials using semantically enriched document representation.

Abstract:

:Recruiting eligible patients for clinical trials is crucial for reliably answering specific questions about medical interventions and evaluation. However, clinical trial recruitment is a bottleneck in clinical research and drug development. Our goal is to provide an approach towards automating this manual and time-consuming patient recruitment task using natural language processing and machine learning techniques. Specifically, our approach extracts key information from series of narrative clinical documents in patient's records and collates helpful evidence to make decisions on eligibility of patients according to certain inclusion and exclusion criteria. Challenges in applying narrative clinical documents such as differences in reporting styles and sub-languages are addressed by enriching them with knowledge from domain ontologies in the form of semantic vector representations. We show that a machine learning model based on Multi-Layer Perceptron (MLP) is more effective for the task than five other neural networks and four conventional machine learning models. Our approach achieves overall micro-F1-Score of 84% for 13 different eligibility criteria. Our experiments also indicate that semantically enriched documents are more effective than using original documents for cohort selection. Our system provides an end-to-end machine learning-based solution that achieves comparable results with the state-of-the-art which relies on hand-crafted rules or data-centric engineered features.

journal_name

J Biomed Inform

authors

Hassanzadeh H,Karimi S,Nguyen A

doi

10.1016/j.jbi.2020.103406

subject

Has Abstract

pub_date

2020-05-01 00:00:00

pages

103406

eissn

1532-0464

issn

1532-0480

pii

S1532-0464(20)30034-4

journal_volume

105

pub_type

杂志文章
  • Vaidurya: a multiple-ontology, concept-based, context-sensitive clinical-guideline search engine.

    abstract::We designed and implemented a generic search engine (Vaidurya), as part of our Digital clinical-Guideline Library (DeGeL) framework. Two search methods were implemented in addition to full-text search: (1) concept-based search, which relies on pre-indexing the guidelines in a clinically meaningful fashion, and (2) con...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2008.07.003

    authors: Moskovitch R,Shahar Y

    更新日期:2009-02-01 00:00:00

  • Computing with evidence Part II: An evidential approach to predicting metabolic drug-drug interactions.

    abstract::We describe a novel experiment that we conducted with the Drug Interaction Knowledge-base (DIKB) to determine which combinations of evidence enable a rule-based theory of metabolic drug-drug interactions to make the most optimal set of predictions. The focus of the experiment was a group of 16 drugs including six memb...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2009.05.010

    authors: Boyce R,Collins C,Horn J,Kalet I

    更新日期:2009-12-01 00:00:00

  • Neural network-based approaches for biomedical relation classification: A review.

    abstract::The explosive growth of biomedical literature has created a rich source of knowledge, such as that on protein-protein interactions (PPIs) and drug-drug interactions (DDIs), locked in unstructured free text. Biomedical relation classification aims to automatically detect and classify biomedical relations, which has gre...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章,评审

    doi:10.1016/j.jbi.2019.103294

    authors: Zhang Y,Lin H,Yang Z,Wang J,Sun Y,Xu B,Zhao Z

    更新日期:2019-11-01 00:00:00

  • Tracking a moving user in indoor environments using Bluetooth low energy beacons.

    abstract:BACKGROUND:Bluetooth low energy (BLE) beacons have been used to track the locations of individuals in indoor environments for clinical applications such as workflow analysis and infectious disease modelling. Most current approaches use the received signal strength indicator (RSSI) to track locations. When using the RSS...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2019.103288

    authors: Surian D,Kim V,Menon R,Dunn AG,Sintchenko V,Coiera E

    更新日期:2019-10-01 00:00:00

  • Cognitive simulators for medical education and training.

    abstract::Simulators for honing procedural skills (such as surgical skills and central venous catheter placement) have proven to be valuable tools for medical educators and students. While such simulations represent an effective paradigm in surgical education, there is an opportunity to add a layer of cognitive exercises to the...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2009.02.008

    authors: Kahol K,Vankipuram M,Smith ML

    更新日期:2009-08-01 00:00:00

  • A Health Surveillance Software Framework to deliver information on preventive healthcare strategies.

    abstract::A software framework can reduce costs related to the development of an application because it allows developers to reuse both design and code. Recently, companies and research groups have announced that they have been employing health software frameworks. This paper presents the design, proof-of-concept implementation...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2016.06.002

    authors: Macedo AA,Pollettini JT,Baranauskas JA,Chaves JC

    更新日期:2016-08-01 00:00:00

  • Research-IQ: development and evaluation of an ontology-anchored integrative query tool.

    abstract::Investigators in the translational research and systems medicine domains require highly usable, efficient and integrative tools and methods that allow for the navigation of and reasoning over emerging large-scale data sets. Such resources must cover a spectrum of granularity from bio-molecules to population phenotypes...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2011.07.006

    authors: Borlawsky TB,Lele O,Payne PR

    更新日期:2011-12-01 00:00:00

  • Unsupervised low-dimensional vector representations for words, phrases and text that are transparent, scalable, and produce similarity metrics that are not redundant with neural embeddings.

    abstract::Neural embeddings are a popular set of methods for representing words, phrases or text as a low dimensional vector (typically 50-500 dimensions). However, it is difficult to interpret these dimensions in a meaningful manner, and creating neural embeddings requires extensive training and tuning of multiple parameters a...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2019.103096

    authors: Smalheiser NR,Cohen AM,Bonifield G

    更新日期:2019-02-01 00:00:00

  • Deep neural models for ICD-10 coding of death certificates and autopsy reports in free-text.

    abstract::We address the assignment of ICD-10 codes for causes of death by analyzing free-text descriptions in death certificates, together with the associated autopsy reports and clinical bulletins, from the Portuguese Ministry of Health. We leverage a deep neural network that combines word embeddings, recurrent units, and neu...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2018.02.011

    authors: Duarte F,Martins B,Pinto CS,Silva MJ

    更新日期:2018-04-01 00:00:00

  • Description of a method to support public health information management: organizational network analysis.

    abstract::In this case study, we describe a method that has potential to provide systematic support for public health information management. Public health agencies depend on specialized information that travels throughout an organization via communication networks among employees. Interactions that occur within these networks ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2006.09.004

    authors: Merrill J,Bakken S,Rockoff M,Gebbie K,Carley KM

    更新日期:2007-08-01 00:00:00

  • The impact of SNOMED CT revisions on a mapped interface terminology: terminology development and implementation issues.

    abstract::Large-scale mapping efforts have been done in attempts to migrate systems that use proprietary concepts to ones that use terminological standards such as SNOMED CT. As efforts move towards implementation, the target maps should retain a predictable structure including those targets requiring post-coordination of SNOME...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2009.03.004

    authors: Wade G,Rosenbloom ST

    更新日期:2009-06-01 00:00:00

  • Prediction of clinical risks by analysis of preclinical and clinical adverse events.

    abstract::This study examines the ability of nonclinical adverse event observations to predict human clinical adverse events observed in drug development programs. In addition it examines the relationship between nonclinical and clinical adverse event observations to drug withdrawal and proposes a model to predict drug withdraw...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2015.02.008

    authors: Clark M

    更新日期:2015-04-01 00:00:00

  • Benchmarking deep learning models on large healthcare datasets.

    abstract::Deep learning models (aka Deep Neural Networks) have revolutionized many fields including computer vision, natural language processing, speech recognition, and is being increasingly used in clinical healthcare applications. However, few works exist which have benchmarked the performance of the deep learning models wit...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2018.04.007

    authors: Purushotham S,Meng C,Che Z,Liu Y

    更新日期:2018-07-01 00:00:00

  • Virtualizing living and working spaces: Proof of concept for a biomedical space-replication methodology.

    abstract::The physical spaces within which the work of health occurs - the home, the intensive care unit, the emergency room, even the bedroom - influence the manner in which behaviors unfold, and may contribute to efficacy and effectiveness of health interventions. Yet the study of such complex workspaces is difficult. Health ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2015.07.007

    authors: Brennan PF,Ponto K,Casper G,Tredinnick R,Broecker M

    更新日期:2015-10-01 00:00:00

  • Facilitating pre-operative assessment guidelines representation using SNOMED CT.

    abstract:OBJECTIVE:To investigate whether SNOMED CT covers the terms used in pre-operative assessment guidelines, and if necessary, how the measured content coverage can be improved. METHODS:Pre-operative assessment guidelines were retrieved from the websites of (inter)national anesthesia-related societies. The recommendations...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2010.07.009

    authors: Ahmadian L,Cornet R,de Keizer NF

    更新日期:2010-12-01 00:00:00

  • Analysis of microarray leukemia data using an efficient MapReduce-based K-nearest-neighbor classifier.

    abstract::Microarray-based gene expression profiling has emerged as an efficient technique for classification, prognosis, diagnosis, and treatment of cancer. Frequent changes in the behavior of this disease generates an enormous volume of data. Microarray data satisfies both the veracity and velocity properties of big data, as ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2016.03.002

    authors: Kumar M,Rath NK,Rath SK

    更新日期:2016-04-01 00:00:00

  • Lessons learnt from the DDIExtraction-2013 Shared Task.

    abstract::The DDIExtraction Shared Task 2013 is the second edition of the DDIExtraction Shared Task series, a community-wide effort to promote the implementation and comparative assessment of natural language processing (NLP) techniques in the field of the pharmacovigilance domain, in particular, to address the extraction of dr...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2014.05.007

    authors: Segura-Bedmar I,Martínez P,Herrero-Zazo M

    更新日期:2014-10-01 00:00:00

  • PharmActa: Personalized pharmaceutical care eHealth platform for patients and pharmacists.

    abstract::Community pharmacists are critically placed in the patient care chain being an extended frontline within primary healthcare networks across Europe. They are trained to ensure safe and effective medication use, a crucial and responsible role, extending beyond the common misconception limited to just providing timely ac...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2019.103336

    authors: Spanakis M,Sfakianakis S,Kallergis G,Spanakis EG,Sakkalis V

    更新日期:2019-12-01 00:00:00

  • An automated reasoning framework for translational research.

    abstract::In this paper we propose a novel approach to the design and implementation of knowledge-based decision support systems for translational research, specifically tailored to the analysis and interpretation of data from high-throughput experiments. Our approach is based on a general epistemological model of the scientifi...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2009.11.005

    authors: Riva A,Nuzzo A,Stefanelli M,Bellazzi R

    更新日期:2010-06-01 00:00:00

  • A comparison of two methods for retrieving ICD-9-CM data: the effect of using an ontology-based method for handling terminology changes.

    abstract:OBJECTIVE:Most existing controlled terminologies can be characterized as collections of terms, wherein the terms are arranged in a simple list or organized in a hierarchy. These kinds of terminologies are considered useful for standardizing terms and encoding data and are currently used in many existing information sys...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2011.01.005

    authors: Yu AC,Cimino JJ

    更新日期:2011-04-01 00:00:00

  • DEEPEN: A negation detection system for clinical text incorporating dependency relation into NegEx.

    abstract::In Electronic Health Records (EHRs), much of valuable information regarding patients' conditions is embedded in free text format. Natural language processing (NLP) techniques have been developed to extract clinical information from free text. One challenge faced in clinical NLP is that the meaning of clinical entities...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2015.02.010

    authors: Mehrabi S,Krishnan A,Sohn S,Roch AM,Schmidt H,Kesterson J,Beesley C,Dexter P,Max Schmidt C,Liu H,Palakal M

    更新日期:2015-04-01 00:00:00

  • FRR: fair remote retrieval of outsourced private medical records in electronic health networks.

    abstract::Cloud computing is emerging as the next-generation IT architecture. However, cloud computing also raises security and privacy concerns since the users have no physical control over the outsourced data. This paper focuses on fairly retrieving encrypted private medical records outsourced to remote untrusted cloud server...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2014.02.008

    authors: Wang H,Wu Q,Qin B,Domingo-Ferrer J

    更新日期:2014-08-01 00:00:00

  • The REDCap consortium: Building an international community of software platform partners.

    abstract::The Research Electronic Data Capture (REDCap) data management platform was developed in 2004 to address an institutional need at Vanderbilt University, then shared with a limited number of adopting sites beginning in 2006. Given bi-directional benefit in early sharing experiments, we created a broader consortium shari...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2019.103208

    authors: Harris PA,Taylor R,Minor BL,Elliott V,Fernandez M,O'Neal L,McLeod L,Delacqua G,Delacqua F,Kirby J,Duda SN,REDCap Consortium.

    更新日期:2019-07-01 00:00:00

  • A pilot study of a heuristic algorithm for novel template identification from VA electronic medical record text.

    abstract:RATIONALE:Templates in text notes pose challenges for automated information extraction algorithms. We propose a method that identifies novel templates in plain text medical notes. The identification can then be used to either include or exclude templates when processing notes for information extraction. METHODS:The tw...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2016.07.019

    authors: Redd AM,Gundlapalli AV,Divita G,Carter ME,Tran LT,Samore MH

    更新日期:2017-07-01 00:00:00

  • Personal health information in research: Perceived risk, trustworthiness and opinions from patients attending a tertiary healthcare facility.

    abstract:BACKGROUND:Personal health information is a valuable resource to the advancement of research. In order to achieve a comprehensive reform of data infrastructure in Australia, both public engagement and building social trust is vital. In light of this, we conducted a study to explore the opinions, perceived risks and tru...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2019.103222

    authors: Krahe M,Milligan E,Reilly S

    更新日期:2019-07-01 00:00:00

  • Deep learning with wearable based heart rate variability for prediction of mental and general health.

    abstract::The ubiquity and commoditisation of wearable biosensors (fitness bands) has led to a deluge of personal healthcare data, but with limited analytics typically fed back to the user. The feasibility of feeding back more complex, seemingly unrelated measures to users was investigated, by assessing whether increased levels...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2020.103610

    authors: Coutts LV,Plans D,Brown AW,Collomosse J

    更新日期:2020-12-01 00:00:00

  • Interacting agents through a web-based health serviceflow management system.

    abstract::The management of chronic and out-patients is a complex process which requires the cooperation of different agents belonging to several organizational units. Patients have to move to different locations to access the necessary services and to communicate their health status data. From their point of view there should ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2006.12.002

    authors: Leonardi G,Panzarasa S,Quaglini S,Stefanelli M,van der Aalst WM

    更新日期:2007-10-01 00:00:00

  • A knowledge-based system to find over-the-counter medicines for self-medication.

    abstract::This study developed a medicine query system based on Semantic Web and open data especially for self-medication users to search over-the-counter (OTC) medicines. Most existing medicine query systems are based on keyword searches. If users are uncertain about the exact search words, these query systems do not offer eff...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2020.103504

    authors: Sung HY,Chi YL

    更新日期:2020-08-01 00:00:00

  • Role of OpenEHR as an open source solution for the regional modelling of patient data in obstetrics.

    abstract::This work investigates, whether openEHR with its reference model, archetypes and templates is suitable for the digital representation of demographic as well as clinical data. Moreover, it elaborates openEHR as a tool for modelling Hospital Information Systems on a regional level based on a national logical infrastruct...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2015.04.004

    authors: Pahl C,Zare M,Nilashi M,de Faria Borges MA,Weingaertner D,Detschew V,Supriyanto E,Ibrahim O

    更新日期:2015-06-01 00:00:00

  • Temporal phenotyping of medically complex children via PARAFAC2 tensor factorization.

    abstract:OBJECTIVE:Our aim is to extract clinically-meaningful phenotypes from longitudinal electronic health records (EHRs) of medically-complex children. This is a fragile set of patients consuming a disproportionate amount of pediatric care resources but who often end up with sub-optimal clinical outcome. The rise in availab...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2019.103125

    authors: Perros I,Papalexakis EE,Vuduc R,Searles E,Sun J

    更新日期:2019-05-01 00:00:00