Evaluation of a rule-based method for epidemiological document classification towards the automation of systematic reviews.

Abstract:

INTRODUCTION:Most data extraction efforts in epidemiology are focused on obtaining targeted information from clinical trials. In contrast, limited research has been conducted on the identification of information from observational studies, a major source for human evidence in many fields, including environmental health. The recognition of key epidemiological information (e.g., exposures) through text mining techniques can assist in the automation of systematic reviews and other evidence summaries. METHOD:We designed and applied a knowledge-driven, rule-based approach to identify targeted information (study design, participant population, exposure, outcome, confounding factors, and the country where the study was conducted) from abstracts of epidemiological studies included in several systematic reviews of environmental health exposures. The rules were based on common syntactical patterns observed in text and are thus not specific to any systematic review. To validate the general applicability of our approach, we compared the data extracted using our approach versus hand curation for 35 epidemiological study abstracts manually selected for inclusion in two systematic reviews. RESULTS:The returned F-score, precision, and recall ranged from 70% to 98%, 81% to 100%, and 54% to 97%, respectively. The highest precision was observed for exposure, outcome and population (100%) while recall was best for exposure and study design with 97% and 89%, respectively. The lowest recall was observed for the population (54%), which also had the lowest F-score (70%). CONCLUSION:The generated performance of our text-mining approach demonstrated encouraging results for the identification of targeted information from observational epidemiological study abstracts related to environmental exposures. We have demonstrated that rules based on generic syntactic patterns in one corpus can be applied to other observational study design by simple interchanging the dictionaries aiming to identify certain characteristics (i.e., outcomes, exposures). At the document level, the recognised information can assist in the selection and categorization of studies included in a systematic review.

journal_name

J Biomed Inform

authors

Karystianis G,Thayer K,Wolfe M,Tsafnat G

doi

10.1016/j.jbi.2017.04.004

subject

Has Abstract

pub_date

2017-06-01 00:00:00

pages

27-34

eissn

1532-0464

issn

1532-0480

pii

S1532-0464(17)30073-4

journal_volume

70

pub_type

杂志文章,评审
  • Methodological variations in lagged regression for detecting physiologic drug effects in EHR data.

    abstract::We studied how lagged linear regression can be used to detect the physiologic effects of drugs from data in the electronic health record (EHR). We systematically examined the effect of methodological variations ((i) time series construction, (ii) temporal parameterization, (iii) intra-subject normalization, (iv) diffe...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2018.08.014

    authors: Levine ME,Albers DJ,Hripcsak G

    更新日期:2018-10-01 00:00:00

  • Modeling individual differences: A case study of the application of system identification for personalizing a physical activity intervention.

    abstract:BACKGROUND:Control systems engineering methods, particularly, system identification (system ID), offer an idiographic (i.e., person-specific) approach to develop dynamic models of physical activity (PA) that can be used to personalize interventions in a systematic, scalable way. The purpose of this work is to: (1) appl...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2018.01.010

    authors: Phatak SS,Freigoun MT,Martín CA,Rivera DE,Korinek EV,Adams MA,Buman MP,Klasnja P,Hekler EB

    更新日期:2018-03-01 00:00:00

  • Role of OpenEHR as an open source solution for the regional modelling of patient data in obstetrics.

    abstract::This work investigates, whether openEHR with its reference model, archetypes and templates is suitable for the digital representation of demographic as well as clinical data. Moreover, it elaborates openEHR as a tool for modelling Hospital Information Systems on a regional level based on a national logical infrastruct...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2015.04.004

    authors: Pahl C,Zare M,Nilashi M,de Faria Borges MA,Weingaertner D,Detschew V,Supriyanto E,Ibrahim O

    更新日期:2015-06-01 00:00:00

  • Prediction of influenza vaccination outcome by neural networks and logistic regression.

    abstract::The major challenge in influenza vaccination is to predict vaccine efficacy. The purpose of this study was to design a model to enable successful prediction of the outcome of influenza vaccination based on real historical medical data. A non-linear neural network approach was used, and its performance compared to logi...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2010.04.011

    authors: Trtica-Majnaric L,Zekic-Susac M,Sarlija N,Vitale B

    更新日期:2010-10-01 00:00:00

  • Predictive modeling of bacterial infections and antibiotic therapy needs in critically ill adults.

    abstract::Unnecessary antibiotic regimens in the intensive care unit (ICU) are associated with adverse patient outcomes and antimicrobial resistance. Bacterial infections (BI) are both common and deadly in ICUs, and as a result, patients with a suspected BI are routinely started on broad-spectrum antibiotics prior to having con...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2020.103540

    authors: Eickelberg G,Sanchez-Pinto LN,Luo Y

    更新日期:2020-09-01 00:00:00

  • Selecting significant genes by randomization test for cancer classification using gene expression data.

    abstract::Gene selection is an important task in bioinformatics studies, because the accuracy of cancer classification generally depends upon the genes that have biological relevance to the classifying problems. In this work, randomization test (RT) is used as a gene selection method for dealing with gene expression data. In th...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2013.03.009

    authors: Mao Z,Cai W,Shao X

    更新日期:2013-08-01 00:00:00

  • Predicting the function of transplanted kidney in long-term care processes: Application of a hybrid model.

    abstract:BACKGROUND:A tool that can predict the estimated glomerular filtration rate (eGFR) in routine daily care can help clinicians to make better decisions for kidney transplant patients and to improve transplantation outcome. In this paper, we proposed a hybrid prediction model for predicting a future value for eGFR during ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2019.103116

    authors: Rashidi Khazaee P,Bagherzadeh M J,Niazkhani Z,Pirnejad H

    更新日期:2019-03-01 00:00:00

  • Comparison of orthogonal NLP methods for clinical phenotyping and assessment of bone scan utilization among prostate cancer patients.

    abstract:OBJECTIVE:Clinical care guidelines recommend that newly diagnosed prostate cancer patients at high risk for metastatic spread receive a bone scan prior to treatment and that low risk patients not receive it. The objective was to develop an automated pipeline to interrogate heterogeneous data to evaluate the use of bone...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2019.103184

    authors: Coquet J,Bozkurt S,Kan KM,Ferrari MK,Blayney DW,Brooks JD,Hernandez-Boussard T

    更新日期:2019-06-01 00:00:00

  • Automated identification of adverse events related to central venous catheters.

    abstract::Methods for surveillance of adverse events (AEs) in clinical settings are limited by cost, technology, and appropriate data availability. In this study, two methods for semi-automated review of text records within the Veterans Administration database are utilized to identify AEs related to the placement of central ven...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2006.06.003

    authors: Penz JF,Wilcox AB,Hurdle JF

    更新日期:2007-04-01 00:00:00

  • MorphoCol: An ontology-based knowledgebase for the characterisation of clinically significant bacterial colony morphologies.

    abstract:BACKGROUND:One of the major concerns of the biomedical community is the increasing prevalence of antimicrobial resistant microorganisms. Recent findings show that the diversification of colony morphology may be indicative of the expression of virulence factors and increased resistance to antibiotic therapeutics. To tra...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2015.03.007

    authors: Sousa AM,Pereira MO,Lourenço A

    更新日期:2015-06-01 00:00:00

  • A Hidden Semi-Markov Model based approach for rehabilitation exercise assessment.

    abstract::In this paper, a Hidden Semi-Markov Model (HSMM) based approach is proposed to evaluate and monitor body motion during a rehabilitation training program. The approach extracts clinically relevant motion features from skeleton joint trajectories, acquired by the RGB-D camera, and provides a score for the subject's perf...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2017.12.012

    authors: Capecci M,Ceravolo MG,Ferracuti F,Iarlori S,Kyrki V,Monteriù A,Romeo L,Verdini F

    更新日期:2018-02-01 00:00:00

  • Information extraction from biomedical text.

    abstract::Information extraction is the process of scanning text for information relevant to some interest, including extracting entities, relations, and events. It requires deeper analysis than key word searches, but its aims fall short of the very hard and long-term problem of full text understanding. Information extraction r...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/s1532-0464(03)00015-7

    authors: Hobbs JR

    更新日期:2002-08-01 00:00:00

  • A machine-learned knowledge discovery method for associating complex phenotypes with complex genotypes. Application to pain.

    abstract:BACKGROUND:The association of genotyping information with common traits is not satisfactorily solved. One of the most complex traits is pain and association studies have failed so far to provide reproducible predictions of pain phenotypes from genotypes in the general population despite a well-established genetic basis...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2013.07.010

    authors: Lötsch J,Ultsch A

    更新日期:2013-10-01 00:00:00

  • MeSHing molecular sequences and clinical trials: a feasibility study.

    abstract::The centralized and public availability of molecular sequence and clinical trial data presents an opportunity to identify potentially valuable linkages across the bench-to-bedside "T1" translational barrier. In this study, we sought to leverage keyword metadata (Medical Subject Heading [MeSH] descriptors) to infer rel...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2009.10.003

    authors: Chen ES,Sarkar IN

    更新日期:2010-06-01 00:00:00

  • The use of logic relationships to model colon cancer gene expression networks with mRNA microarray data.

    abstract::The ultimate goal of genomics research is to describe the network of molecules and interactions that govern all biological functions and disease processes in cells. Nonlinear interactions among genes in terms of their logic relationships play a key role for deciphering the networks of molecules that underlie cellular ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2007.11.006

    authors: Ruan X,Wang J,Li H,Perozzi RE,Perozzi EF

    更新日期:2008-08-01 00:00:00

  • Introducing RFID technology in dynamic and time-critical medical settings: requirements and challenges.

    abstract::We describe the process of introducing RFID technology in the trauma bay of a trauma center to support fast-paced and complex teamwork during resuscitation. We analyzed trauma resuscitation tasks, photographs of medical tools, and videos of simulated resuscitations to gain insight into resuscitation tasks, work practi...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2012.04.003

    authors: Parlak S,Sarcevic A,Marsic I,Burd RS

    更新日期:2012-10-01 00:00:00

  • Analysis of eligibility criteria representation in industry-standard clinical trial protocols.

    abstract::Previous research on standardization of eligibility criteria and its feasibility has traditionally been conducted on clinical trial protocols from ClinicalTrials.gov (CT). The portability and use of such standardization for full-text industry-standard protocols has not been studied in-depth. Towards this end, in this ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2013.06.001

    authors: Bhattacharya S,Cantor MN

    更新日期:2013-10-01 00:00:00

  • Creating hospital-specific customized clinical pathways by applying semantic reasoning to clinical data.

    abstract:OBJECTIVE:Clinical pathways (CPs) are widely studied methods to standardize clinical intervention and improve medical quality. However, standard care plans defined in current CPs are too general to execute in a practical healthcare environment. The purpose of this study was to create hospital-specific personalized CPs ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2014.07.017

    authors: Wang HQ,Zhou TS,Tian LL,Qian YM,Li JS

    更新日期:2014-12-01 00:00:00

  • Identifying complexity in infectious diseases inpatient settings: An observation study.

    abstract:BACKGROUND:Understanding complexity in healthcare has the potential to reduce decision and treatment uncertainty. Therefore, identifying both patient and task complexity may offer better task allocation and design recommendation for next-generation health information technology system design. OBJECTIVE:To identify spe...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2016.10.018

    authors: Roosan D,Weir C,Samore M,Jones M,Rahman M,Stoddard GJ,Del Fiol G

    更新日期:2017-07-01 00:00:00

  • 3D interactive surgical visualization system using mobile spatial information acquisition and autostereoscopic display.

    abstract::Three-dimensional (3D) visualization of preoperative and intraoperative medical information becomes more and more important in minimally invasive surgery. We develop a 3D interactive surgical visualization system using mobile spatial information acquisition and autostereoscopic display for surgeons to observe surgical...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2017.05.014

    authors: Fan Z,Weng Y,Chen G,Liao H

    更新日期:2017-07-01 00:00:00

  • Concept and implementation of a study dashboard module for a continuous monitoring of trial recruitment and documentation.

    abstract:BACKGROUND:The difficulty of managing patient recruitment and documentation for clinical trials prompts a demand for instruments for closely monitoring these critical but unpredictable processes. Increasingly adopted Electronic Data Capture (EDC) applications provide novel opportunities to reutilize stored information ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2016.10.010

    authors: Toddenroth D,Sivagnanasundaram J,Prokosch HU,Ganslandt T

    更新日期:2016-12-01 00:00:00

  • Integrating cancer diagnosis terminologies based on logical definitions of SNOMED CT concepts.

    abstract::In oncology, the reuse of data is confronted with the heterogeneity of terminologies. It is necessary to semantically integrate these distinct terminologies. The semantic integration by using a third terminology as a support is a conventional approach for the integration of two terminologies that are not very structur...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2017.08.013

    authors: Nikiema JN,Jouhet V,Mougin F

    更新日期:2017-10-01 00:00:00

  • Developing, implementing, and evaluating decision support systems for shared decision making in patient care: a conceptual model and case illustration.

    abstract::The importance of including patient preferences in decisions regarding their care has received increased emphasis over recent years. Medical informatics can play an important role in improving patient-centered care by developing decision support systems to support the inclusion of patient preferences in clinical decis...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/s1532-0464(03)00037-6

    authors: Ruland CM,Bakken S

    更新日期:2002-10-01 00:00:00

  • Analysis of microarray leukemia data using an efficient MapReduce-based K-nearest-neighbor classifier.

    abstract::Microarray-based gene expression profiling has emerged as an efficient technique for classification, prognosis, diagnosis, and treatment of cancer. Frequent changes in the behavior of this disease generates an enormous volume of data. Microarray data satisfies both the veracity and velocity properties of big data, as ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2016.03.002

    authors: Kumar M,Rath NK,Rath SK

    更新日期:2016-04-01 00:00:00

  • Temporal phenotyping of medically complex children via PARAFAC2 tensor factorization.

    abstract:OBJECTIVE:Our aim is to extract clinically-meaningful phenotypes from longitudinal electronic health records (EHRs) of medically-complex children. This is a fragile set of patients consuming a disproportionate amount of pediatric care resources but who often end up with sub-optimal clinical outcome. The rise in availab...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2019.103125

    authors: Perros I,Papalexakis EE,Vuduc R,Searles E,Sun J

    更新日期:2019-05-01 00:00:00

  • A multivariate time series approach to modeling and forecasting demand in the emergency department.

    abstract:STUDY OBJECTIVE:The goals of this investigation were to study the temporal relationships between the demands for key resources in the emergency department (ED) and the inpatient hospital, and to develop multivariate forecasting models. METHODS:Hourly data were collected from three diverse hospitals for the year 2006. ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2008.05.003

    authors: Jones SS,Evans RS,Allen TL,Thomas A,Haug PJ,Welch SJ,Snow GL

    更新日期:2009-02-01 00:00:00

  • Lessons learnt from the DDIExtraction-2013 Shared Task.

    abstract::The DDIExtraction Shared Task 2013 is the second edition of the DDIExtraction Shared Task series, a community-wide effort to promote the implementation and comparative assessment of natural language processing (NLP) techniques in the field of the pharmacovigilance domain, in particular, to address the extraction of dr...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2014.05.007

    authors: Segura-Bedmar I,Martínez P,Herrero-Zazo M

    更新日期:2014-10-01 00:00:00

  • The development and validation of a simulation tool for health policy decision making.

    abstract::Computer simulations have been used to model infectious diseases to examine the outcomes of alternative strategies for managing their spread. Methicillin resistant Staphylococcus aureus (MRSA) skin and soft tissue infections have become prominent in many communities and efforts are underway to reduce the spread of thi...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2010.03.013

    authors: Panchanathan SS,Petitti DB,Fridsma DB

    更新日期:2010-08-01 00:00:00

  • Computing with evidence Part II: An evidential approach to predicting metabolic drug-drug interactions.

    abstract::We describe a novel experiment that we conducted with the Drug Interaction Knowledge-base (DIKB) to determine which combinations of evidence enable a rule-based theory of metabolic drug-drug interactions to make the most optimal set of predictions. The focus of the experiment was a group of 16 drugs including six memb...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2009.05.010

    authors: Boyce R,Collins C,Horn J,Kalet I

    更新日期:2009-12-01 00:00:00

  • Balancing volume and duration of information consumption by physicians: The case of health information exchange in critical care.

    abstract:BACKGROUND:The realization of the potential benefits of health information exchange systems (HIEs) for emergency departments (EDs) depends on the way these systems are actually used. The attributes of volume of information and duration of information processing are important for the study of HIE use patterns in the ED,...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2017.05.007

    authors: Politi L,Codish S,Sagy I,Fink L

    更新日期:2017-07-01 00:00:00