Recognizing clinical entities in hospital discharge summaries using Structural Support Vector Machines with word representation features.

Abstract:

BACKGROUND:Named entity recognition (NER) is an important task in clinical natural language processing (NLP) research. Machine learning (ML) based NER methods have shown good performance in recognizing entities in clinical text. Algorithms and features are two important factors that largely affect the performance of ML-based NER systems. Conditional Random Fields (CRFs), a sequential labelling algorithm, and Support Vector Machines (SVMs), which is based on large margin theory, are two typical machine learning algorithms that have been widely applied to clinical NER tasks. For features, syntactic and semantic information of context words has often been used in clinical NER systems. However, Structural Support Vector Machines (SSVMs), an algorithm that combines the advantages of both CRFs and SVMs, and word representation features, which contain word-level back-off information over large unlabelled corpus by unsupervised algorithms, have not been extensively investigated for clinical text processing. Therefore, the primary goal of this study is to evaluate the use of SSVMs and word representation features in clinical NER tasks. METHODS:In this study, we developed SSVMs-based NER systems to recognize clinical entities in hospital discharge summaries, using the data set from the concept extration task in the 2010 i2b2 NLP challenge. We compared the performance of CRFs and SSVMs-based NER classifiers with the same feature sets. Furthermore, we extracted two different types of word representation features (clustering-based representation features and distributional representation features) and integrated them with the SSVMs-based clinical NER system. We then reported the performance of SSVM-based NER systems with different types of word representation features. RESULTS AND DISCUSSION:Using the same training (N = 27,837) and test (N = 45,009) sets in the challenge, our evaluation showed that the SSVMs-based NER systems achieved better performance than the CRFs-based systems for clinical entity recognition, when same features were used. Both types of word representation features (clustering-based and distributional representations) improved the performance of ML-based NER systems. By combining two different types of word representation features together with SSVMs, our system achieved a highest F-measure of 85.82%, which outperformed the best system reported in the challenge by 0.6%. Our results show that SSVMs is a great potential algorithm for clinical NLP research, and both types of unsupervised word representation features are beneficial to clinical NER tasks.

authors

Tang B,Cao H,Wu Y,Jiang M,Xu H

doi

10.1186/1472-6947-13-S1-S1

subject

Has Abstract

pub_date

2013-01-01 00:00:00

pages

S1

issn

1472-6947

pii

1472-6947-13-S1-S1

journal_volume

13 Suppl 1

pub_type

杂志文章
  • CardioNet: a manually curated database for artificial intelligence-based research on cardiovascular diseases.

    abstract:BACKGROUND:Cardiovascular diseases (CVDs) are difficult to diagnose early and have risk factors that are easy to overlook. Early prediction and personalization of treatment through the use of artificial intelligence (AI) may help clinicians and patients manage CVDs more effectively. However, to apply AI approaches to C...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-021-01392-2

    authors: Ahn I,Na W,Kwon O,Yang DH,Park GM,Gwon H,Kang HJ,Jeong YU,Yoo J,Kim Y,Jun TJ,Kim YH

    更新日期:2021-01-28 00:00:00

  • An algorithm to identify patients with treated type 2 diabetes using medico-administrative data.

    abstract:BACKGROUND:National authorities have to follow the evolution of diabetes to implement public health policies. An algorithm was developed to identify patients with treated type 2 diabetes and estimate its annual prevalence in Luxembourg using health insurance claims when no diagnosis code is available. METHODS:The DIAB...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-11-23

    authors: Renard LM,Bocquet V,Vidal-Trecan G,Lair ML,Couffignal S,Blum-Boisgard C

    更新日期:2011-04-14 00:00:00

  • Decision-making in percutaneous coronary intervention: a survey.

    abstract:BACKGROUND:Few researchers have examined the perceptions of physicians referring cases for angiography regarding the degree to which collaboration occurs during percutaneous coronary intervention (PCI) decision-making. We sought to determine perceptions of physicians concerning their involvement in PCI decisions in cas...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-8-28

    authors: Rahilly-Tierney CR,Nash IS

    更新日期:2008-06-25 00:00:00

  • Patient and provider acceptance of telecoaching in type 2 diabetes: a mixed-method study embedded in a randomised clinical trial.

    abstract:BACKGROUND:Despite advances in diagnosis and treatment of type 2 diabetes, suboptimal metabolic control persists. Patient education in diabetes has been proved to enhance self-efficacy and guideline-driven treatment, however many people with type 2 diabetes do not have access to or do not participate in self-management...

    journal_title:BMC medical informatics and decision making

    pub_type: 临床试验,杂志文章

    doi:10.1186/s12911-016-0383-3

    authors: Odnoletkova I,Buysse H,Nobels F,Goderis G,Aertgeerts B,Annemans L,Ramaekers D

    更新日期:2016-11-09 00:00:00

  • An attention-based deep learning model for clinical named entity recognition of Chinese electronic medical records.

    abstract:BACKGROUND:Clinical named entity recognition (CNER) is important for medical information mining and establishment of high-quality knowledge map. Due to the different text features from natural language and a large number of professional and uncommon clinical terms in Chinese electronic medical records (EMRs), there are...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-019-0933-6

    authors: Li L,Zhao J,Hou L,Zhai Y,Shi J,Cui F

    更新日期:2019-12-05 00:00:00

  • An investigation of the effect of nurses' technology readiness on the acceptance of mobile electronic medical record systems.

    abstract:BACKGROUND:Adopting mobile electronic medical record (MEMR) systems is expected to be one of the superior approaches for improving nurses' bedside and point of care services. However, nurses may use the functions for far fewer tasks than the MEMR supports. This may depend on their technological personality associated t...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-13-88

    authors: Kuo KM,Liu CF,Ma CC

    更新日期:2013-08-12 00:00:00

  • An empirical study to determine factors that motivate and limit the implementation of ICT in healthcare environments.

    abstract:BACKGROUND:The maturity and usage of wireless technology has influenced health services, and this has raised expectations from users that healthcare services will become more affordable due to technology growth. There is increasing evidence to justify this expectation, as telehealth is becoming more and more prevalent ...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-14-98

    authors: Gururajan R,Hafeez-Baig A

    更新日期:2014-12-23 00:00:00

  • Development of a validation algorithm for 'present on admission' flagging.

    abstract:BACKGROUND:The use of routine hospital data for understanding patterns of adverse outcomes has been limited in the past by the fact that pre-existing and post-admission conditions have been indistinguishable. The use of a 'Present on Admission' (or POA) indicator to distinguish pre-existing or co-morbid conditions from...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-9-48

    authors: Jackson TJ,Michel JL,Roberts R,Shepheard J,Cheng D,Rust J,Perry C

    更新日期:2009-12-01 00:00:00

  • Understanding factors affecting patient and public engagement and recruitment to digital health interventions: a systematic review of qualitative studies.

    abstract:BACKGROUND:Numerous types of digital health interventions (DHIs) are available to patients and the public but many factors affect their ability to engage and enrol in them. This systematic review aims to identify and synthesise the qualitative literature on barriers and facilitators to engagement and recruitment to DHI...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章,评审

    doi:10.1186/s12911-016-0359-3

    authors: O'Connor S,Hanlon P,O'Donnell CA,Garcia S,Glanville J,Mair FS

    更新日期:2016-09-15 00:00:00

  • The Computer-based Health Evaluation Software (CHES): a software for electronic patient-reported outcome monitoring.

    abstract:BACKGROUND:Patient-reported Outcomes (PROs) capturing e.g., quality of life, fatigue, depression, medication side-effects or disease symptoms, have become important outcome parameters in medical research and daily clinical practice. Electronic PRO data capture (ePRO) with software packages to administer questionnaires,...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-12-126

    authors: Holzner B,Giesinger JM,Pinggera J,Zugal S,Schöpf F,Oberguggenberger AS,Gamper EM,Zabernigg A,Weber B,Rumpold G

    更新日期:2012-11-09 00:00:00

  • "Assessment of the social influence and facilitating conditions that support nurses' adoption of hospital electronic information management systems (HEIMS) in Ghana using the unified theory of acceptance and use of technology (UTAUT) model".

    abstract:BACKGROUND:Hospital electronic information management systems (HEIMS) are widely used in Ghana, and hence its performance must be carefully assessed. Nurses as clinical health personnel are the largest cluster of hospital staff and are the pillar of healthcare delivery. Therefore, they play a crucial role in the adopti...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-019-0956-z

    authors: Zhou LL,Owusu-Marfo J,Asante Antwi H,Antwi MO,Kachie ADT,Ampon-Wireko S

    更新日期:2019-11-21 00:00:00

  • Evidence-based medicine among internal medicine residents in a community hospital program using smart phones.

    abstract:BACKGROUND:This study implemented and evaluated a point-of-care, wireless Internet access using smart phones for information retrieval during daily clinical rounds and academic activities of internal medicine residents in a community hospital. We did the project to assess the feasibility of using smart phones as an alt...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-7-5

    authors: León SA,Fontelo P,Green L,Ackerman M,Liu F

    更新日期:2007-02-21 00:00:00

  • Integral strategy to supportive care in breast cancer survivors through occupational therapy and a m-health system: design of a randomized clinical trial.

    abstract:BACKGROUND:Technological support using e-health mobile applications (m-health) is a promising strategy to improve the adherence to healthy lifestyles in breast cancer survivors (excess in energy intake or low physical activity are determinants of the risk of recurrence, second cancers and cancer mortality). Moreover, c...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章,随机对照试验

    doi:10.1186/s12911-016-0394-0

    authors: Lozano-Lozano M,Martín-Martín L,Galiano-Castillo N,Álvarez-Salvago F,Cantarero-Villanueva I,Fernández-Lao C,Sánchez-Salado C,Arroyo-Morales M

    更新日期:2016-11-25 00:00:00

  • Brain mapping and detection of functional patterns in fMRI using wavelet transform; application in detection of dyslexia.

    abstract:BACKGROUND:Functional Magnetic Resonance Imaging (fMRI) has been proven to be useful for studying brain functions. However, due to the existence of noise and distortion, mapping between the fMRI signal and the actual neural activity is difficult. Because of the difficulty, differential pattern analysis of fMRI brain im...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-9-S1-S6

    authors: Ji SY,Ward K,Najarian K

    更新日期:2009-11-03 00:00:00

  • Atrial fibrillation classification based on convolutional neural networks.

    abstract:BACKGROUND:The global age-adjusted mortality rate related to atrial fibrillation (AF) registered a rapid growth in the last four decades, i.e., from 0.8 to 1.6 and 0.9 to 1.7 per 100,000 for men and women during 1990-2010, respectively. In this context, this study uses convolutional neural networks for classifying (dia...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-019-0946-1

    authors: Lee KS,Jung S,Gil Y,Son HS

    更新日期:2019-10-29 00:00:00

  • Visibility of medical informatics regarding bibliometric indices and databases.

    abstract:BACKGROUND:The quantitative study of the publication output (bibliometrics) deeply influences how scientific work is perceived (bibliometric visibility). Recently, new bibliometric indices and databases have been established, which may change the visibility of disciplines, institutions and individuals. This study exami...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-11-24

    authors: Spreckelsen C,Deserno TM,Spitzer K

    更新日期:2011-04-15 00:00:00

  • Predicting patient-reported outcomes following hip and knee replacement surgery using supervised machine learning.

    abstract:BACKGROUND:Machine-learning classifiers mostly offer good predictive performance and are increasingly used to support shared decision-making in clinical practice. Focusing on performance and practicability, this study evaluates prediction of patient-reported outcomes (PROs) by eight supervised classifiers including a l...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-018-0731-6

    authors: Huber M,Kurz C,Leidl R

    更新日期:2019-01-08 00:00:00

  • A cohort study of a tailored web intervention for preconception care.

    abstract:BACKGROUND:Preconception care may be an efficacious tool to reduce risk factors for adverse pregnancy outcomes that are associated with lifestyles and health status before pregnancy. We conducted a web-based cohort study in Italian women planning a pregnancy to assess whether a tailored web intervention may change know...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-14-33

    authors: Agricola E,Pandolfi E,Gonfiantini MV,Gesualdo F,Romano M,Carloni E,Mastroiacovo P,Tozzi AE

    更新日期:2014-04-15 00:00:00

  • Barriers to exchanging healthcare information in inter-municipal healthcare services: a qualitative case study.

    abstract:BACKGROUND:In recent years, inter-municipal cooperation in healthcare services has been an important measure implemented to meet future demographic changes in western countries. This entails an increased focus on communication and information sharing across organisational borders. Technology enables efficient and effec...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-018-0701-z

    authors: Holen-Rabbersvik E,Thygesen E,Eikebrokk TR,Fensli RW,Slettebø Å

    更新日期:2018-11-07 00:00:00

  • Correction to: The International Conference on Intelligent Biology and Medicine 2019: computational methods for drug interactions.

    abstract::After publication of this supplement article [1], it is requested the grant ID in the Funding section should be corrected from NSF grant IIS-7811367 to NSF grant IIS-1902617. ...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章,已发布勘误

    doi:10.1186/s12911-020-1096-1

    authors: Ning X,Zhang C,Wang K,Zhao Z,Mathé E

    更新日期:2020-04-28 00:00:00

  • Medicine in words and numbers: a cross-sectional survey comparing probability assessment scales.

    abstract:BACKGROUND:In the complex domain of medical decision making, reasoning under uncertainty can benefit from supporting tools. Automated decision support tools often build upon mathematical models, such as Bayesian networks. These networks require probabilities which often have to be assessed by experts in the domain of a...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-7-13

    authors: Witteman CL,Renooij S,Koele P

    更新日期:2007-06-11 00:00:00

  • Locating previously unknown patterns in data-mining results: a dual data- and knowledge-mining method.

    abstract:BACKGROUND:Data mining can be utilized to automate analysis of substantial amounts of data produced in many organizations. However, data mining produces large numbers of rules and patterns, many of which are not useful. Existing methods for pruning uninteresting patterns have only begun to automate the knowledge acquis...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-6-13

    authors: Siadaty MS,Knaus WA

    更新日期:2006-03-07 00:00:00

  • An open access medical knowledge base for community driven diagnostic decision support system development.

    abstract:INTRODUCTION:While early diagnostic decision support systems were built around knowledge bases, more recent systems employ machine learning to consume large amounts of health data. We argue curated knowledge bases will remain an important component of future diagnostic decision support systems by providing ground truth...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-019-0804-1

    authors: Müller L,Gangadharaiah R,Klein SC,Perry J,Bernstein G,Nurkse D,Wailes D,Graham R,El-Kareh R,Mehta S,Vinterbo SA,Aronoff-Spencer E

    更新日期:2019-04-27 00:00:00

  • The relationship between user interface problems of an admission, discharge and transfer module and usability features: a usability testing method.

    abstract:BACKGROUND:The admission, discharge and transfer (ADT) module is used in the hospital information system (HIS) for the purposes of managing appointments, patient admission, daily control of hospital beds, planning surgery procedures, keeping up-to-date on patient discharges, and registering patient transfers within or ...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-019-0893-x

    authors: Farrahi R,Rangraz Jeddi F,Nabovati E,Sadeqi Jabali M,Khajouei R

    更新日期:2019-08-24 00:00:00

  • The role and benefits of accessing primary care patient records during unscheduled care: a systematic review.

    abstract:BACKGROUND:The purpose of this study was to assess the impact of accessing primary care records on unscheduled care. Unscheduled care is typically delivered in hospital Emergency Departments. Studies published to December 2014 reporting on primary care record access during unscheduled care were retrieved. RESULTS:Twen...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章,评审

    doi:10.1186/s12911-017-0523-4

    authors: Bowden T,Coiera E

    更新日期:2017-09-22 00:00:00

  • Dynamic prediction of hospital admission with medical claim data.

    abstract:BACKGROUND:Congestive heart failure is one of the most common reasons those aged 65 and over are hospitalized in the United States, which has caused a considerable economic burden. The precise prediction of hospitalization caused by congestive heart failure in the near future could prevent possible hospitalization, opt...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-019-0734-y

    authors: Yang T,Yang Y,Jia Y,Li X

    更新日期:2019-01-31 00:00:00

  • Dual processing model of medical decision-making.

    abstract:BACKGROUND:Dual processing theory of human cognition postulates that reasoning and decision-making can be described as a function of both an intuitive, experiential, affective system (system I) and/or an analytical, deliberative (system II) processing system. To date no formal descriptive model of medical decision-maki...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-12-94

    authors: Djulbegovic B,Hozo I,Beckstead J,Tsalatsanis A,Pauker SG

    更新日期:2012-09-03 00:00:00

  • Using an electronic medical record (EMR) to conduct clinical trials: Salford Lung Study feasibility.

    abstract:BACKGROUND:Real-world data on the benefit/risk profile of medicines is needed, particularly in patients who are ineligible for randomised controlled trials conducted for registration purposes. This paper describes the methodology and source data verification which enables the conduct of pre-licensing clinical trials of...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-015-0132-z

    authors: Elkhenini HF,Davis KJ,Stein ND,New JP,Delderfield MR,Gibson M,Vestbo J,Woodcock A,Bakerly ND

    更新日期:2015-02-07 00:00:00

  • SURF: identifying and allocating resources during Out-of-Hospital Cardiac Arrest.

    abstract:BACKGROUND:When an Out-of-Hospital Cardiac Arrest (OHCA) incident is reported to emergency services, the 911 agent dispatches Emergency Medical Services to the location and activates responder network system (RNS), if the option is available. The RNS notifies all the registered users in the vicinity of the cardiac arre...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-020-01334-4

    authors: Rao G,Choudhury S,Lingras P,Savage D,Mago V

    更新日期:2020-12-30 00:00:00

  • Optimal sequence of tests for the mediastinal staging of non-small cell lung cancer.

    abstract:BACKGROUND:Non-small cell lung cancer (NSCLC) is the most prevalent type of lung cancer and the most difficult to predict. When there are no distant metastases, the optimal therapy depends mainly on whether there are malignant lymph nodes in the mediastinum. Given the vigorous debate among specialists about which tests...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-016-0246-y

    authors: Luque M,Díez FJ,Disdier C

    更新日期:2016-01-26 00:00:00