Use of morphological analysis in protein name recognition.

Abstract:

Protein name recognition aims to detect every protein name appearing in a PubMed abstract. The task is not simple, as the graphic word boundary (space separator) assumed in conventional preprocessing does not necessarily coincide with the protein name boundary. Such boundary disagreement, caused by tokenization ambiguity, has usually been ignored in conventional preprocessing of general English. In this paper, we argue that boundary disagreement poses serious limitations in biomedical English text processing, not to mention protein name recognition. Our key idea for dealing with the boundary disagreement is to apply techniques used in Japanese morphological analysis, where there are no word boundaries. Having evaluated the proposed method on GENIA corpus 3.02, we obtain an F-measure of 69.01 on a strict criterion and 79.32 on a relaxed criterion. The result is comparable to other published work in protein name recognition, without resorting to manually prepared ad hoc feature engineering. Further, compared to the conventional preprocessing, the use of morphological analysis as preprocessing improves the performance of protein name recognition and reduces the execution time.
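
The boundary disagreement described in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the sentence, the annotated names, and the helper functions `whitespace_token_spans` and `boundary_disagreements` are all hypothetical, chosen only to show an annotated protein name whose boundary does not fall on a space-separated token boundary.

```python
import re


def whitespace_token_spans(text):
    """Return (start, end) character offsets of space-separated tokens."""
    return [(m.start(), m.end()) for m in re.finditer(r"\S+", text)]


def boundary_disagreements(text, name_spans):
    """Return the name spans whose start or end is not a token boundary."""
    tokens = whitespace_token_spans(text)
    starts = {s for s, _ in tokens}
    ends = {e for _, e in tokens}
    return [(s, e) for s, e in name_spans if s not in starts or e not in ends]


def span_of(text, name):
    """Locate the first occurrence of name in text as a (start, end) span."""
    start = text.index(name)
    return (start, start + len(name))


# Hypothetical example sentence and annotations (not taken from GENIA).
text = "The p53-mediated response requires NF-kappa B activation."
names = [span_of(text, "p53"), span_of(text, "NF-kappa B")]

# "p53" ends inside the whitespace token "p53-mediated", so it disagrees;
# "NF-kappa B" covers two whole tokens, so its boundaries agree.
mismatched = [text[s:e] for s, e in boundary_disagreements(text, names)]
print(mismatched)
```

In this toy case a tokenizer that splits only on spaces can never isolate "p53", which is the kind of limitation the abstract attributes to conventional preprocessing.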

journal_name

J Biomed Inform

authors

Yamamoto K,Kudo T,Konagaya A,Matsumoto Y

doi

10.1016/j.jbi.2004.08.001

pub_date

2004-12-01 00:00:00

pages

471-82

issue

6

eissn

1532-0480

issn

1532-0464

pii

S1532-0464(04)00079-6

journal_volume

37

pub_type

Journal Article
  • Cognitive simulators for medical education and training.

    abstract::Simulators for honing procedural skills (such as surgical skills and central venous catheter placement) have proven to be valuable tools for medical educators and students. While such simulations represent an effective paradigm in surgical education, there is an opportunity to add a layer of cognitive exercises to the...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2009.02.008

    authors: Kahol K,Vankipuram M,Smith ML

    update_date:2009-08-01 00:00:00

  • The REDCap consortium: Building an international community of software platform partners.

    abstract::The Research Electronic Data Capture (REDCap) data management platform was developed in 2004 to address an institutional need at Vanderbilt University, then shared with a limited number of adopting sites beginning in 2006. Given bi-directional benefit in early sharing experiments, we created a broader consortium shari...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2019.103208

    authors: Harris PA,Taylor R,Minor BL,Elliott V,Fernandez M,O'Neal L,McLeod L,Delacqua G,Delacqua F,Kirby J,Duda SN,REDCap Consortium.

    update_date:2019-07-01 00:00:00

  • A Health Surveillance Software Framework to deliver information on preventive healthcare strategies.

    abstract::A software framework can reduce costs related to the development of an application because it allows developers to reuse both design and code. Recently, companies and research groups have announced that they have been employing health software frameworks. This paper presents the design, proof-of-concept implementation...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2016.06.002

    authors: Macedo AA,Pollettini JT,Baranauskas JA,Chaves JC

    update_date:2016-08-01 00:00:00

  • Annotating longitudinal clinical narratives for de-identification: The 2014 i2b2/UTHealth corpus.

    abstract::The 2014 i2b2/UTHealth natural language processing shared task featured a track focused on the de-identification of longitudinal medical records. For this track, we de-identified a set of 1304 longitudinal medical records describing 296 patients. This corpus was de-identified under a broad interpretation of the HIPAA ...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2015.07.020

    authors: Stubbs A,Uzuner Ö

    update_date:2015-12-01 00:00:00

  • Tracking a moving user in indoor environments using Bluetooth low energy beacons.

    abstract:BACKGROUND:Bluetooth low energy (BLE) beacons have been used to track the locations of individuals in indoor environments for clinical applications such as workflow analysis and infectious disease modelling. Most current approaches use the received signal strength indicator (RSSI) to track locations. When using the RSS...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2019.103288

    authors: Surian D,Kim V,Menon R,Dunn AG,Sintchenko V,Coiera E

    update_date:2019-10-01 00:00:00

  • A controlled greedy supervised approach for co-reference resolution on clinical text.

    abstract::Identification of co-referent entity mentions inside text has significant importance for other natural language processing (NLP) tasks (e.g. event linking). However, this task, known as co-reference resolution, remains a complex problem, partly because of the confusion over different evaluation metrics and partly beca...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2013.03.007

    authors: Chowdhury MF,Zweigenbaum P

    update_date:2013-06-01 00:00:00

  • Using natural language processing to extract mammographic findings.

    abstract:OBJECTIVE:Structured data on mammographic findings are difficult to obtain without manual review. We developed and evaluated a rule-based natural language processing (NLP) system to extract mammographic findings from free-text mammography reports. MATERIALS AND METHODS:The NLP system extracted four mammographic findin...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2015.01.010

    authors: Gao H,Aiello Bowles EJ,Carrell D,Buist DS

    update_date:2015-04-01 00:00:00

  • Quantifying semantic similarity of clinical evidence in the biomedical literature to facilitate related evidence synthesis.

    abstract:OBJECTIVE:Published clinical trials and high quality peer reviewed medical publications are considered as the main sources of evidence used for synthesizing systematic reviews or practicing Evidence Based Medicine (EBM). Finding all relevant published evidence for a particular medical case is a time and labour intensiv...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2019.103321

    authors: Hassanzadeh H,Nguyen A,Verspoor K

    update_date:2019-12-01 00:00:00

  • Visual aggregate analysis of eligibility features of clinical trials.

    abstract:OBJECTIVE:To develop a method for profiling the collective populations targeted for recruitment by multiple clinical studies addressing the same medical condition using one eligibility feature each time. METHODS:Using a previously published database COMPACT as the backend, we designed a scalable method for visual aggr...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2015.01.005

    authors: He Z,Carini S,Sim I,Weng C

    update_date:2015-04-01 00:00:00

  • Development of the nursing problem list subset of SNOMED CT®.

    abstract:OBJECTIVE:To create an interoperable set of nursing diagnoses for use in the patient problem list in the EHR to support interoperability. DESIGN:Queries for nursing diagnostic concepts were executed against the UMLS Metathesaurus to retrieve all nursing diagnoses across four nursing terminologies where the concept was...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2011.12.003

    authors: Matney SA,Warren JJ,Evans JL,Kim TY,Coenen A,Auld VA

    update_date:2012-08-01 00:00:00

  • The Analytic Information Warehouse (AIW): a platform for analytics using electronic health record data.

    abstract:OBJECTIVE:To create an analytics platform for specifying and detecting clinical phenotypes and other derived variables in electronic health record (EHR) data for quality improvement investigations. MATERIALS AND METHODS:We have developed an architecture for an Analytic Information Warehouse (AIW). It supports transfor...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2013.01.005

    authors: Post AR,Kurc T,Cholleti S,Gao J,Lin X,Bornstein W,Cantrell D,Levine D,Hohmann S,Saltz JH

    update_date:2013-06-01 00:00:00

  • Phenotypic similarity for rare disease: Ciliopathy diagnoses and subtyping.

    abstract::Rare diseases are often hard and long to be diagnosed precisely, and most of them lack approved treatment. For some complex rare diseases, precision medicine approach is further required to stratify patients into homogeneous subgroups based on the clinical, biological or molecular features. In such situation, deep phe...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2019.103308

    authors: Chen X,Garcelon N,Neuraz A,Billot K,Lelarge M,Bonald T,Garcia H,Martin Y,Benoit V,Vincent M,Faour H,Douillet M,Lyonnet S,Saunier S,Burgun A

    update_date:2019-12-01 00:00:00

  • Predictive modeling of bacterial infections and antibiotic therapy needs in critically ill adults.

    abstract::Unnecessary antibiotic regimens in the intensive care unit (ICU) are associated with adverse patient outcomes and antimicrobial resistance. Bacterial infections (BI) are both common and deadly in ICUs, and as a result, patients with a suspected BI are routinely started on broad-spectrum antibiotics prior to having con...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2020.103540

    authors: Eickelberg G,Sanchez-Pinto LN,Luo Y

    update_date:2020-09-01 00:00:00

  • TEMPTING system: a hybrid method of rule and machine learning for temporal relation extraction in patient discharge summaries.

    abstract::Patient discharge summaries provide detailed medical information about individuals who have been hospitalized. To make a precise and legitimate assessment of the abundant data, a proper time layout of the sequence of relevant events should be compiled and used to drive a patient-specific timeline, which could further ...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2013.09.007

    authors: Chang YC,Dai HJ,Wu JC,Chen JM,Tsai RT,Hsu WL

    update_date:2013-12-01 00:00:00

  • Modeling nurse-patient assignments considering patient acuity and travel distance metrics.

    abstract::Balancing workload among nurses on a hospital unit is important for the satisfaction and safety of nurses and patients. To balance nurse workloads, direct patient care activities, indirect patient care activities, and non-patient care activities that occur throughout a shift must be considered. The layout of a hospita...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2016.10.006

    authors: Acar I,Butt SE

    update_date:2016-12-01 00:00:00

  • Modeling association detection in order to discover compounds to inhibit oral cancer.

    abstract::In the past, algorithms exploiting varying semantics in interactions between biological objects such as genes and diseases have been used in bioinformatics to uncover latent relationships within biological datasets. In this paper, we consider the algorithm Medusa in parallel with binary classification in order to find...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2018.07.005

    authors: Vittal S,Karthikeyan G

    update_date:2018-08-01 00:00:00

  • An automated reasoning framework for translational research.

    abstract::In this paper we propose a novel approach to the design and implementation of knowledge-based decision support systems for translational research, specifically tailored to the analysis and interpretation of data from high-throughput experiments. Our approach is based on a general epistemological model of the scientifi...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2009.11.005

    authors: Riva A,Nuzzo A,Stefanelli M,Bellazzi R

    update_date:2010-06-01 00:00:00

  • A comparison of two methods for retrieving ICD-9-CM data: the effect of using an ontology-based method for handling terminology changes.

    abstract:OBJECTIVE:Most existing controlled terminologies can be characterized as collections of terms, wherein the terms are arranged in a simple list or organized in a hierarchy. These kinds of terminologies are considered useful for standardizing terms and encoding data and are currently used in many existing information sys...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2011.01.005

    authors: Yu AC,Cimino JJ

    update_date:2011-04-01 00:00:00

  • Classification of forensic autopsy reports through conceptual graph-based document representation model.

    abstract::Text categorization has been used extensively in recent years to classify plain-text clinical reports. This study employs text categorization techniques for the classification of open narrative forensic autopsy reports. One of the key steps in text classification is document representation. In document representation,...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2018.04.013

    authors: Mujtaba G,Shuib L,Raj RG,Rajandram R,Shaikh K,Al-Garadi MA

    update_date:2018-06-01 00:00:00

  • Classification models for the prediction of clinicians' information needs.

    abstract:OBJECTIVE:Clinicians face numerous information needs during patient care activities and most of these needs are not met. Infobuttons are information retrieval tools that help clinicians to fulfill their information needs by providing links to on-line health information resources from within an electronic medical record...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2008.07.001

    authors: Del Fiol G,Haug PJ

    update_date:2009-02-01 00:00:00

  • R.A.P.I.D. (Root Aggregated Prioritized Information Display): A single screen display for efficient digital triaging of medical reports.

    abstract:OBJECTIVE:The timely acknowledgement of critical patient clinical reports is vital for the delivery of safe patient care. With current EHR systems, critical reports reside on different screens. This leads to treatment delays and inefficient work flows. As a remedy, the R.A.P.I.D. (Root Aggregated Prioritized Informatio...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article, Randomized Controlled Trial

    doi:10.1016/j.jbi.2016.04.001

    authors: Ford JP,Huang L,Richards DS,Ambinder EP,Rosenberger JL

    update_date:2016-06-01 00:00:00

  • Cadec: A corpus of adverse drug event annotations.

    abstract::CSIRO Adverse Drug Event Corpus (Cadec) is a new rich annotated corpus of medical forum posts on patient-reported Adverse Drug Events (ADEs). The corpus is sourced from posts on social media, and contains text that is largely written in colloquial language and often deviates from formal English grammar and punctuation...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2015.03.010

    authors: Karimi S,Metke-Jimenez A,Kemp M,Wang C

    update_date:2015-06-01 00:00:00

  • Patient empowerment for cancer patients through a novel ICT infrastructure.

    abstract::As a result of recent advances in cancer research and "precision medicine" approaches, i.e. the idea of treating each patient with the right drug at the right time, more and more cancer patients are being cured, or might have to cope with a life with cancer. For many people, cancer survival today means living with a c...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2019.103342

    authors: Kondylakis H,Bucur A,Crico C,Dong F,Graf N,Hoffman S,Koumakis L,Manenti A,Marias K,Mazzocco K,Pravettoni G,Renzi C,Schera F,Triberti S,Tsiknakis M,Kiefer S

    update_date:2020-01-01 00:00:00

  • Automated identification of adverse events related to central venous catheters.

    abstract::Methods for surveillance of adverse events (AEs) in clinical settings are limited by cost, technology, and appropriate data availability. In this study, two methods for semi-automated review of text records within the Veterans Administration database are utilized to identify AEs related to the placement of central ven...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2006.06.003

    authors: Penz JF,Wilcox AB,Hurdle JF

    update_date:2007-04-01 00:00:00

  • Analysis of eligibility criteria representation in industry-standard clinical trial protocols.

    abstract::Previous research on standardization of eligibility criteria and its feasibility has traditionally been conducted on clinical trial protocols from ClinicalTrials.gov (CT). The portability and use of such standardization for full-text industry-standard protocols has not been studied in-depth. Towards this end, in this ...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2013.06.001

    authors: Bhattacharya S,Cantor MN

    update_date:2013-10-01 00:00:00

  • Applying semantic-based probabilistic context-free grammar to medical language processing--a preliminary study on parsing medication sentences.

    abstract::Semantic-based sublanguage grammars have been shown to be an efficient method for medical language processing. However, given the complexity of the medical domain, parsers using such grammars inevitably encounter ambiguous sentences, which could be interpreted by different groups of production rules and consequently r...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2011.08.009

    authors: Xu H,AbdelRahman S,Lu Y,Denny JC,Doan S

    update_date:2011-12-01 00:00:00

  • A tiered approach is more cost effective than traditional pharmacist-based review for classifying computer-detected signals as adverse drug events.

    abstract:OBJECTIVE:To develop a cost-efficient method for identifying adverse drug events (ADEs) and medication errors (MEs) identified using outpatient electronic medical records within ambulatory settings. DESIGN:Comparison of sensitivity and cost of "traditional" pharmacist based approach to identifying ADEs and MEs during ...

    journal_title:Journal of biomedical informatics

    pub_type: Clinical Trial, Journal Article, Multicenter Study

    doi:10.1016/s1532-0464(03)00059-5

    authors: Hope C,Overhage JM,Seger A,Teal E,Mills V,Fiskio J,Gandhi TK,Bates DW,Murray MD

    update_date:2003-02-01 00:00:00

  • Comparison between passive vision-based system and a wearable inertial-based system for estimating temporal gait parameters related to the GAITRite electronic walkway.

    abstract::Quantitative gait analysis allows clinicians to assess the inherent gait variability over time which is a functional marker to aid in the diagnosis of disabilities or diseases such as frailty, the onset of cognitive decline and neurodegenerative diseases, among others. However, despite the accuracy achieved by the cur...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2016.07.009

    authors: González I,López-Nava IH,Fontecha J,Muñoz-Meléndez A,Pérez-SanPablo AI,Quiñones-Urióstegui I

    update_date:2016-08-01 00:00:00

  • A private DNA motif finding algorithm.

    abstract::With the increasing availability of genomic sequence data, numerous methods have been proposed for finding DNA motifs. The discovery of DNA motifs serves a critical step in many biological applications. However, the privacy implication of DNA analysis is normally neglected in the existing methods. In this work, we pro...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2013.12.016

    authors: Chen R,Peng Y,Choi B,Xu J,Hu H

    update_date:2014-08-01 00:00:00

  • Automated annotation and classification of BI-RADS assessment from radiology reports.

    abstract::The Breast Imaging Reporting and Data System (BI-RADS) was developed to reduce variation in the descriptions of findings. Manual analysis of breast radiology report data is challenging but is necessary for clinical and healthcare quality assurance activities. The objective of this study is to develop a natural languag...

    journal_title:Journal of biomedical informatics

    pub_type: Journal Article

    doi:10.1016/j.jbi.2017.04.011

    authors: Castro SM,Tseytlin E,Medvedeva O,Mitchell K,Visweswaran S,Bekhuis T,Jacobson RS

    update_date:2017-05-01 00:00:00