Abstract:
BACKGROUND:Automated disease code classification using free-text medical information is important for public health surveillance. However, traditional natural language processing (NLP) pipelines are limited, so we propose a method combining word embedding with a convolutional neural network (CNN). OBJECTIVE:Our objective was to compare the performance of traditional pipelines (NLP plus supervised machine learning models) with that of word embedding combined with a CNN in conducting a classification task identifying International Classification of Diseases, Tenth Revision, Clinical Modification (ICD-10-CM) diagnosis codes in discharge notes. METHODS:We used 2 classification methods: (1) extracting from discharge notes some features (terms, n-gram phrases, and SNOMED CT categories) that we used to train a set of supervised machine learning models (support vector machine, random forests, and gradient boosting machine), and (2) building a feature matrix, by a pretrained word embedding model, that we used to train a CNN. We used these methods to identify the chapter-level ICD-10-CM diagnosis codes in a set of discharge notes. We conducted the evaluation using 103,390 discharge notes covering patients hospitalized from June 1, 2015 to January 31, 2017 in the Tri-Service General Hospital in Taipei, Taiwan. We used the receiver operating characteristic curve as an evaluation measure, and calculated the area under the curve (AUC) and F-measure as the global measure of effectiveness. RESULTS:In 5-fold cross-validation tests, our method had a higher testing accuracy (mean AUC 0.9696; mean F-measure 0.9086) than traditional NLP-based approaches (mean AUC range 0.8183-0.9571; mean F-measure range 0.5050-0.8739). A real-world simulation that split the training sample and the testing sample by date verified this result (mean AUC 0.9645; mean F-measure 0.9003 using the proposed method). Further analysis showed that the convolutional layers of the CNN effectively identified a large number of keywords and automatically extracted enough concepts to predict the diagnosis codes. CONCLUSIONS:Word embedding combined with a CNN showed outstanding performance compared with traditional methods, needing very little data preprocessing. This shows that future studies will not be limited by incomplete dictionaries. A large amount of unstructured information from free-text medical writing will be extracted by automated approaches in the future, and we believe that the health care field is about to enter the age of big data.
journal_name
J Med Internet Resjournal_title
Journal of medical Internet researchauthors
Lin C,Hsu CJ,Lou YS,Yeh SJ,Lee CC,Su SL,Chen HCdoi
10.2196/jmir.8344subject
Has Abstractpub_date
2017-11-06 00:00:00pages
e380issue
11eissn
1439-4456issn
1438-8871pii
v19i11e380journal_volume
19pub_type
杂志文章abstract:BACKGROUND:Intimate partner violence (IPV) is a major public health concern. eHealth interventions may reduce exposure to violence and health-related consequences as the technology provides a safe and flexible space for the target population. However, the evidence is unclear. OBJECTIVE:The goal of the review is to exa...
journal_title:Journal of medical Internet research
pub_type: 杂志文章,评审
doi:10.2196/22361
更新日期:2020-12-11 00:00:00
abstract:BACKGROUND:The peer-led, social media-delivered intervention is an emerging method in sexual health promotion. However, no research has yet investigated its effectiveness as compared with other online channels or in an Asian population. OBJECTIVE:The objective of this study is to compare a peer-led, social media-deliv...
journal_title:Journal of medical Internet research
pub_type: 杂志文章,随机对照试验
doi:10.2196/jmir.7403
更新日期:2017-08-09 00:00:00
abstract:BACKGROUND:eHealth is widely used as a tool for improving health care delivery and information. However, distinct policies and strategies are required for its proper implementation and integration at national and international levels. OBJECTIVE:To determine the scope of policy issues faced by individuals, institutions...
journal_title:Journal of medical Internet research
pub_type: 杂志文章,评审
doi:10.2196/jmir.1633
更新日期:2012-02-17 00:00:00
abstract:BACKGROUND:The Librarian Infobutton Tailoring Environment (LITE) is a Web-based knowledge capture, management, and configuration tool with which users can build profiles used by OpenInfobutton, an open source infobutton manager, to provide electronic health record users with context-relevant links to online knowledge r...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.4281
更新日期:2015-11-30 00:00:00
abstract:BACKGROUND:The Dutch Ministry of Health has formulated ambitious goals concerning the use of telehealth, leading to subsequent changes compared with the current health care situation, in which 93% of care is delivered face-to-face. Since most care is delivered to older people, the prospect of telehealth raises the ques...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.8407
更新日期:2018-04-06 00:00:00
abstract:BACKGROUND:Although "infodemiological" methods have been used in research on coronavirus disease (COVID-19), an examination of the extent of infodemic moniker (misinformation) use on the internet remains limited. OBJECTIVE:The aim of this paper is to investigate internet search behaviors related to COVID-19 and examin...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/20673
更新日期:2020-08-25 00:00:00
abstract:BACKGROUND:Clinical reasoning is based on the declarative and procedural knowledge of workflows in clinical medicine. Educational approaches such as problem-based learning or mannequin simulators support learning of procedural knowledge. Immersive patient simulators (IPSs) go one step further as they allow an illusiona...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.5035
更新日期:2015-11-17 00:00:00
abstract:BACKGROUND:Despite the widespread use and advancements of mobile technology that facilitate rich communication modes, there is little evidence demonstrating the value of smartphones for effective interclinician communication and knowledge processes. OBJECTIVE:The objective of this study was to determine the effects of...
journal_title:Journal of medical Internet research
pub_type: 杂志文章,随机对照试验
doi:10.2196/jmir.2758
更新日期:2013-11-27 00:00:00
abstract:BACKGROUND:Crowdsourcing contests (also called innovation challenges, innovation contests, and inducement prize contests) can be used to solicit multisectoral feedback on health programs and design public health campaigns. They consist of organizing a steering committee, soliciting contributions, engaging the community...
journal_title:Journal of medical Internet research
pub_type: 杂志文章,评审
doi:10.2196/jmir.8226
更新日期:2018-03-09 00:00:00
abstract:BACKGROUND:The British Columbia Centre for Disease Control implemented a comprehensive Web-based testing service GetCheckedOnline (GCO) in September 2014 in Vancouver, Canada. GCO's objectives are to increase testing for sexually transmitted and blood-borne infections (STBBIs), reach high-prevalence populations facing ...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.7097
更新日期:2017-03-20 00:00:00
abstract:BACKGROUND:Antimicrobial resistance has reached globally alarming levels and is becoming a major public health threat. Lack of efficacious antimicrobial resistance surveillance systems was identified as one of the causes of increasing resistance, due to the lag time between new resistances and alerts to care providers....
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.2043
更新日期:2012-05-29 00:00:00
abstract:BACKGROUND:Oral hygiene care is of key importance among stroke patients to prevent complications that may compromise rehabilitation or potentially give rise to life-threatening infections such as aspiration pneumonia. OBJECTIVE:The aim of this study was to evaluate the effectiveness of a Web-based continuing professio...
journal_title:Journal of medical Internet research
pub_type: 杂志文章,多中心研究,随机对照试验
doi:10.2196/jmir.7024
更新日期:2017-03-31 00:00:00
abstract:BACKGROUND:The fact that patient satisfaction with primary care clinical practices and physician-patient communications has decreased gradually has brought a new opportunity to the online channel as a supplementary service to provide additional information. OBJECTIVE:In this study, our objectives were to examine the p...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.1574
更新日期:2011-11-02 00:00:00
abstract:BACKGROUND:The past few decades saw considerable advances in research and dissemination of evidence-based psychotherapies, yet available treatment resources are not able to meet the high need for care for individuals suffering from depression or anxiety. Blended care psychotherapy, which combines the strengths of thera...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/18723
更新日期:2020-07-06 00:00:00
abstract:BACKGROUND:There has been a rapid rise in the popularity of electronic cigarettes (e-cigarettes) over the last decade, with growth predicted to continue. The uptake of these devices has escalated despite inconclusive evidence of their efficacy as a smoking cessation device and unknown long-term health consequences. As ...
journal_title:Journal of medical Internet research
pub_type: 杂志文章,评审
doi:10.2196/11953
更新日期:2019-02-05 00:00:00
abstract:BACKGROUND:In the United States, there is a national shortage of organs donated for transplant. Among the solid organs, most often kidneys are donated by living donors, but the lack of information and complicated processes limit the number of individuals who serve as living kidney donors. Social media can be a tool for...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.6176
更新日期:2016-12-20 00:00:00
abstract:BACKGROUND:For the last decade, mHealth has constantly expanded as a part of eHealth. Mobile applications for health have the potential to target heterogeneous audiences and address specific needs in different situations, with diverse outcomes, and to complement highly developed health care technologies. The market is ...
journal_title:Journal of medical Internet research
pub_type: 杂志文章,评审
doi:10.2196/jmir.2430
更新日期:2013-05-21 00:00:00
abstract:BACKGROUND:Sleepio is a proven digital sleep improvement program based on cognitive behavioral therapy techniques. Users have the option to join an online community that includes weekly expert discussions, peer-to-peer discussion forums, and personal message walls. OBJECTIVE:The aim of this study was to conduct an onl...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.5654
更新日期:2016-04-25 00:00:00
abstract:BACKGROUND:Human immunodeficiency virus (HIV) is a serious health problem in the Russian Federation. However, the true scale of HIV in Russia has long been the subject of considerable debate. Using digital surveillance to monitor diseases has become increasingly popular in high income countries. But Internet users may ...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.2936
更新日期:2013-11-12 00:00:00
abstract:BACKGROUND:Evaluation of online health interventions should investigate the function of theoretical mechanisms of behavior change in this new milieu. OBJECTIVES:To expand our understanding of how Web-based interventions influence behavior, we examined how changes at 6 months in participants' psychosocial characteristi...
journal_title:Journal of medical Internet research
pub_type: 杂志文章,随机对照试验
doi:10.2196/jmir.1614
更新日期:2011-03-04 00:00:00
abstract:BACKGROUND:Men continue to smoke in greater numbers than women; however, few interventions have been developed and tested to support men's cessation. Men tend to rely on quitting strategies associated with stereotypical manliness, such as willpower, stoicism, and independence, but they may lack the self-efficacy skills...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.4491
更新日期:2015-08-10 00:00:00
abstract:BACKGROUND:The COVID-19 outbreak has affected the lives of millions of people by causing a dramatic impact on several healthcare systems and the global economy. This devastating pandemic has brought communities across the globe to work on this issue in an unprecedented manner. OBJECTIVE:This case study describes the s...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/25283
更新日期:2021-01-16 00:00:00
abstract:BACKGROUND:Psychosocial problems such as depression, anxiety, and substance abuse are common and burdensome in young people. In New Zealand, screening for such problems is undertaken routinely only with year 9 students in low-decile schools and opportunistically in pediatric settings using a nonvalidated and time-consu...
journal_title:Journal of medical Internet research
pub_type: 杂志文章,随机对照试验
doi:10.2196/13911
更新日期:2019-12-03 00:00:00
abstract:BACKGROUND:Many health information providers on the Internet and doctors with email accounts are confronted with the phenomenon of receiving unsolicited emails from patients asking for medical advice. Also, a growing number of websites offer "ask-the-doctor" services, where patients can ask questions to health professi...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.2.1.e1
更新日期:2000-01-01 00:00:00
abstract:BACKGROUND:The police attend numerous domestic violence events each year, recording details of these events as both structured (coded) data and unstructured free-text narratives. Abuse types (including physical, psychological, emotional, and financial) conducted by persons of interest (POIs) along with any injuries sus...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/13067
更新日期:2019-03-12 00:00:00
abstract:BACKGROUND:The rate of smoking commercial tobacco products among American Indian youth is double the rate for white youth. Interventions are needed to reduce this disparity. OBJECTIVE:To test the feasibility of a Web-based intervention to influence attitudes toward and intentions about smoking cigarettes among America...
journal_title:Journal of medical Internet research
pub_type: 杂志文章,随机对照试验
doi:10.2196/jmir.1682
更新日期:2012-06-01 00:00:00
abstract:BACKGROUND:The quality of physician-patient communication is a critical factor influencing treatment outcomes and patient satisfaction with care. To date, there is little research to document the effect of telemedicine (TM) on physician-patient communication. OBJECTIVE:The objectives of this study are to measure and d...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.1193
更新日期:2009-09-30 00:00:00
abstract:BACKGROUND:An increasing number of people have access to the Internet, and more people are seeking tobacco cessation resources online every year. Despite the proliferation of various online interventions and their evident acceptance and reach, little research has addressed their impact in the real world. Typically, low...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.9.4.e28
更新日期:2007-09-30 00:00:00
abstract:BACKGROUND:The evaluation of web-based interventions (defined as an intervention that can be downloaded or accessed on the internet through a web browser) in randomized controlled trials (RCTs) has increased over the past two decades. Little is known about how participants' use of the intervention is measured, reported...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/15474
更新日期:2020-04-16 00:00:00
abstract:BACKGROUND:As the use of digital media for health promotion has become increasingly common, descriptive studies exploring current and innovative marketing strategies can enhance the understanding of effective strategies and best practices. OBJECTIVE:This study aims to describe the implementation of a provincial digita...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/11534
更新日期:2019-02-01 00:00:00