Abstract:
BACKGROUND:Word embeddings are dense numeric vectors used to represent language in neural networks. Until recently, there had been no publicly released embeddings trained on clinical data. Our work is the first to study the privacy implications of releasing these models. OBJECTIVE:This paper aims to demonstrate that traditional word embeddings created on clinical corpora that have been deidentified by removing personal health information (PHI) can nonetheless be exploited to reveal sensitive patient information. METHODS:We used embeddings created from 400,000 doctor-written consultation notes and experimented with 3 common word embedding methods to explore the privacy-preserving properties of each. RESULTS:We found that if publicly released embeddings are trained from a corpus anonymized by PHI removal, it is possible to reconstruct up to 68.5% (n=411/600) of the full names that remain in the deidentified corpus and associated sensitive information to specific patients in the corpus from which the embeddings were created. We also found that the distance between the word vector representation of a patient's name and a diagnostic billing code is informative and differs significantly from the distance between the name and a code not billed for that patient. CONCLUSIONS:Special care must be taken when sharing word embeddings created from clinical texts, as current approaches may compromise patient privacy. If PHI removal is used for anonymization before traditional word embeddings are trained, it is possible to attribute sensitive information to patients who have not been fully deidentified by the (necessarily imperfect) removal algorithms. A promising alternative (ie, anonymization by PHI replacement) may avoid these flaws. Our results are timely and critical, as an increasing number of researchers are pushing for publicly available health data.
journal_name
J Med Internet Resjournal_title
Journal of medical Internet researchauthors
Abdalla M,Abdalla M,Hirst G,Rudzicz Fdoi
10.2196/18055subject
Has Abstractpub_date
2020-07-15 00:00:00pages
e18055issue
7eissn
1439-4456issn
1438-8871pii
v22i7e18055journal_volume
22pub_type
杂志文章abstract:BACKGROUND:The majority of workers, regardless of age or occupational status, report engaging in personal Internet use in the workplace. There is little understanding of the impact that personal Internet use may have on patient care in acute clinical settings. OBJECTIVE:The objective of this study was to investigate t...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.2421
更新日期:2013-05-17 00:00:00
abstract:BACKGROUND:Despite the amount of online health information, there are several barriers that limit the Internet's adoption as a source of health information. One of these barriers is highlighted in conceptualizations of the digital divide which include the differential possession of Internet skills, or "eHealth literacy...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.1581
更新日期:2011-04-29 00:00:00
abstract:BACKGROUND:A majority of adolescents report the use of some form of social media, and many prefer to communicate via social networking sites. Social media may offer new opportunities in diabetes management, particularly in terms of how health care teams provide tailored support and treatment to adolescents with diabete...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/12149
更新日期:2019-05-30 00:00:00
abstract:BACKGROUND:Artificial intelligence (AI) is developing quickly in the medical field and can benefit both medical staff and patients. The clinical decision support system Watson for Oncology (WFO) is an outstanding representative AI in the medical field, and it can provide to cancer patients prompt treatment recommendati...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/11087
更新日期:2018-09-25 00:00:00
abstract:BACKGROUND:Health care providers do not routinely carry out brief counseling for tobacco cessation despite the evidence for its effectiveness. For this intervention to be routinely used, it must be brief, be convenient, require little investment of resources, require little specialized training, and be perceived as eff...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.2074
更新日期:2012-12-03 00:00:00
abstract:BACKGROUND:Existing influenza surveillance in the United States is focused on the collection of data from sentinel physicians and hospitals; however, the compilation and distribution of reports are usually delayed by up to 2 weeks. With the popularity of social media growing, the Internet is a source for syndromic surv...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.3532
更新日期:2014-11-14 00:00:00
abstract:BACKGROUND:Low participation rates are one of the most serious disadvantages of Web-based studies. It is necessary to develop effective strategies to improve participation rates to obtain sufficient data. OBJECTIVE:The objective of this trial was to investigate the effect of emphasizing the incentive in the subject li...
journal_title:Journal of medical Internet research
pub_type: 杂志文章,随机对照试验
doi:10.2196/jmir.8561
更新日期:2018-02-08 00:00:00
abstract:BACKGROUND:Overweight and obesity is a significant public health problem that impacts a large number of children globally. Supporting childcare centers to deliver healthy eating and physical activity-promoting policies and practices is a recommended strategy for obesity prevention, given that such services provide acce...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.3639
更新日期:2015-04-30 00:00:00
abstract:BACKGROUND:Advance care planning (ACP) is a process with the overall aim to enhance care in concordance with patients' preferences. Key elements of ACP are to enable persons to define goals and preferences for future medical treatment and care, to discuss these with family and health care professionals, and to document...
journal_title:Journal of medical Internet research
pub_type: 杂志文章,评审
doi:10.2196/15578
更新日期:2020-03-17 00:00:00
abstract:BACKGROUND:Self-guided internet-based cognitive behavioral therapies (iCBTs) for depressive symptoms may substantially increase accessibility to mental health treatment. Despite this, questions remain as to the generalizability of the research on self-guided iCBT. OBJECTIVE:We sought to describe the clinical entry cri...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/10113
更新日期:2018-11-09 00:00:00
abstract:BACKGROUND:Men who use the Internet to seek sex with other men (MISM) are increasingly using the Internet to find sexual health information and to seek sexual partners, with some research suggesting HIV transmission is associated with sexual partnering online. Aiming to "meet men where they are at," some AIDS service o...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.4503
更新日期:2015-12-09 00:00:00
abstract:BACKGROUND:Many systematic reviews exist on the use of remote patient monitoring (RPM) interventions to improve clinical outcomes and psychological well-being of patients with heart failure. However, research is broadly distributed from simple telephone-based to complex technology-based interventions. The scope and foc...
journal_title:Journal of medical Internet research
pub_type: 杂志文章,评审
doi:10.2196/jmir.6571
更新日期:2017-01-20 00:00:00
abstract:BACKGROUND:Health risk assessments are becoming more popular as a tool to conveniently and effectively reach community-dwelling adults who may be at risk for serious chronic conditions such as coronary heart disease (CHD). The use of such instruments to improve adults' risk factor awareness and concordance with clinica...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.2369
更新日期:2014-04-18 00:00:00
abstract:BACKGROUND:Outbreaks of human infection with a new avian influenza A H7N9 virus occurred in China in the spring of 2013. Control and prevention of a new human infectious disease outbreak can be strongly affected by public reaction and social impact through the Internet and social media. OBJECTIVE:This study aimed to i...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.2911
更新日期:2014-01-17 00:00:00
abstract:BACKGROUND:Chronic obstructive pulmonary disease (COPD) is now the fourth leading cause of death in the world, and it continues to increase in developing countries. The World Health Organization expects COPD to be the third most common cause of death in the world by 2020. Effective and continuous postdischarge care can...
journal_title:Journal of medical Internet research
pub_type: 杂志文章,随机对照试验
doi:10.2196/jmir.6743
更新日期:2017-07-21 00:00:00
abstract::During the last decades a variety of telemedicine applications have been trialed worldwide. However, telemedicine is still an example of major potential benefits that have not been fully attained. Health care regulators are still debating why institutionalizing telemedicine applications on a large scale has been so di...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.1669
更新日期:2011-09-28 00:00:00
abstract:BACKGROUND:Improving persuasion in response to vaccine skepticism is a long-standing problem. Elective nonvaccination emerging from skepticism about vaccine safety and efficacy jeopardizes herd immunity, exposing those who are most vulnerable to the risk of serious diseases. OBJECTIVE:This article analyzes vaccine sen...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/19504
更新日期:2020-12-04 00:00:00
abstract:BACKGROUND:More than 35% of American adults are obese. For African American and Hispanic adults, as well as individuals residing in poorer or more racially segregated urban neighborhoods, the likelihood of obesity is even higher. Information and communication technologies (ICTs) may substitute for or complement communi...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.5741
更新日期:2016-06-28 00:00:00
abstract:BACKGROUND:Three-dimensional scans are increasingly used to quantify biological topographical changes and clinical health outcomes. Traditionally, the use of 3D scans has been limited to specialized centers owing to the high cost of the scanning equipment and the necessity for complex analysis software. Technological a...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/17150
更新日期:2020-11-27 00:00:00
abstract::Wearable sensor technology could have an important role for clinical research and in delivering health care. Accordingly, such technology should undergo rigorous evaluation prior to market launch, and its performance should be supported by evidence-based marketing claims. Many studies have been published attempting to...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/10108
更新日期:2018-07-02 00:00:00
abstract:BACKGROUND:Attending to the wide range of communication behaviors that convey empathy is an important but often underemphasized concept to reduce errors in care, improve patient satisfaction, and improve cancer patient outcomes. A virtual human (VH)-based simulation, MPathic-VR, was developed to train health care provi...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/15459
更新日期:2019-11-27 00:00:00
abstract:BACKGROUND:Decision support systems based on reinforcement learning (RL) have been implemented to facilitate the delivery of personalized care. This paper aimed to provide a comprehensive review of RL applications in the critical care setting. OBJECTIVE:This review aimed to survey the literature on RL applications for...
journal_title:Journal of medical Internet research
pub_type: 杂志文章,评审
doi:10.2196/18477
更新日期:2020-07-20 00:00:00
abstract:BACKGROUND:Web-based computer-tailored interventions for multiple health behaviors can improve the strength of behavior habits in people who want to reduce their cardiovascular risk. Nonetheless, few randomized controlled trials have tested this assumption to date. OBJECTIVE:The study aim was to test an 8-week Web-bas...
journal_title:Journal of medical Internet research
pub_type: 杂志文章,随机对照试验
doi:10.2196/jmir.5147
更新日期:2016-04-11 00:00:00
abstract:BACKGROUND:Nutrigenomics forms the basis of personalized nutrition by customizing an individual's dietary plan based on the integration of life stage, current health status, and genome information. Some common genes that are included in nutrition-based multigene test panels include CYP1A2 (rate of caffeine break down),...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/12580
更新日期:2019-06-28 00:00:00
abstract:BACKGROUND:Pollen allergies affect a significant proportion of the population globally. At present, Web-based tools such as pollen diaries and mobile apps allow for easy and fast documentation of allergic symptoms via the internet. OBJECTIVE:This study aimed to characterize the users of the Patient's Hayfever Diary (P...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/16767
更新日期:2020-02-21 00:00:00
abstract:BACKGROUND:A previous study among Antwerp college and university students showed that more male (10.2%-11.1%) than female (1.8%-6.2%) students are at risk for problematic alcohol use. The current literature shows promising results in terms of feasibility and effectiveness for the use of brief electronic interventions t...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.1869
更新日期:2012-04-23 00:00:00
abstract:BACKGROUND:Self-reported medical history information is included in many studies. However, data on the validity of Web-based questionnaires assessing medical history are scarce. If proven to be valid, Web-based questionnaires may provide researchers with an efficient means to collect data on this parameter in large pop...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.3847
更新日期:2015-06-16 00:00:00
abstract::Artificial intelligence (AI) is seen as a strategic lever to improve access, quality, and efficiency of care and services and to build learning and value-based health systems. Many studies have examined the technical performance of AI within an experimental context. These studies provide limited insights into the issu...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/17707
更新日期:2020-07-07 00:00:00
abstract:BACKGROUND:Suicidal thoughts are common among young people presenting to face-to-face and online mental health services. The early detection and rapid response to these suicidal thoughts and other suicidal behaviors is a priority for suicide prevention and early intervention efforts internationally. Establishing how be...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.7897
更新日期:2017-07-12 00:00:00
abstract:BACKGROUND:Pollen forecasts are highly valuable for allergen avoidance and thus raising the quality of life of persons concerned by pollen allergies. They are considered as valuable free services for the public. Careful scientific evaluation of pollen forecasts in terms of accurateness and reliability has not been avai...
journal_title:Journal of medical Internet research
pub_type: 杂志文章
doi:10.2196/jmir.7426
更新日期:2017-05-08 00:00:00