A method for managing re-identification risk from small geographic areas in Canada.

Abstract:

BACKGROUND:A common disclosure control practice for health datasets is to identify small geographic areas and either suppress records from these small areas or aggregate them into larger ones. A recent study provided a method for deciding when an area is too small based on the uniqueness criterion. The uniqueness criterion stipulates that an the area is no longer too small when the proportion of unique individuals on the relevant variables (the quasi-identifiers) approaches zero. However, using a uniqueness value of zero is quite a stringent threshold, and is only suitable when the risks from data disclosure are quite high. Other uniqueness thresholds that have been proposed for health data are 5% and 20%. METHODS:We estimated uniqueness for urban Forward Sortation Areas (FSAs) by using the 2001 long form Canadian census data representing 20% of the population. We then constructed two logistic regression models to predict when the uniqueness is greater than the 5% and 20% thresholds, and validated their predictive accuracy using 10-fold cross-validation. Predictor variables included the population size of the FSA and the maximum number of possible values on the quasi-identifiers (the number of equivalence classes). RESULTS:All model parameters were significant and the models had very high prediction accuracy, with specificity above 0.9, and sensitivity at 0.87 and 0.74 for the 5% and 20% threshold models respectively. The application of the models was illustrated with an analysis of the Ontario newborn registry and an emergency department dataset. At the higher thresholds considerably fewer records compared to the 0% threshold would be considered to be in small areas and therefore undergo disclosure control actions. We have also included concrete guidance for data custodians in deciding which one of the three uniqueness thresholds to use (0%, 5%, 20%), depending on the mitigating controls that the data recipients have in place, the potential invasion of privacy if the data is disclosed, and the motives and capacity of the data recipient to re-identify the data. CONCLUSION:The models we developed can be used to manage the re-identification risk from small geographic areas. Being able to choose among three possible thresholds, a data custodian can adjust the definition of "small geographic area" to the nature of the data and recipient.

authors

El Emam K,Brown A,AbdelMalik P,Neisa A,Walker M,Bottomley J,Roffey T

doi

10.1186/1472-6947-10-18

subject

Has Abstract

pub_date

2010-04-02 00:00:00

pages

18

issn

1472-6947

pii

1472-6947-10-18

journal_volume

10

pub_type

杂志文章
  • Evaluation of syndromic algorithms for detecting patients with potentially transmissible infectious diseases based on computerised emergency-department data.

    abstract:BACKGROUND:The objective of this study was to ascertain the performance of syndromic algorithms for the early detection of patients in healthcare facilities who have potentially transmissible infectious diseases, using computerised emergency department (ED) data. METHODS:A retrospective cohort in an 810-bed University...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-13-101

    authors: Gerbier-Colomban S,Gicquel Q,Millet AL,Riou C,Grando J,Darmoni S,Potinet-Pagliaroli V,Metzger MH

    更新日期:2013-09-03 00:00:00

  • Prediction of blood culture outcome using hybrid neural network model based on electronic health records.

    abstract:BACKGROUND:Blood cultures are often performed to detect patients who has a serious illness without infections and patients with bloodstream infections. Early positive blood culture prediction is important, as bloodstream infections may cause inflammation of the body, even organ failure or death. However, existing work ...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-020-1113-4

    authors: Cheng M,Zhao X,Ding X,Gao J,Xiong S,Ren Y

    更新日期:2020-07-09 00:00:00

  • Optimal sequence of tests for the mediastinal staging of non-small cell lung cancer.

    abstract:BACKGROUND:Non-small cell lung cancer (NSCLC) is the most prevalent type of lung cancer and the most difficult to predict. When there are no distant metastases, the optimal therapy depends mainly on whether there are malignant lymph nodes in the mediastinum. Given the vigorous debate among specialists about which tests...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-016-0246-y

    authors: Luque M,Díez FJ,Disdier C

    更新日期:2016-01-26 00:00:00

  • EDDAMAP: efficient data-dependent approach for monitoring asymptomatic patient.

    abstract:BACKGROUND:A pandemic affects healthcare delivery and consequently leads to socioeconomic complications. During a pandemic, a community where there lives an asymptomatic patient (AP) becomes a potential endemic zone. Assuming we want to monitor the travel and/or activity of an AP in a community where there is a pandemi...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-020-01258-z

    authors: Adu-Gyamfi D,Zhang F,Kwansah Ansah AK

    更新日期:2020-09-29 00:00:00

  • Establishing a baseline for literature mining human genetic variants and their relationships to disease cohorts.

    abstract:BACKGROUND:The Variome corpus, a small collection of published articles about inherited colorectal cancer, includes annotations of 11 entity types and 13 relation types related to the curation of the relationship between genetic variation and disease. Due to the richness of these annotations, the corpus provides a good...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-016-0294-3

    authors: Verspoor KM,Heo GE,Kang KY,Song M

    更新日期:2016-07-18 00:00:00

  • EURISWEB--Web-based epidemiological surveillance of antibiotic-resistant pneumococci in day care centers.

    abstract:BACKGROUND:EURIS (European Resistance Intervention Study) was launched as a multinational study in September of 2000 to identify the multitude of complex risk factors that contribute to the high carriage rate of drug resistant Streptococcus pneumoniae strains in children attending Day Care Centers in several European c...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-3-9

    authors: Silva S,Gouveia-Oliveira R,Maretzek A,Carriço J,Gudnason T,Kristinsson KG,Ekdahl K,Brito-Avô A,Tomasz A,Sanches IS,de Lencastre H,Almeida J

    更新日期:2003-07-08 00:00:00

  • Dual processing model of medical decision-making.

    abstract:BACKGROUND:Dual processing theory of human cognition postulates that reasoning and decision-making can be described as a function of both an intuitive, experiential, affective system (system I) and/or an analytical, deliberative (system II) processing system. To date no formal descriptive model of medical decision-maki...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-12-94

    authors: Djulbegovic B,Hozo I,Beckstead J,Tsalatsanis A,Pauker SG

    更新日期:2012-09-03 00:00:00

  • Information discovery on electronic health records using authority flow techniques.

    abstract:BACKGROUND:As the use of electronic health records (EHRs) becomes more widespread, so does the need to search and provide effective information discovery within them. Querying by keyword has emerged as one of the most effective paradigms for searching. Most work in this area is based on traditional Information Retrieva...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-10-64

    authors: Hristidis V,Varadarajan RR,Biondich P,Weiner M

    更新日期:2010-10-22 00:00:00

  • Comprehensive user requirements engineering methodology for secure and interoperable health data exchange.

    abstract:BACKGROUND:Increased digitalization of healthcare comes along with the cost of cybercrime proliferation. This results to patients' and healthcare providers' skepticism to adopt Health Information Technologies (HIT). In Europe, this shortcoming hampers efficient cross-border health data exchange, which requires a holist...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-018-0664-0

    authors: Natsiavas P,Rasmussen J,Voss-Knude M,Votis Κ,Coppolino L,Campegiani P,Cano I,Marí D,Faiella G,Clemente F,Nalin M,Grivas E,Stan O,Gelenbe E,Dumortier J,Petersen J,Tzovaras D,Romano L,Komnios I,Koutkias V

    更新日期:2018-10-16 00:00:00

  • Factors influencing the implementation of clinical guidelines for health care professionals: a systematic meta-review.

    abstract:BACKGROUND:Nowadays more and more clinical guidelines for health care professionals are being developed. However, this does not automatically mean that these guidelines are actually implemented. The aim of this meta-review is twofold: firstly, to gain a better understanding of which factors affect the implementation of...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章,meta分析,评审

    doi:10.1186/1472-6947-8-38

    authors: Francke AL,Smit MC,de Veer AJ,Mistiaen P

    更新日期:2008-09-12 00:00:00

  • Surface structure feature matching algorithm for cardiac motion estimation.

    abstract:BACKGROUND:Cardiac diseases represent the leading cause of sudden death worldwide. During the development of cardiac diseases, the left ventricle (LV) changes obviously in structure and function. LV motion estimation plays an important role for diagnosis and treatment of cardiac diseases. To estimate LV motion accurate...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-017-0560-z

    authors: Zhang Z,Yang X,Tan C,Guo W,Chen G

    更新日期:2017-12-20 00:00:00

  • Early recognition of multiple sclerosis using natural language processing of the electronic health record.

    abstract:BACKGROUND:Diagnostic accuracy might be improved by algorithms that searched patients' clinical notes in the electronic health record (EHR) for signs and symptoms of diseases such as multiple sclerosis (MS). The focus this study was to determine if patients with MS could be identified from their clinical notes prior to...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-017-0418-4

    authors: Chase HS,Mitrani LR,Lu GG,Fulgieri DJ

    更新日期:2017-02-28 00:00:00

  • HIS-based electronic documentation can significantly reduce the time from biopsy to final report for prostate tumours and supports quality management as well as clinical research.

    abstract:BACKGROUND:Timely and accurate information is important to guide the medical treatment process. We developed, implemented and assessed an order-entry system to support documentation of prostate histologies involving urologists, pathologists and physicians in private practice. METHODS:We designed electronic forms for h...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-9-5

    authors: Breil B,Semjonow A,Dugas M

    更新日期:2009-01-20 00:00:00

  • A hybrid solution for extracting structured medical information from unstructured data in medical records via a double-reading/entry system.

    abstract:BACKGROUND:Healthcare providers generate a huge amount of biomedical data stored in either legacy system (paper-based) format or electronic medical records (EMR) around the world, which are collectively referred to as big biomedical data (BBD). To realize the promise of BBD for clinical use and research, it is an essen...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-016-0357-5

    authors: Luo L,Li L,Hu J,Wang X,Hou B,Zhang T,Zhao LP

    更新日期:2016-08-30 00:00:00

  • Derivation and validation of a search algorithm to retrospectively identify mechanical ventilation initiation in the intensive care unit.

    abstract:BACKGROUND:The development and validation of automated electronic medical record (EMR) search strategies are important for establishing the timing of mechanical ventilation initiation in the intensive care unit (ICU).Thus, we sought to develop and validate an automated EMR search algorithm (strategy) for time zero, the...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-14-55

    authors: Smischney NJ,Velagapudi VM,Onigkeit JA,Pickering BW,Herasevich V,Kashyap R

    更新日期:2014-06-25 00:00:00

  • Data mining EEG signals in depression for their diagnostic value.

    abstract:BACKGROUND:Quantitative electroencephalogram (EEG) is one neuroimaging technique that has been shown to differentiate patients with major depressive disorder (MDD) and non-depressed healthy volunteers (HV) at the group-level, but its diagnostic potential for detecting differences at the individual level has yet to be r...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-015-0227-6

    authors: Mohammadi M,Al-Azab F,Raahemi B,Richards G,Jaworska N,Smith D,de la Salle S,Blier P,Knott V

    更新日期:2015-12-23 00:00:00

  • Addressing health literacy in patient decision aids.

    abstract:BACKGROUND:Effective use of a patient decision aid (PtDA) can be affected by the user's health literacy and the PtDA's characteristics. Systematic reviews of the relevant literature can guide PtDA developers to attend to the health literacy needs of patients. The reviews reported here aimed to assess: METHODS:We revie...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章,评审

    doi:10.1186/1472-6947-13-S2-S10

    authors: McCaffery KJ,Holmes-Rovner M,Smith SK,Rovner D,Nutbeam D,Clayman ML,Kelly-Blake K,Wolf MS,Sheridan SL

    更新日期:2013-01-01 00:00:00

  • An automated pipeline for analyzing medication event reports in clinical settings.

    abstract:BACKGROUND:Medication events in clinical settings are significant threats to patient safety. Analyzing and learning from the medication event reports is an important way to prevent the recurrence of these events. Currently, the analysis of medication event reports is ineffective and requires heavy workloads for clinici...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-018-0687-6

    authors: Zhou S,Kang H,Yao B,Gong Y

    更新日期:2018-12-07 00:00:00

  • Socioeconomic and behavioural factors associated with access to and use of Personal Health Records.

    abstract:BACKGROUND:Access to and use of digital technology are more common among people of more advantaged socioeconomic status. These differences might be due to lack of interest, not having physical access or having lower intentions to use this technology. By integrating the digital divide approach and the User Acceptance of...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-020-01383-9

    authors: Paccoud I,Baumann M,Le Bihan E,Pétré B,Breinbauer M,Böhme P,Chauvel L,Leist AK

    更新日期:2021-01-13 00:00:00

  • Diagnostic support for selected neuromuscular diseases using answer-pattern recognition and data mining techniques: a proof of concept multicenter prospective trial.

    abstract:BACKGROUND:Diagnosis of neuromuscular diseases in primary care is often challenging. Rare diseases such as Pompe disease are easily overlooked by the general practitioner. We therefore aimed to develop a diagnostic support tool using patient-oriented questions and combined data mining algorithms recognizing answer patt...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章,多中心研究

    doi:10.1186/s12911-016-0268-5

    authors: Grigull L,Lechner W,Petri S,Kollewe K,Dengler R,Mehmecke S,Schumacher U,Lücke T,Schneider-Gold C,Köhler C,Güttsches AK,Kortum X,Klawonn F

    更新日期:2016-03-08 00:00:00

  • Comparison of clinical knowledge management capabilities of commercially-available and leading internally-developed electronic health records.

    abstract:BACKGROUND:We have carried out an extensive qualitative research program focused on the barriers and facilitators to successful adoption and use of various features of advanced, state-of-the-art electronic health records (EHRs) within large, academic, teaching facilities with long-standing EHR research and development ...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-11-13

    authors: Sittig DF,Wright A,Meltzer S,Simonaitis L,Evans RS,Nichol WP,Ash JS,Middleton B

    更新日期:2011-02-17 00:00:00

  • Design and evaluation of a mobile application to assist the self-monitoring of the chronic kidney disease in developing countries.

    abstract:BACKGROUND:The chronic kidney disease (CKD) is a worldwide critical problem, especially in developing countries. CKD patients usually begin their treatment in advanced stages, which requires dialysis and kidney transplantation, and consequently, affects mortality rates. This issue is faced by a mobile health (mHealth) ...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-018-0587-9

    authors: Sobrinho A,da Silva LD,Perkusich A,Pinheiro ME,Cunha P

    更新日期:2018-01-12 00:00:00

  • Simulating an emergency department: the importance of modeling the interactions between physicians and delegates in a discrete event simulation.

    abstract:BACKGROUND:Computer simulation studies of the emergency department (ED) are often patient driven and consider the physician as a human resource whose primary activity is interacting directly with the patient. In many EDs, physicians supervise delegates such as residents, physician assistants and nurse practitioners eac...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-13-59

    authors: Lim ME,Worster A,Goeree R,Tarride JÉ

    更新日期:2013-05-22 00:00:00

  • Modeling healthcare authorization and claim submissions using the openEHR dual-model approach.

    abstract:BACKGROUND:The TISS standard is a set of mandatory forms and electronic messages for healthcare authorization and claim submissions among healthcare plans and providers in Brazil. It is not based on formal models as the new generation of health informatics standards suggests. The objective of this paper is to model the...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-11-60

    authors: Dias RD,Cook TW,Freire SM

    更新日期:2011-10-12 00:00:00

  • An attention-based deep learning model for clinical named entity recognition of Chinese electronic medical records.

    abstract:BACKGROUND:Clinical named entity recognition (CNER) is important for medical information mining and establishment of high-quality knowledge map. Due to the different text features from natural language and a large number of professional and uncommon clinical terms in Chinese electronic medical records (EMRs), there are...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-019-0933-6

    authors: Li L,Zhao J,Hou L,Zhai Y,Shi J,Cui F

    更新日期:2019-12-05 00:00:00

  • Comparison of machine learning techniques to predict all-cause mortality using fitness data: the Henry ford exercIse testing (FIT) project.

    abstract:BACKGROUND:Prior studies have demonstrated that cardiorespiratory fitness (CRF) is a strong marker of cardiovascular health. Machine learning (ML) can enhance the prediction of outcomes through classification techniques that classify the data into predetermined categories. The aim of this study is to present an evaluat...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-017-0566-6

    authors: Sakr S,Elshawi R,Ahmed AM,Qureshi WT,Brawner CA,Keteyian SJ,Blaha MJ,Al-Mallah MH

    更新日期:2017-12-19 00:00:00

  • Quantile-based fecal hemoglobin concentration for assessing colorectal neoplasms with 1,263,717 Taiwanese screenees.

    abstract:BACKGROUND:Although fecal hemoglobin concentration (f-Hb) was highly associated with the risk of colorectal neoplasms, current studies on this subject are hampered by skewedness of the data and the ordinal property of f-Hb has not been well studied yet. Our aim was to develop a quantile-based method to estimate adjuste...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-019-0812-1

    authors: Peng SM,Chiu HM,Jen HH,Hsu CY,Chen SL,Chiu SY,Yen AM,Fann JC,Lee YC,Chen HH

    更新日期:2019-05-02 00:00:00

  • CardioNet: a manually curated database for artificial intelligence-based research on cardiovascular diseases.

    abstract:BACKGROUND:Cardiovascular diseases (CVDs) are difficult to diagnose early and have risk factors that are easy to overlook. Early prediction and personalization of treatment through the use of artificial intelligence (AI) may help clinicians and patients manage CVDs more effectively. However, to apply AI approaches to C...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-021-01392-2

    authors: Ahn I,Na W,Kwon O,Yang DH,Park GM,Gwon H,Kang HJ,Jeong YU,Yoo J,Kim Y,Jun TJ,Kim YH

    更新日期:2021-01-28 00:00:00

  • Data cleaning process for HIV-indicator data extracted from DHIS2 national reporting system: a case study of Kenya.

    abstract:BACKGROUND:The District Health Information Software-2 (DHIS2) is widely used by countries for national-level aggregate reporting of health-data. To best leverage DHIS2 data for decision-making, countries need to ensure that data within their systems are of the highest quality. Comprehensive, systematic, and transparent...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/s12911-020-01315-7

    authors: Gesicho MB,Were MC,Babic A

    更新日期:2020-11-13 00:00:00

  • Development of a validation algorithm for 'present on admission' flagging.

    abstract:BACKGROUND:The use of routine hospital data for understanding patterns of adverse outcomes has been limited in the past by the fact that pre-existing and post-admission conditions have been indistinguishable. The use of a 'Present on Admission' (or POA) indicator to distinguish pre-existing or co-morbid conditions from...

    journal_title:BMC medical informatics and decision making

    pub_type: 杂志文章

    doi:10.1186/1472-6947-9-48

    authors: Jackson TJ,Michel JL,Roberts R,Shepheard J,Cheng D,Rust J,Perry C

    更新日期:2009-12-01 00:00:00