Data cleaning process for HIV-indicator data extracted from DHIS2 national reporting system: a case study of Kenya.

Abstract:

BACKGROUND: The District Health Information Software-2 (DHIS2) is widely used by countries for national-level aggregate reporting of health data. To best leverage DHIS2 data for decision-making, countries need to ensure that data within their systems are of the highest quality. Comprehensive, systematic, and transparent data cleaning approaches form a core component of preparing DHIS2 data for analyses. Unfortunately, there is a paucity of exhaustive and systematic descriptions of data cleaning processes employed on DHIS2-based data. The aim of this study was to report on the methods and results of a systematic and replicable data cleaning approach applied to HIV data gathered within DHIS2 from 2011 to 2018 in Kenya, for secondary analyses. METHODS: Six programmatic area reports containing HIV indicators were extracted from DHIS2 for all care facilities in all counties in Kenya from 2011 to 2018. Data variables extracted included reporting rate, reporting timeliness, and HIV-indicator data elements per facility per year. In total, 93,179 facility-records from 11,446 health facilities were extracted for the years 2011 to 2018. Van den Broeck et al.'s framework, involving repeated cycles of a three-phase process (data screening, data diagnosis and data treatment), was employed semi-automatically within a generic five-step data-cleaning sequence, which was developed and applied in cleaning the extracted data. Various quality issues were identified, and Friedman analysis of variance was conducted to examine differences in the distribution of records with selected issues across the eight years. RESULTS: Facility-records with no data accounted for 50.23% of all records and were removed. Of the remaining records, 0.03% had reporting rates over 100%. Of facility-records with reporting data, 0.66% and 0.46% were retained for the voluntary medical male circumcision and blood safety programmatic area reports respectively, given that few facilities submitted data or offered these services. The distribution of facility-records with selected quality issues varied significantly by programmatic area (p < 0.001). The final clean dataset obtained was suitable for subsequent secondary analyses. CONCLUSIONS: Comprehensive, systematic, and transparent reporting of the cleaning process is important for the validity of research studies as well as for data utilization. The semi-automatic procedures used resulted in improved data quality for use in secondary analyses, which could not be secured by automated procedures alone.
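The screening phase described in the abstract (removing facility-records with no data and identifying reporting rates over 100%) can be illustrated with a minimal Python sketch. This is not the authors' code: the `FacilityRecord` structure, its field names, and the sample values are hypothetical, and the sketch only flags records with reporting rates over 100% because the abstract does not state how they were treated.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class FacilityRecord:
    """One facility-year row (hypothetical layout, not the DHIS2 export format)."""
    facility_id: str
    year: int
    reporting_rate: Optional[float]        # percent; None when nothing was reported
    indicator_values: List[Optional[int]]  # HIV-indicator data elements; None = blank


def has_no_data(rec: FacilityRecord) -> bool:
    """A record is empty when it has no reporting rate and no indicator values."""
    return rec.reporting_rate is None and all(v is None for v in rec.indicator_values)


def screen(records: List[FacilityRecord]) -> Tuple[List[FacilityRecord], List[FacilityRecord]]:
    """Drop empty records, then flag the kept records with a reporting rate over 100%."""
    kept = [r for r in records if not has_no_data(r)]
    flagged = [r for r in kept if r.reporting_rate is not None and r.reporting_rate > 100]
    return kept, flagged


def share(part: list, whole: list) -> float:
    """Percentage of `whole` made up by `part` (0.0 for an empty `whole`)."""
    return 100.0 * len(part) / len(whole) if whole else 0.0


# Invented sample data, for illustration only.
records = [
    FacilityRecord("A", 2011, None, [None, None]),
    FacilityRecord("B", 2011, 100.0, [12, 3]),
    FacilityRecord("C", 2012, 120.0, [5, None]),
    FacilityRecord("D", 2012, None, []),
]
kept, flagged = screen(records)
print(f"kept {len(kept)} of {len(records)} records; "
      f"{share(flagged, kept):.2f}% of kept records exceed a 100% reporting rate")
```

Flagged records are returned separately rather than deleted, so that a reviewer can diagnose them before any treatment is applied, in keeping with the semi-automatic approach the abstract describes.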

authors

Gesicho MB,Were MC,Babic A

doi

10.1186/s12911-020-01315-7

subject

Has Abstract

pub_date

2020-11-13 00:00:00

pages

293

issue

1

issn

1472-6947

pii

10.1186/s12911-020-01315-7

journal_volume

20

pub_type

Journal Article
  • The caCORE Software Development Kit: streamlining construction of interoperable biomedical information services.

    abstract:BACKGROUND:Robust, programmatically accessible biomedical information services that syntactically and semantically interoperate with other resources are challenging to construct. Such systems require the adoption of common information models, data representations and terminology standards as well as documented applicat...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/1472-6947-6-2

    authors: Phillips J,Chilukuri R,Fragoso G,Warzel D,Covitz PA

    update_date:2006-01-06 00:00:00

  • A predictive model for the early identification of patients at risk for a prolonged intensive care unit length of stay.

    abstract:BACKGROUND:Patients with a prolonged intensive care unit (ICU) length of stay account for a disproportionate amount of resource use. Early identification of patients at risk for a prolonged length of stay can lead to quality enhancements that reduce ICU stay. This study developed and validated a model that identifies p...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/1472-6947-10-27

    authors: Kramer AA,Zimmerman JE

    update_date:2010-05-13 00:00:00

  • CardioNet: a manually curated database for artificial intelligence-based research on cardiovascular diseases.

    abstract:BACKGROUND:Cardiovascular diseases (CVDs) are difficult to diagnose early and have risk factors that are easy to overlook. Early prediction and personalization of treatment through the use of artificial intelligence (AI) may help clinicians and patients manage CVDs more effectively. However, to apply AI approaches to C...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/s12911-021-01392-2

    authors: Ahn I,Na W,Kwon O,Yang DH,Park GM,Gwon H,Kang HJ,Jeong YU,Yoo J,Kim Y,Jun TJ,Kim YH

    update_date:2021-01-28 00:00:00

  • Precursor-induced conditional random fields: connecting separate entities by induction for improved clinical named entity recognition.

    abstract:BACKGROUND:This paper presents a conditional random fields (CRF) method that enables the capture of specific high-order label transition factors to improve clinical named entity recognition performance. Consecutive clinical entities in a sentence are usually separated from each other, and the textual descriptions in cl...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/s12911-019-0865-1

    authors: Lee W,Choi J

    update_date:2019-07-15 00:00:00

  • A cohort study of a tailored web intervention for preconception care.

    abstract:BACKGROUND:Preconception care may be an efficacious tool to reduce risk factors for adverse pregnancy outcomes that are associated with lifestyles and health status before pregnancy. We conducted a web-based cohort study in Italian women planning a pregnancy to assess whether a tailored web intervention may change know...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/1472-6947-14-33

    authors: Agricola E,Pandolfi E,Gonfiantini MV,Gesualdo F,Romano M,Carloni E,Mastroiacovo P,Tozzi AE

    update_date:2014-04-15 00:00:00

  • A predictive analytics model for differentiating between transient ischemic attacks (TIA) and its mimics.

    abstract:BACKGROUND:Transient ischemic attack (TIA) is a brief episode of neurological dysfunction resulting from cerebral ischemia not associated with permanent cerebral infarction. TIA is associated with high diagnostic errors because of the subjective nature of findings and the lack of clinical and imaging biomarkers. The go...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/s12911-020-01154-6

    authors: Stanciu A,Banciu M,Sadighi A,Marshall KA,Holland NR,Abedi V,Zand R

    update_date:2020-06-18 00:00:00

  • Towards computerizing intensive care sedation guidelines: design of a rule-based architecture for automated execution of clinical guidelines.

    abstract:BACKGROUND:Computerized ICUs rely on software services to convey the medical condition of their patients as well as assisting the staff in taking treatment decisions. Such services are useful for following clinical guidelines quickly and accurately. However, the development of services is often time-consuming and error...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/1472-6947-10-3

    authors: Ongenae F,De Backere F,Steurbaut K,Colpaert K,Kerckhove W,Decruyenaere J,De Turck F

    update_date:2010-01-18 00:00:00

  • Decision-making in percutaneous coronary intervention: a survey.

    abstract:BACKGROUND:Few researchers have examined the perceptions of physicians referring cases for angiography regarding the degree to which collaboration occurs during percutaneous coronary intervention (PCI) decision-making. We sought to determine perceptions of physicians concerning their involvement in PCI decisions in cas...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/1472-6947-8-28

    authors: Rahilly-Tierney CR,Nash IS

    update_date:2008-06-25 00:00:00

  • Brain mapping and detection of functional patterns in fMRI using wavelet transform; application in detection of dyslexia.

    abstract:BACKGROUND:Functional Magnetic Resonance Imaging (fMRI) has been proven to be useful for studying brain functions. However, due to the existence of noise and distortion, mapping between the fMRI signal and the actual neural activity is difficult. Because of the difficulty, differential pattern analysis of fMRI brain im...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/1472-6947-9-S1-S6

    authors: Ji SY,Ward K,Najarian K

    update_date:2009-11-03 00:00:00

  • An end-to-end hybrid algorithm for automated medication discrepancy detection.

    abstract:BACKGROUND:In this study we implemented and developed state-of-the-art machine learning (ML) and natural language processing (NLP) technologies and built a computerized algorithm for medication reconciliation. Our specific aims are: (1) to develop a computerized algorithm for medication discrepancy detection between pa...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/s12911-015-0160-8

    authors: Li Q,Spooner SA,Kaiser M,Lingren N,Robbins J,Lingren T,Tang H,Solti I,Ni Y

    update_date:2015-05-06 00:00:00

  • Evidence in clinical reasoning: a computational linguistics analysis of 789,712 medical case summaries 1983-2012.

    abstract:BACKGROUND:Better understanding of clinical reasoning could reduce diagnostic error linked to 8% of adverse medical events and 30% of malpractice cases. To a greater extent than the evidence-based movement, the clinical reasoning literature asserts the importance of practitioner intuition—unconscious elements of diagno...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article, Meta-Analysis

    doi:10.1186/s12911-015-0136-8

    authors: Seidel BM,Campbell S,Bell E

    update_date:2015-03-21 00:00:00

  • "Assessment of the social influence and facilitating conditions that support nurses' adoption of hospital electronic information management systems (HEIMS) in Ghana using the unified theory of acceptance and use of technology (UTAUT) model".

    abstract:BACKGROUND:Hospital electronic information management systems (HEIMS) are widely used in Ghana, and hence its performance must be carefully assessed. Nurses as clinical health personnel are the largest cluster of hospital staff and are the pillar of healthcare delivery. Therefore, they play a crucial role in the adopti...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/s12911-019-0956-z

    authors: Zhou LL,Owusu-Marfo J,Asante Antwi H,Antwi MO,Kachie ADT,Ampon-Wireko S

    update_date:2019-11-21 00:00:00

  • Attitudes of pediatric intensive care unit physicians towards the use of cognitive aids: a qualitative study.

    abstract:BACKGROUND:Cognitive aids are increasingly recommended in clinical practice, yet little is known about the attitudes of physicians towards these tools. METHODS:We employed a qualitative, descriptive design to explore physician attitudes towards cognitive aids in pediatric intensive care units (PICUs). Semi-structured ...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/s12911-016-0291-6

    authors: Weiss MJ,Kramer C,Tremblay S,Côté L

    update_date:2016-05-21 00:00:00

  • Digital health system for personalised COPD long-term management.

    abstract:BACKGROUND:Recent telehealth studies have demonstrated minor impact on patients affected by long-term conditions. The use of technology does not guarantee the compliance required for sustained collection of high-quality symptom and physiological data. Remote monitoring alone is not sufficient for successful disease man...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article, Randomized Controlled Trial

    doi:10.1186/s12911-017-0414-8

    authors: Velardo C,Shah SA,Gibson O,Clifford G,Heneghan C,Rutter H,Farmer A,Tarassenko L,EDGE COPD Team.

    update_date:2017-02-20 00:00:00

  • The predictability of claim-data-based comorbidity-adjusted models could be improved by using medication data.

    abstract:BACKGROUND:Recently, claim-data-based comorbidity-adjusted methods such as the Charlson index and the Elixhauser comorbidity measures have been widely used among researchers. At the same time, there have been an increasing number of attempts to improve the predictability of comorbidity-adjusted models. We tried to impr...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/1472-6947-13-128

    authors: Bang JH,Hwang SH,Lee EJ,Kim Y

    update_date:2013-11-20 00:00:00

  • Visual aids for multimodal treatment options to support decision making of patients with colorectal cancer.

    abstract:BACKGROUND:A variety of multimodal treatment options are available for colorectal cancer and many patients want to be involved in decisions about their therapies. However, their desire for autonomy is limited by lack of disease-specific knowledge. Visual aids may be helpful tools to present complex data in an easy-to-u...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article, Randomized Controlled Trial

    doi:10.1186/1472-6947-12-118

    authors: Hofmann S,Vetter J,Wachter C,Henne-Bruns D,Porzsolt F,Kornmann M

    update_date:2012-10-23 00:00:00

  • Perception of healthcare workers on mobile app-based clinical guideline for the detection and treatment of mental health problems in primary care: a qualitative study in Nepal.

    abstract:BACKGROUND:In recent years, a significant change has taken place in the health care delivery systems due to the availability of smartphones and mobile software applications. The use of mobile technology can help to reduce a number of barriers for mental health care such as providers' workload, lack of qualified personn...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/s12911-021-01386-0

    authors: Pokhrel P,Karmacharya R,Taylor Salisbury T,Carswell K,Kohrt BA,Jordans MJD,Lempp H,Thornicroft G,Luitel NP

    update_date:2021-01-19 00:00:00

  • Quantile-based fecal hemoglobin concentration for assessing colorectal neoplasms with 1,263,717 Taiwanese screenees.

    abstract:BACKGROUND:Although fecal hemoglobin concentration (f-Hb) was highly associated with the risk of colorectal neoplasms, current studies on this subject are hampered by skewedness of the data and the ordinal property of f-Hb has not been well studied yet. Our aim was to develop a quantile-based method to estimate adjuste...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/s12911-019-0812-1

    authors: Peng SM,Chiu HM,Jen HH,Hsu CY,Chen SL,Chiu SY,Yen AM,Fann JC,Lee YC,Chen HH

    update_date:2019-05-02 00:00:00

  • Initial development of Supportive care Assessment, Prioritization and Recommendations for Kids (SPARK), a symptom screening and management application.

    abstract:BACKGROUND:We developed Supportive care Prioritization, Assessment and Recommendations for Kids (SPARK), a web-based application designed to facilitate symptom screening by children receiving cancer treatments and access to supportive care clinical practice guidelines primarily by healthcare providers. The objective wa...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/s12911-018-0715-6

    authors: Cook S,Vettese E,Soman D,Hyslop S,Kuczynski S,Spiegler B,Davis H,Duong N,Ou Wai S,Golabek R,Golabek P,Antoszek-Rallo A,Schechter T,Lee Dupuis L,Sung L

    update_date:2019-01-10 00:00:00

  • A survey of factors affecting clinician acceptance of clinical decision support.

    abstract:BACKGROUND:Real-time clinical decision support (CDS) integrated into clinicians' workflow has the potential to profoundly affect the cost, quality, and safety of health care delivery. Recent reports have identified a surprisingly low acceptance rate for different types of CDS. We hypothesized that factors affecting CDS...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/1472-6947-6-6

    authors: Sittig DF,Krall MA,Dykstra RH,Russell A,Chin HL

    update_date:2006-02-01 00:00:00

  • Predicting disease risks from highly imbalanced data using random forest.

    abstract:BACKGROUND:We present a method utilizing Healthcare Cost and Utilization Project (HCUP) dataset for predicting disease risk of individuals based on their medical diagnosis history. The presented methodology may be incorporated in a variety of applications such as risk management, tailored health communication and decis...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/1472-6947-11-51

    authors: Khalilia M,Chakraborty S,Popescu M

    update_date:2011-07-29 00:00:00

  • A usability design checklist for Mobile electronic data capturing forms: the validation process.

    abstract:BACKGROUND:New Specific Application Domain (SAD) heuristics or design principles are being developed to guide the design and evaluation of mobile applications in a bid to improve on the usability of these applications. This is because the existing heuristics are rather generic and are often unable to reveal a large num...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/s12911-018-0718-3

    authors: Mugisha A,Nankabirwa V,Tylleskär T,Babic A

    update_date:2019-01-09 00:00:00

  • Caregivers' role in using a personal electronic health record: a qualitative study of cancer patients and caregivers in Germany.

    abstract:BACKGROUND:Particularly in the context of severe diseases like cancer, many patients wish to include caregivers in the planning of treatment and care. Many caregivers like to be involved but feel insufficiently enabled. This study aimed at providing insight into patients' and caregivers' perspectives on caregivers' rol...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/s12911-020-01172-4

    authors: Weis A,Pohlmann S,Poss-Doering R,Strauss B,Ullrich C,Hofmann H,Ose D,Winkler EC,Szecsenyi J,Wensing M

    update_date:2020-07-13 00:00:00

  • Mobile phone usage in patients with type II diabetes and their intention to use it for self-management: a cross-sectional study in Iran.

    abstract:BACKGROUND:Mobile health has potential for promotion of self-management in patients with chronic diseases. This study was conducted to investigate smartphone usage in patients with type II diabetes and their intention to use it for self-management. METHODS:This cross-sectional study was conducted in 2018 with 176 pati...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/s12911-020-1038-y

    authors: Rangraz Jeddi F,Nabovati E,Hamidi R,Sharif R

    update_date:2020-02-07 00:00:00

  • Automatic schizophrenic discrimination on fNIRS by using complex brain network analysis and SVM.

    abstract:BACKGROUND:Schizophrenia is a kind of serious mental illness. Due to the lack of an objective physiological data supporting and a unified data analysis method, doctors can only rely on the subjective experience of the data to distinguish normal people and patients, which easily lead to misdiagnosis. In recent years, fu...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/s12911-017-0559-5

    authors: Song H,Chen L,Gao R,Bogdan IIM,Yang J,Wang S,Dong W,Quan W,Dang W,Yu X

    update_date:2017-12-20 00:00:00

  • Decision aids to help older people make health decisions: a systematic review and meta-analysis.

    abstract:BACKGROUND:Decision aids have been overall successful in improving the quality of health decision making. However, it is unclear whether the impact of the results of using decision aids also apply to older people (aged 65+). We sought to systematically review randomized controlled trials (RCTs) and clinical controlled ...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article, Meta-Analysis, Review

    doi:10.1186/s12911-016-0281-8

    authors: van Weert JC,van Munster BC,Sanders R,Spijker R,Hooft L,Jansen J

    update_date:2016-04-21 00:00:00

  • The anxious wait: assessing the impact of patient accessible EHRs for breast cancer patients.

    abstract:BACKGROUND:Personal health records (PHRs) provide patients with access to personal health information (PHI) and targeted education. The use of PHRs has the potential to improve a wide range of outcomes, including empowering patients to be more active participants in their care. There are a number of widespread barriers...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/1472-6947-10-46

    authors: Wiljer D,Leonard KJ,Urowitz S,Apatu E,Massey C,Quartey NK,Catton P

    update_date:2010-09-01 00:00:00

  • Investigating the satisfaction level of physicians in regards to implementing medical Picture Archiving and Communication System (PACS).

    abstract:BACKGROUND:User satisfaction with PACS is considered as one of the important criteria for assessing success in using PACS. The objective of this study was to determine the level of user satisfaction with PACS and to compare its functional features with traditional film-based systems. METHODS:This study was conducted i...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/s12911-020-01203-0

    authors: Abbasi R,Sadeqi Jabali M,Khajouei R,Tadayon H

    update_date:2020-08-05 00:00:00

  • Concordance between decision analysis and matching systematic review of randomized controlled trials in assessment of treatment comparisons: a systematic review.

    abstract:BACKGROUND:Systematic review (SR) of randomized controlled trials (RCT) is the gold standard for informing treatment choice. Decision analyses (DA) also play an important role in informing health care decisions. It is unknown how often the results of DA and matching SR of RCTs are in concordance. We assessed whether th...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article, Review

    doi:10.1186/1472-6947-14-57

    authors: Mhaskar RS,Wao H,Mahony H,Kumar A,Djulbegovic B

    update_date:2014-07-15 00:00:00

  • Implementation of informatics for integrating biology and the bedside (i2b2) platform as Docker containers.

    abstract:BACKGROUND:Informatics for Integrating Biology and the Bedside (i2b2) is an open source clinical data analytics platform used at over 200 healthcare institutions for querying patient data. The i2b2 platform has several components with numerous dependencies and configuration parameters, which renders the task of install...

    journal_title:BMC medical informatics and decision making

    pub_type: Journal Article

    doi:10.1186/s12911-018-0646-2

    authors: Wagholikar KB,Dessai P,Sanz J,Mendis ME,Bell DS,Murphy SN

    update_date:2018-07-16 00:00:00