Abstract:
BACKGROUND:Data mining can be utilized to automate analysis of substantial amounts of data produced in many organizations. However, data mining produces large numbers of rules and patterns, many of which are not useful. Existing methods for pruning uninteresting patterns have only begun to automate the knowledge acquisition step (which is required for subjective measures of interestingness), hence leaving a serious bottleneck. In this paper we propose a method for automatically acquiring knowledge to shorten the pattern list by locating the novel and interesting ones. METHODS:The dual-mining method is based on automatically comparing the strength of patterns mined from a database with the strength of equivalent patterns mined from a relevant knowledgebase. When these two estimates of pattern strength do not match, a high "surprise score" is assigned to the pattern, identifying the pattern as potentially interesting. The surprise score captures the degree of novelty or interestingness of the mined pattern. In addition, we show how to compute p values for each surprise score, thus filtering out noise and attaching statistical significance. RESULTS:We have implemented the dual-mining method using scripts written in Perl and R. We applied the method to a large patient database and a biomedical literature citation knowledgebase. The system estimated association scores for 50,000 patterns, composed of disease entities and lab results, by querying the database and the knowledgebase. It then computed the surprise scores by comparing the pairs of association scores. Finally, the system estimated statistical significance of the scores. CONCLUSION:The dual-mining method eliminates more than 90% of patterns with strong associations, thus identifying them as uninteresting. We found that the pruning of patterns using the surprise score matched the biomedical evidence in the 100 cases that were examined by hand. The method automates the acquisition of knowledge, thus reducing dependence on the knowledge elicited from human expert, which is usually a rate-limiting step.
journal_name
BMC Med Inform Decis Makjournal_title
BMC medical informatics and decision makingauthors
Siadaty MS,Knaus WAdoi
10.1186/1472-6947-6-13keywords:
subject
Has Abstractpub_date
2006-03-07 00:00:00pages
13issn
1472-6947pii
1472-6947-6-13journal_volume
6pub_type
杂志文章abstract:BACKGROUND:The objective of this study was to ascertain the performance of syndromic algorithms for the early detection of patients in healthcare facilities who have potentially transmissible infectious diseases, using computerised emergency department (ED) data. METHODS:A retrospective cohort in an 810-bed University...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/1472-6947-13-101
更新日期:2013-09-03 00:00:00
abstract:BACKGROUND:This paper presents a conditional random fields (CRF) method that enables the capture of specific high-order label transition factors to improve clinical named entity recognition performance. Consecutive clinical entities in a sentence are usually separated from each other, and the textual descriptions in cl...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/s12911-019-0865-1
更新日期:2019-07-15 00:00:00
abstract:BACKGROUND:Systematic review (SR) of randomized controlled trials (RCT) is the gold standard for informing treatment choice. Decision analyses (DA) also play an important role in informing health care decisions. It is unknown how often the results of DA and matching SR of RCTs are in concordance. We assessed whether th...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章,评审
doi:10.1186/1472-6947-14-57
更新日期:2014-07-15 00:00:00
abstract:BACKGROUND:Using Monte Carlo simulations, we compare different methods (maximizing Youden index, maximizing mutual information, and logistic regression) for their ability to determine optimum binary cut-off thresholds for a ratio-scaled diagnostic test variable. Special attention is given to the stability and precision...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/s12911-014-0099-1
更新日期:2014-11-25 00:00:00
abstract:BACKGROUND:Approximately 20% of deaths in the US each year are attributable to smoking, yet current practices in the recording of this health risk in electronic health records (EHRs) have not led to discernable changes in health outcomes. Several groups have developed algorithms for extracting smoking behaviors from cl...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/s12911-019-0864-2
更新日期:2019-07-25 00:00:00
abstract:BACKGROUND:Access to and use of digital technology are more common among people of more advantaged socioeconomic status. These differences might be due to lack of interest, not having physical access or having lower intentions to use this technology. By integrating the digital divide approach and the User Acceptance of...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/s12911-020-01383-9
更新日期:2021-01-13 00:00:00
abstract:BACKGROUND:New Specific Application Domain (SAD) heuristics or design principles are being developed to guide the design and evaluation of mobile applications in a bid to improve on the usability of these applications. This is because the existing heuristics are rather generic and are often unable to reveal a large num...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/s12911-018-0718-3
更新日期:2019-01-09 00:00:00
abstract:BACKGROUND:Microcontact datasets gathered automatically by electronic devices have the potential augment the study of the spread of contagious disease by providing detailed representations of the study population's contact dynamics. However, the impact of data collection experimental design on the subsequent simulation...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/1472-6947-12-132
更新日期:2012-11-15 00:00:00
abstract:BACKGROUND:Informational discontinuity can have far reaching consequences like medical errors, increased re-hospitalization rates and adverse events among others. Thus the holy grail of seamless informational continuity in healthcare has been an enigma with some nations going the digital way. Digitization in healthcare...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/s12911-020-01190-2
更新日期:2020-07-28 00:00:00
abstract:BACKGROUND:Comparing outcomes between hospitals requires consideration of patient factors that could account for any observed differences. Adjusting for comorbid conditions is common when studying outcomes following cancer surgery, and a commonly used measure is the Charlson comorbidity index. Other measures of patient...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/s12911-015-0175-1
更新日期:2015-07-15 00:00:00
abstract:BACKGROUND:National authorities have to follow the evolution of diabetes to implement public health policies. An algorithm was developed to identify patients with treated type 2 diabetes and estimate its annual prevalence in Luxembourg using health insurance claims when no diagnosis code is available. METHODS:The DIAB...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/1472-6947-11-23
更新日期:2011-04-14 00:00:00
abstract:BACKGROUND:Problems may arise during the approval process of treatment after a compensable work injury, which include excess paperwork, delays in approving services, disputes, and allegations of over-servicing. This is perceived as undesirable for injured people, health care professionals and claims managers, and costl...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/s12911-017-0460-2
更新日期:2017-05-22 00:00:00
abstract:BACKGROUND:Percutaneous coronary intervention (PCI) is the most commonly performed treatment for coronary atherosclerosis. It is associated with a higher incidence of repeat revascularization procedures compared to coronary artery bypass grafting surgery. Recent results indicate that PCI is only cost-effective for a su...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/s12911-015-0131-0
更新日期:2015-02-14 00:00:00
abstract:BACKGROUND:Following the completion of treatment and as they enter the follow-up phase, breast cancer patients (BCPs) often recount feeling 'lost in transition', and are left with many questions concerning how their ongoing care and monitoring for recurrence will be managed. Family physicians (FPs) also frequently repo...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/1472-6947-13-76
更新日期:2013-07-25 00:00:00
abstract:BACKGROUND:Schizophrenia is a kind of serious mental illness. Due to the lack of an objective physiological data supporting and a unified data analysis method, doctors can only rely on the subjective experience of the data to distinguish normal people and patients, which easily lead to misdiagnosis. In recent years, fu...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/s12911-017-0559-5
更新日期:2017-12-20 00:00:00
abstract::To answer the need for the rigorous protection of biomedical data, we organized the Critical Assessment of Data Privacy and Protection initiative as a community effort to evaluate privacy-preserving dissemination techniques for biomedical data. We focused on the challenge of sharing aggregate human genomic data (e.g.,...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/1472-6947-14-S1-S1
更新日期:2014-01-01 00:00:00
abstract:BACKGROUND:Automated machine-learning systems are able to de-identify electronic medical records, including free-text clinical notes. Use of such systems would greatly boost the amount of data available to researchers, yet their deployment has been limited due to uncertainty about their performance when applied to new ...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/s12911-020-1026-2
更新日期:2020-01-30 00:00:00
abstract:BACKGROUND:Evidence-based information available at the point of care improves patient care outcomes. Online knowledge bases can increase the application of evidence-based medicine and influence patient outcome data which may be captured in quality registries. The aim of this study was to explore the effect of use of an...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/s12911-020-01313-9
更新日期:2020-11-16 00:00:00
abstract:BACKGROUND:Patients with no history of stroke but with stenosis of the carotid arteries can reduce the risk of future stroke with surgery or stenting. At present, a physicians' ability to recommend optimal treatments based on an individual's risk profile requires estimating the likelihood that a patient will have a poo...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/s12911-015-0141-y
更新日期:2015-03-24 00:00:00
abstract:BACKGROUND:Shared decision-making (SDM) is considered a key component of high quality cancer care and may be supported by patient decision aids (PtDAs). Many patients, however, face multiple social disadvantages that may influence their ability to fully participate in SDM or to use PtDAs; additionally, these social dis...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/s12911-016-0303-6
更新日期:2016-06-06 00:00:00
abstract:BACKGROUND:The global age-adjusted mortality rate related to atrial fibrillation (AF) registered a rapid growth in the last four decades, i.e., from 0.8 to 1.6 and 0.9 to 1.7 per 100,000 for men and women during 1990-2010, respectively. In this context, this study uses convolutional neural networks for classifying (dia...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/s12911-019-0946-1
更新日期:2019-10-29 00:00:00
abstract:BACKGROUND:Despite advances in diagnosis and treatment of type 2 diabetes, suboptimal metabolic control persists. Patient education in diabetes has been proved to enhance self-efficacy and guideline-driven treatment, however many people with type 2 diabetes do not have access to or do not participate in self-management...
journal_title:BMC medical informatics and decision making
pub_type: 临床试验,杂志文章
doi:10.1186/s12911-016-0383-3
更新日期:2016-11-09 00:00:00
abstract:BACKGROUND:The Personal Patient Profile-Prostate (P3P), a web-based decision aid, was demonstrated to reduce decisional conflict in English-speaking men with localized prostate cancer early after initial diagnosis. The purpose of this study was to explore and enhance usability and cultural appropriateness of a Spanish ...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/s12911-015-0180-4
更新日期:2015-07-24 00:00:00
abstract:BACKGROUND:Methods for linking real-world healthcare data often use a latent class model, where the latent, or unknown, class is the true match status of candidate record-pairs. This commonly used model assumes that agreement patterns among multiple fields within a latent class are independent. When this assumption is ...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/1472-6947-13-97
更新日期:2013-08-30 00:00:00
abstract:BACKGROUND:The chronic kidney disease (CKD) is a worldwide critical problem, especially in developing countries. CKD patients usually begin their treatment in advanced stages, which requires dialysis and kidney transplantation, and consequently, affects mortality rates. This issue is faced by a mobile health (mHealth) ...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/s12911-018-0587-9
更新日期:2018-01-12 00:00:00
abstract:BACKGROUND:Diagnosis of neuromuscular diseases in primary care is often challenging. Rare diseases such as Pompe disease are easily overlooked by the general practitioner. We therefore aimed to develop a diagnostic support tool using patient-oriented questions and combined data mining algorithms recognizing answer patt...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章,多中心研究
doi:10.1186/s12911-016-0268-5
更新日期:2016-03-08 00:00:00
abstract:BACKGROUND:A major problem patients encounter when reading about health related issues is document interpretation, which limits reading comprehension and therefore negatively impacts health care. Currently, searching for medical definitions from an external source is time consuming, distracting, and negatively impacts ...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/1472-6947-11-4
更新日期:2011-01-25 00:00:00
abstract:BACKGROUND:Mathematical models can be used to predict individual growth responses to growth hormone (GH) therapy. The aim of this study was to construct and validate high-precision models to predict the growth response to GH treatment of short children, independent of their GH status, birth size and gestational age. As...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/1472-6947-7-40
更新日期:2007-12-12 00:00:00
abstract:BACKGROUND:With the character of high incidence, high prevalence and high mortality, stroke has brought a heavy burden to families and society in China. In 2009, the Ministry of Health of China launched the China national stroke screening and intervention program, which screens stroke and its risk factors and conducts ...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/s12911-019-0998-2
更新日期:2019-12-10 00:00:00
abstract:BACKGROUND:Functional Magnetic Resonance Imaging (fMRI) has been proven to be useful for studying brain functions. However, due to the existence of noise and distortion, mapping between the fMRI signal and the actual neural activity is difficult. Because of the difficulty, differential pattern analysis of fMRI brain im...
journal_title:BMC medical informatics and decision making
pub_type: 杂志文章
doi:10.1186/1472-6947-9-S1-S6
更新日期:2009-11-03 00:00:00