Optimising the use of electronic health records to estimate the incidence of rheumatoid arthritis in primary care: what information is hidden in free text?

Abstract:

BACKGROUND:Primary care databases are a major source of data for epidemiological and health services research. However, most studies are based on coded information, ignoring information stored in free text. Using the early presentation of rheumatoid arthritis (RA) as an exemplar, our objective was to estimate the extent of data hidden within free text, using a keyword search. METHODS:We examined the electronic health records (EHRs) of 6,387 patients from the UK, aged 30 years and older, with a first coded diagnosis of RA between 2005 and 2008. We listed indicators for RA which were present in coded format and ran keyword searches for similar information held in free text. The frequency of indicator code groups and keywords from one year before to 14 days after RA diagnosis were compared, and temporal relationships examined. RESULTS:One or more keyword for RA was found in the free text in 29% of patients prior to the RA diagnostic code. Keywords for inflammatory arthritis diagnoses were present for 14% of patients whereas only 11% had a diagnostic code. Codes for synovitis were found in 3% of patients, but keywords were identified in an additional 17%. In 13% of patients there was evidence of a positive rheumatoid factor test in text only, uncoded. No gender differences were found. Keywords generally occurred close in time to the coded diagnosis of rheumatoid arthritis. They were often found under codes indicating letters and communications. CONCLUSIONS:Potential cases may be missed or wrongly dated when coded data alone are used to identify patients with RA, as diagnostic suspicions are frequently confined to text. The use of EHRs to create disease registers or assess quality of care will be misleading if free text information is not taken into account. Methods to facilitate the automated processing of text need to be developed and implemented.

journal_name

BMC Med Res Methodol

authors

Ford E,Nicholson A,Koeling R,Tate A,Carroll J,Axelrod L,Smith HE,Rait G,Davies KA,Petersen I,Williams T,Cassell JA

doi

10.1186/1471-2288-13-105

subject

Has Abstract

pub_date

2013-08-21 00:00:00

pages

105

issn

1471-2288

pii

1471-2288-13-105

journal_volume

13

pub_type

杂志文章
  • Item response models for the longitudinal analysis of health-related quality of life in cancer clinical trials.

    abstract:BACKGROUND:The use of health-related quality of life (HRQoL) as an endpoint in cancer clinical trials is growing rapidly. Hence, research into the statistical approaches used to analyze HRQoL data is of major importance, and could lead to a better understanding of the impact of treatments on the everyday life and care ...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章,评审

    doi:10.1186/s12874-017-0410-9

    authors: Barbieri A,Peyhardi J,Conroy T,Gourgou S,Lavergne C,Mollevi C

    更新日期:2017-09-26 00:00:00

  • Validation of death prediction after breast cancer relapses using joint models.

    abstract:BACKGROUND:Cancer relapses may be useful to predict the risk of death. To take into account relapse information, the Landmark approach is popular. As an alternative, we propose the joint frailty model for a recurrent event and a terminal event to derive dynamic predictions of the risk of death. METHODS:The proposed pr...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/s12874-015-0018-x

    authors: Mauguen A,Rachet B,Mathoulin-Pélissier S,Lawrence GM,Siesling S,MacGrogan G,Laurent A,Rondeau V

    更新日期:2015-04-01 00:00:00

  • Sample size calculation in multi-centre clinical trials.

    abstract:BACKGROUND:Multi-centre randomized controlled clinical trials play an important role in modern evidence-based medicine. Advantages of collecting data from more than one site are numerous, including accelerated recruitment and increased generalisability of results. Mixed models can be applied to account for potential cl...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/s12874-018-0602-y

    authors: Harden M,Friede T

    更新日期:2018-11-29 00:00:00

  • Validity of information on atopic disease and other illness in young children reported by parents in a prospective birth cohort study.

    abstract:BACKGROUND:The longitudinal birth cohort study is the preferred design for studies of childhood health, particularly atopic disease. Still, prospective data collection depends on recollection of the medical history since the previous visit representing a potential recall-bias. We aimed to ascertain the quality of infor...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/1471-2288-12-160

    authors: Vissing NH,Jensen SM,Bisgaard H

    更新日期:2012-10-22 00:00:00

  • The thresholds for statistical and clinical significance - a five-step procedure for evaluation of intervention effects in randomised clinical trials.

    abstract:BACKGROUND:Thresholds for statistical significance are insufficiently demonstrated by 95% confidence intervals or P-values when assessing results from randomised clinical trials. First, a P-value only shows the probability of getting a result assuming that the null hypothesis is true and does not reflect the probabilit...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/1471-2288-14-34

    authors: Jakobsen JC,Gluud C,Winkel P,Lange T,Wetterslev J

    更新日期:2014-03-04 00:00:00

  • Dealing with missing data in a multi-question depression scale: a comparison of imputation methods.

    abstract:BACKGROUND:Missing data present a challenge to many research projects. The problem is often pronounced in studies utilizing self-report scales, and literature addressing different strategies for dealing with missing data in such circumstances is scarce. The objective of this study was to compare six different imputatio...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/1471-2288-6-57

    authors: Shrive FM,Stuart H,Quan H,Ghali WA

    更新日期:2006-12-13 00:00:00

  • Heterogeneity and event dependence in the analysis of sickness absence.

    abstract:BACKGROUND:Sickness absence (SA) is an important social, economic and public health issue. Identifying and understanding the determinants, whether biological, regulatory or, health services-related, of variability in SA duration is essential for better management of SA. The conditional frailty model (CFM) is useful whe...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/1471-2288-13-114

    authors: Torá-Rocamora I,Gimeno D,Delclos G,Benavides FG,Manzanera R,Jardí J,Alberti C,Yasui Y,Martínez JM

    更新日期:2013-09-16 00:00:00

  • A probit- log- skew-normal mixture model for repeated measures data with excess zeros, with application to a cohort study of paediatric respiratory symptoms.

    abstract:BACKGROUND:A zero-inflated continuous outcome is characterized by occurrence of "excess" zeros that more than a single distribution can explain, with the positive observations forming a skewed distribution. Mixture models are employed for regression analysis of zero-inflated data. Moreover, for repeated measures zero-i...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/1471-2288-10-55

    authors: Mahmud S,Lou WW,Johnston NW

    更新日期:2010-06-14 00:00:00

  • Imputation by the mean score should be avoided when validating a Patient Reported Outcomes questionnaire by a Rasch model in presence of informative missing data.

    abstract:BACKGROUND:Nowadays, more and more clinical scales consisting in responses given by the patients to some items (Patient Reported Outcomes - PRO), are validated with models based on Item Response Theory, and more specifically, with a Rasch model. In the validation sample, presence of missing data is frequent. The aim of...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/1471-2288-11-105

    authors: Hardouin JB,Conroy R,Sébille V

    更新日期:2011-07-14 00:00:00

  • Optimal likelihood-ratio multiple testing with application to Alzheimer's disease and questionable dementia.

    abstract:BACKGROUND:Controlling the false discovery rate is important when testing multiple hypotheses. To enhance the detection capability of a false discovery rate control test, we applied the likelihood ratio-based multiple testing method in neuroimage data and compared the performance with the existing methods. METHODS:We ...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/1471-2288-15-9

    authors: Lee D,Kang H,Kim E,Lee H,Kim H,Kim YK,Lee Y,Lee DS

    更新日期:2015-01-30 00:00:00

  • Impact of preconception enrollment on birth enrollment and timing of exposure assessment in the initial vanguard cohort of the U.S. National Children's Study.

    abstract:BACKGROUND:The initial vanguard cohort of the U.S. National Children's Study was a pregnancy and birth cohort study that sought to enroll some women prior to pregnancy, and to assess exposures early in pregnancy. METHODS:During the recruitment phase (2009-2010), geographically based sampling was used to recruit women ...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/s12874-015-0067-1

    authors: Stanford JB,Brenner R,Fetterer D,Palmer L,Schoendorf KC,U.S. National Children’s Study.

    更新日期:2015-09-24 00:00:00

  • Developing the clinical components of a complex intervention for a glaucoma screening trial: a mixed methods study.

    abstract:BACKGROUND:Glaucoma is a leading cause of avoidable blindness worldwide. Open angle glaucoma is the most common type of glaucoma. No randomised controlled trials have been conducted evaluating the effectiveness of glaucoma screening for reducing sight loss. It is unclear what the most appropriate intervention to be eva...

    journal_title:BMC medical research methodology

    pub_type: 临床试验,杂志文章

    doi:10.1186/1471-2288-11-54

    authors: Glaucoma screening Platform Study group.,Burr JM,Campbell MK,Campbell SE,Francis JJ,Greene A,Hernández R,Hopkins D,McCann SK,Vale LD

    更新日期:2011-04-21 00:00:00

  • Sample size calculations for cluster randomised controlled trials with a fixed number of clusters.

    abstract:BACKGROUND:Cluster randomised controlled trials (CRCTs) are frequently used in health service evaluation. Assuming an average cluster size, required sample sizes are readily computed for both binary and continuous outcomes, by estimating a design effect or inflation factor. However, where the number of clusters are fix...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/1471-2288-11-102

    authors: Hemming K,Girling AJ,Sitch AJ,Marsh J,Lilford RJ

    更新日期:2011-06-30 00:00:00

  • Assembling and validating a heart failure-free cohort from the Reasons for Geographic and Racial Differences in Stroke (REGARDS) study.

    abstract:BACKGROUND:Studies examining incident heart failure (HF) have been limited to select populations. To examine incident HF with broader generalizability, there is need to assemble a HF-free cohort using a geographically-diverse sample. We aimed to develop and validate a simple medication-based strategy for assembling a H...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/s12874-019-0890-x

    authors: Goyal P,Mefford MT,Chen L,Sterling MR,Durant RW,Safford MM,Levitan EB

    更新日期:2020-03-04 00:00:00

  • Network-meta analysis made easy: detection of inconsistency using factorial analysis-of-variance models.

    abstract:BACKGROUND:Network meta-analysis can be used to combine results from several randomized trials involving more than two treatments. Potential inconsistency among different types of trial (designs) differing in the set of treatments tested is a major challenge, and application of procedures for detecting and locating inc...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/1471-2288-14-61

    authors: Piepho HP

    更新日期:2014-05-10 00:00:00

  • Blood spots as an alternative to whole blood collection and the effect of a small monetary incentive to increase participation in genetic association studies.

    abstract:BACKGROUND:Collection of buccal cells from saliva for DNA extraction offers a less invasive and convenient alternative to venipuncture blood collection that may increase participation in genetic epidemiologic studies. However, dried blood spot collection, which is also a convenient method, offers a means of collecting ...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/1471-2288-9-76

    authors: Bhatti P,Kampa D,Alexander BH,McClure C,Ringer D,Doody MM,Sigurdson AJ

    更新日期:2009-11-13 00:00:00

  • The role of the clinical research coordinator--data manager--in oncology clinical trials.

    abstract:BACKGROUND:The purpose of this study was to determine the standard tasks performed by clinical research coordinators (CRCs) in oncology clinical trials. METHODS:Forty-one CRCs were anonymously surveyed, using a four-page self-administered questionnaire focused on demographics, qualifications, and professional experien...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/1471-2288-4-6

    authors: Rico-Villademoros F,Hernando T,Sanz JL,López-Alonso A,Salamanca O,Camps C,Rosell R

    更新日期:2004-03-25 00:00:00

  • Effort, reward and self-reported mental health: a simulation study on negative affectivity bias.

    abstract:BACKGROUND:In the present article, we propose an alternative method for dealing with negative affectivity (NA) biases in research, while investigating the association between a deleterious psychosocial environment at work and poor mental health. First, we investigated how strong NA must be to cause an observed correlat...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/1471-2288-11-121

    authors: Arial M,Wild P

    更新日期:2011-08-24 00:00:00

  • A mixed methods case study investigating how randomised controlled trials (RCTs) are reported, understood and interpreted in practice.

    abstract:BACKGROUND:While randomised controlled trials (RCTs) provide high-quality evidence to guide practice, much routine care is not based upon available RCTs. This disconnect between evidence and practice is not sufficiently well understood. This case study explores this relationship using a novel approach. Better understan...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/s12874-020-01009-8

    authors: Byrne BE,Rooshenas L,Lambert HS,Blazeby JM

    更新日期:2020-05-12 00:00:00

  • The challenges of recruiting cancer patient/caregiver dyads: informing randomized controlled trials.

    abstract:BACKGROUND:Family members are increasingly involved in the care of cancer patients, however many are not prepared for this challenging role. Intervention-based studies are valuable to inform the most appropriate and effective support for caregivers. Barriers in the recruitment of patient/caregiver dyads exist but the r...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章,多中心研究,随机对照试验

    doi:10.1186/s12874-018-0614-7

    authors: Heckel L,Gunn KM,Livingston PM

    更新日期:2018-11-21 00:00:00

  • Applying an intersectionality lens to the theoretical domains framework: a tool for thinking about how intersecting social identities and structures of power influence behaviour.

    abstract:BACKGROUND:A key component of the implementation process is identifying potential barriers and facilitators that need to be addressed. The Theoretical Domains Framework (TDF) is one of the most commonly used frameworks for this purpose. When applying the TDF, it is critical to understand the context in which behaviours...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/s12874-020-01056-1

    authors: Etherington N,Rodrigues IB,Giangregorio L,Graham ID,Hoens AM,Kasperavicius D,Kelly C,Moore JE,Ponzano M,Presseau J,Sibley KM,Straus S

    更新日期:2020-06-26 00:00:00

  • Reliability of the modified child and adolescent physical activity and nutrition survey, physical activity (CAPANS-PA) questionnaire among Chinese-Australian youth.

    abstract:BACKGROUND:Evidence suggests that differences exist in physical activity (PA) participation among Culturally and Linguistically Diverse (CALD) children and adolescents. It is possible that these differences could be influenced by variations in measurement technique and instrument reliability. However, culturally sensit...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/1471-2288-11-122

    authors: Strugnell C,Renzaho A,Ridley K,Burns C

    更新日期:2011-08-25 00:00:00

  • Eliciting parental support for the use of newborn blood spots for pediatric research.

    abstract:BACKGROUND:Biomarkers of exposures such as infection or environmental chemicals can be measured in small volumes of blood extracted from newborn dried blood spots (DBS) underscoring their potential utility for population-based research. However, few studies have evaluated the feasibility and utility of this resource; p...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/s12874-016-0120-8

    authors: Yeung EH,Louis GB,Lawrence D,Kannan K,McLain AC,Caggana M,Druschel C,Bell E

    更新日期:2016-02-04 00:00:00

  • Pan-Canadian assessment of pandemic immunization data collection: study methodology.

    abstract:BACKGROUND:The collection of individual-level pandemic (H1N1) 2009 influenza immunization data was considered important to facilitate optimal vaccine delivery and accurate assessment of vaccine coverage. These data are also critical for research aimed at evaluating the new vaccine's safety and effectiveness. Systems us...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/1471-2288-10-51

    authors: Pereira JA,Quach S,Heidebrecht C,Foisy J,Quan S,Finkelstein M,Sikora CA,Bettinger JA,Buckeridge DL,McCarthy A,Deeks S,Kwong JC,Public Health Agency of Canada\/Canadian Institutes of Health Research Influenza Research Network Vacc

    更新日期:2010-06-08 00:00:00

  • Consensus workshops on the development of an ADHD medication management protocol using QbTest: developing a clinical trial protocol with multidisciplinary stakeholders.

    abstract:BACKGROUND:The study design and protocol that underpin a randomised controlled trial (RCT) are critical for the ultimate success of the trial. Although RCTs are considered the gold standard for research, there are multiple threats to their validity such as participant recruitment and retention, identifying a meaningful...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/s12874-019-0772-2

    authors: Hall CL,Brown S,James M,Martin JL,Brown N,Selby K,Clarke J,Williams L,Sayal K,Hollis C,Groom MJ

    更新日期:2019-06-18 00:00:00

  • Recruitment of adolescents with suicidal ideation in the emergency department: lessons from a randomized controlled pilot trial of a youth suicide prevention intervention.

    abstract:BACKGROUND:Emergency Departments (EDs) are a first point-of-contact for many youth with mental health and suicidality concerns and can serve as an effective recruitment source for randomized controlled trials (RCTs) of mental health interventions. However, recruitment in acute care settings is impeded by several challe...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/s12874-020-01117-5

    authors: Tracey M,Finkelstein Y,Schachter R,Cleverley K,Monga S,Barwick M,Szatmari P,Moretti ME,Willan A,Henderson J,Korczak DJ

    更新日期:2020-09-14 00:00:00

  • Longitudinal studies that use data collected as part of usual care risk reporting biased results: a systematic review.

    abstract:BACKGROUND:Longitudinal studies using data collected as part of usual care risk providing biased results if visit times are related to the outcome of interest. Statistical methods for mitigating this bias are available but rarely used. This lack of use could be attributed to a lack of need or to a lack of awareness of ...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章,评审

    doi:10.1186/s12874-017-0418-1

    authors: Farzanfar D,Abumuamar A,Kim J,Sirotich E,Wang Y,Pullenayegum E

    更新日期:2017-09-06 00:00:00

  • Self-reported measures in health research for people with intellectual disabilities: an inclusive pilot study on suitability and reliability.

    abstract:BACKGROUND:The lack of suitable and reliable scales to measure self-reported health and health behaviour among people with intellectual disabilities (ID) is an important methodological challenge in health research. This study, which was undertaken together with co-researchers with ID, explores possibilities for self-re...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/s12874-018-0539-1

    authors: Vlot-van Anrooij K,Tobi H,Hilgenkamp TIM,Leusink GL,Naaldenberg J

    更新日期:2018-07-16 00:00:00

  • Psychometric analysis of the brief symptom inventory 18 (BSI-18) in a representative German sample.

    abstract:BACKGROUND:The BSI-18 contains the three six-item scales somatization, depression, and anxiety as well as the Global Severity Index (GSI), including all 18 items. The BSI-18 is the latest and shortest of the multidimensional versions of the Symptom-Checklist 90-R, but its psychometric properties have not been sufficien...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/s12874-016-0283-3

    authors: Franke GH,Jaeger S,Glaesmer H,Barkmann C,Petrowski K,Braehler E

    更新日期:2017-01-26 00:00:00

  • Determinants of non- response to a second assessment of lifestyle factors and body weight in the EPIC-PANACEA study.

    abstract:BACKGROUND:This paper discusses whether baseline demographic, socio-economic, health variables, length of follow-up and method of contacting the participants predict non-response to the invitation for a second assessment of lifestyle factors and body weight in the European multi-center EPIC-PANACEA study. METHODS:Over...

    journal_title:BMC medical research methodology

    pub_type: 杂志文章

    doi:10.1186/1471-2288-12-148

    authors: May AM,Adema LE,Romaguera D,Vergnaud AC,Agudo A,Ekelund U,Steffen A,Orfanos P,Slimani N,Rinaldi S,Mouw T,Rohrmann S,Hermann S,Boeing H,Bergmann MM,Jakobsen MU,Overvad K,Wareham NJ,Gonzalez C,Tjonneland A,Halkjaer

    更新日期:2012-09-24 00:00:00